NCOHA-WSC: Handling Noise and Class Overlap in Web Service Classification

Authors

  • Feng Zhang, School of Computer Science and Engineering, Shandong University of Science and Technology, China, https://orcid.org/0000-0003-2646-9854
  • Lin Xue, School of Computer Science and Engineering, Shandong University of Science and Technology, China
  • Huiling Li, School of Computer Science and Technology, Shandong University of Technology, China, https://orcid.org/0000-0001-8106-6340
  • Cong Liu, NOVA Information Management School, Nova University of Lisbon, Portugal, https://orcid.org/0000-0002-2665-7153

DOI:

https://doi.org/10.47852/bonviewJCCE62028810

Keywords:

Web service classification, noise handling, class overlap, confidence learning, prior distribution

Abstract

With the rapid growth in the number of Web services, accurate and efficient Web service classification has become crucial for improving the quality of service discovery. However, existing classification approaches often overlook the noise and class overlap inherent in Web service data, which degrades classification precision. To address these challenges, this paper proposes NCOHA-WSC, a Web service classification approach that handles both noise and class overlap and can be easily integrated with existing machine learning–based service classification models. Specifically, during preprocessing, noisy samples in the training data are filtered using confidence learning and information entropy, which reduces the negative impact of noise on the classification model. In addition, during the testing phase, the prediction results for overlapping services are corrected based on the label prior distribution, further improving classification precision. Experiments on the real-world ProgrammableWeb dataset demonstrate that NCOHA-WSC is compatible with mainstream Web service classification models and improves, to varying degrees, the Macro-F1 of models such as ServeNet and CARL-Net. These results indicate that the proposed approach effectively mitigates the impact of noisy data on Web service classification and improves the precision of existing models in the presence of overlapping service classes.
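The two steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the filtering rule, the overlap criterion, or any parameter values, so everything below is an assumption made for illustration. In particular, the per-class mean self-confidence threshold, the `max_entropy` and `margin` parameters, and the function names `filter_noisy`, `label_prior`, and `correct_with_prior` are hypothetical. The input `probs` is assumed to hold out-of-sample class probabilities produced by some base classifier.

```python
import math
from collections import Counter


def entropy(p):
    """Shannon entropy (natural log) of one probability vector."""
    return -sum(x * math.log(x) for x in p if x > 0)


def filter_noisy(probs, labels, max_entropy=0.5):
    """Return the indices of training samples kept after noise filtering.

    Assumed rule (not taken from the paper): a sample is flagged as noisy
    when the probability of its given label is below that class's mean
    self-confidence, the model predicts a different class, and the
    prediction is low-entropy (the model is confident).
    """
    n_classes = len(probs[0])
    sums = [0.0] * n_classes
    counts = [0] * n_classes
    for p, y in zip(probs, labels):
        sums[y] += p[y]
        counts[y] += 1
    # Per-class threshold: mean probability of class c over samples labelled c.
    thresholds = [s / c if c else 0.0 for s, c in zip(sums, counts)]

    kept = []
    for i, (p, y) in enumerate(zip(probs, labels)):
        predicted = max(range(n_classes), key=p.__getitem__)
        noisy = (
            p[y] < thresholds[y]
            and predicted != y
            and entropy(p) <= max_entropy
        )
        if not noisy:
            kept.append(i)
    return kept


def label_prior(labels, n_classes):
    """Empirical label distribution of the (filtered) training set."""
    counts = Counter(labels)
    return [counts.get(c, 0) / len(labels) for c in range(n_classes)]


def correct_with_prior(p, prior, margin=0.1):
    """Re-rank an ambiguous prediction using the label prior.

    Assumed overlap criterion (hypothetical): the top two class
    probabilities differ by less than `margin`. Requires >= 2 classes.
    """
    ranked = sorted(range(len(p)), key=p.__getitem__, reverse=True)
    top, second = ranked[0], ranked[1]
    if p[top] - p[second] >= margin:
        return top  # clear prediction, left unchanged
    # Ambiguous case: weight the two candidates by their prior probability.
    return max((top, second), key=lambda c: p[c] * prior[c])
```

With `probs = [[0.9, 0.1], [0.8, 0.2], [0.05, 0.95], [0.2, 0.8]]` and `labels = [0, 0, 0, 1]`, the third sample is labelled 0 but confidently predicted as class 1, so `filter_noisy` drops it and keeps the other three. Any classifier that outputs class probabilities could supply `probs`, which is in the spirit of the abstract's claim that the approach integrates with existing models.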



Received: 16 December 2025 | Revised: 12 February 2026 | Accepted: 3 April 2026



Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.



Data Availability Statement

The data that support the findings of this study are openly available on GitHub at https://github.com/HIT-ICES/Correted-ProgrammableWeb-dataset.git.



Author Contribution Statement

Feng Zhang: Writing – original draft, Writing – review & editing, Visualization, Project administration, Formal analysis. Lin Xue: Writing – review & editing, Software, Validation, Project administration. Huiling Li: Writing – original draft, Writing – review & editing, Methodology, Investigation. Cong Liu: Writing – review & editing, Conceptualization, Resources, Data curation, Supervision, Funding acquisition.

Published

2026-05-12

Section

Research Articles

How to Cite

Zhang, F., Xue, L., Li, H., & Liu, C. (2026). NCOHA-WSC: Handling Noise and Class Overlap in Web Service Classification. Journal of Computational and Cognitive Engineering. https://doi.org/10.47852/bonviewJCCE62028810
