A testing data validity assessment method and testing data validation platform based on SOA

Zhang, Beige; Li, Chun; Shah, Nazaraf; Fei, Xiang; Jiang, Lihong; Cai, Hongming

doi:10.1007/s11761-018-0242-4

Special Issue Paper
Published: 27 September 2018

Volume 12, pages 201–209 (2018)
Service Oriented Computing and Applications

Beige Zhang¹, Chun Li², Nazaraf Shah³, Xiang Fei³, Lihong Jiang¹ & Hongming Cai¹ (ORCID: orcid.org/0000-0003-0190-6907)

Abstract

In modern manufacturing, both product manufacturers and component suppliers place high value on the quality of component testing data. However, common component quality analysis processes assume that testing data are valid, which may not be true, so assessing the validity of component testing data is important. Many existing data analysis platforms are separate from enterprises’ own systems, which leaves inspection data analysis disconnected from their business processes. In this paper, we propose a testing data quality assessment method and a testing data validation platform based on SOA. The platform provides a reliable third-party testing data validation service via RESTful APIs, so that the service can be seamlessly integrated into enterprise systems. The testing data validity assessment method, which is the core of the platform, works by detecting illegal behavior in data recording. The detection combines behavior analysis with a positive and unlabeled learning process.
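
To make the idea of positive and unlabeled (PU) learning concrete, the following is a minimal sketch of one well-known PU approach, the correction described by Elkan and Noto (2008). It is not the method of this paper, whose details are not given in the abstract. The sketch assumes a single hypothetical numeric "behavior score" per record, synthetic data, and a simplification in which the label frequency c is estimated on the training set rather than on a held-out set. All names (`fit_logistic`, `demo`, `label_rate`) are illustrative.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(xs, ss, epochs=300, lr=0.5):
    """Fit g(x) ~ p(s=1 | x) on one feature by batch gradient descent.

    s = 1 means "labeled positive"; s = 0 means "unlabeled".
    """
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, s in zip(xs, ss):
            err = sigmoid(w * x + b) - s
            grad_w += err * x
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


def demo(seed=0, label_rate=0.5):
    """Run the PU correction on synthetic data (an assumption, not real test data)."""
    rng = random.Random(seed)
    xs, ss = [], []
    # 200 truly positive records (score near +2); only some of them get a label.
    for _ in range(200):
        xs.append(rng.gauss(2.0, 1.0))
        ss.append(1 if rng.random() < label_rate else 0)
    # 200 truly negative records (score near -2); none of them is ever labeled.
    for _ in range(200):
        xs.append(rng.gauss(-2.0, 1.0))
        ss.append(0)

    # Step 1: train a classifier to separate labeled from unlabeled records.
    w, b = fit_logistic(xs, ss)

    # Step 2: estimate c = p(s=1 | y=1) as the mean score over labeled positives.
    labeled = [x for x, s in zip(xs, ss) if s == 1]
    c_hat = sum(sigmoid(w * x + b) for x in labeled) / len(labeled)

    # Step 3: rescale to approximate p(y=1 | x) = p(s=1 | x) / c, capped at 1.
    def posterior(x):
        return min(1.0, sigmoid(w * x + b) / c_hat)

    return c_hat, posterior(3.0), posterior(-3.0)
```

Calling `demo()` returns the estimated label frequency together with the corrected probabilities for a high-score and a low-score record. The design point is that the classifier never sees a confirmed negative example; the rescaling by c is what turns "probability of being labeled" into an estimate of "probability of being positive".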

Acknowledgements

This research is supported by the Shanghai Institute of Precision Measurement Project under Grant No. SAST2017-128 and the National Natural Science Foundation of China under Grant No. 61373030.

Author information

Authors and Affiliations

School of Software, Shanghai Jiao Tong University, Shanghai, China
Beige Zhang, Lihong Jiang & Hongming Cai
China Shanghai Institute of Precision Measurement and Testing, Shanghai, China
Chun Li
Faculty of Engineering and Computing, Coventry University, Coventry, UK
Nazaraf Shah & Xiang Fei

Corresponding author

Correspondence to Hongming Cai.

About this article

Cite this article

Zhang, B., Li, C., Shah, N. et al. A testing data validity assessment method and testing data validation platform based on SOA. SOCA 12, 201–209 (2018). https://doi.org/10.1007/s11761-018-0242-4

Received: 04 March 2018
Revised: 30 June 2018
Accepted: 17 September 2018
Published: 27 September 2018
Issue Date: December 2018
DOI: https://doi.org/10.1007/s11761-018-0242-4
