ABSTRACT
Continuous Integration (CI) reduces risk in software development, but a CI build usually brings huge time and resource consumption. Machine learning methods have been employed to cut the expenses of CI and provide instant feedback by predicting CI results. Nevertheless, effective learning requires massive training data which is not available for a new project. Moreover, due to the diversified characteristics of different projects, reusing models built on other projects leads to poor performance. To address this problem, we propose a novel active online model adaptation approach ACONA, which dynamically adapts a pool of classifiers trained on various projects to a new project using only a small fraction of new data it actively selects. With empirical study on Travis CI, we show that ACONA achieves an improvement of F-Measure by 40.0% while reducing Accumulated Error by 63.2% and the adapted model outperforms existing approaches.
- Paul M Duvall, Steve Matyas, and Andrew Glover. 2007. Continuous integration: improving software quality and reducing risk. Pearson Education. Google ScholarDigital Library
- Jacqui Finlay, Russel Pears, and Andy M Connor. 2014. Data stream mining for predicting software build outcomes using source code metrics. Information and Software Technology 56, 2 (2014), 183--198. Google ScholarDigital Library
- Ahmed E Hassan and Ken Zhang. 2006. Using decision trees to predict the certification result of a build. In 21st IEEE/ACM International Conference on Automated Software Engineering, 2006. ASE'06. IEEE, 189--198. Google ScholarDigital Library
- Michael Hilton, Timothy Tunnell, Kai Huang, Darko Marinov, and Danny Dig. 2016. Usage, costs, and benefits of continuous integration in open-source projects. In Automated Software Engineering (ASE), 2016 31st IEEE/ACM International Conference on. IEEE, 426--437. Google ScholarDigital Library
- Ansong Ni and Ming Li. 2017. Cost-effective build outcome prediction using cascaded classifiers. In Proceedings of the 14th International Conference on Mining Software Repositories. IEEE Press, 455--458. Google ScholarDigital Library
- N. Nagappan T. Zimmermann, E. Giger H. Gall, and B. Murphy. 2009. Cross-project defect prediction: a large scale experiment on data vs. domain vs. process. In Proceedings of the ACM SIGSOFT symposium on The foundations of software engineering. ACM, 91--100. Google ScholarDigital Library
- Martin Zinkevich. 2003. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th International Conference on Machine Learning (ICML-03). 928--936. Google ScholarDigital Library
Index Terms
- ACONA: active online model adaptation for predicting continuous integration build failures
Recommendations
Online Passive-Aggressive Active learning
We investigate online active learning techniques for online classification tasks. Unlike traditional supervised learning approaches, either batch or online learning, which often require to request class labels of each incoming instance, online active ...
Active learning using on-line algorithms
KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data miningThis paper describes a new technique and analysis for using on-line learning algorithms to solve active learning problems. Our algorithm is called Active Vote, and it works by actively selecting instances that force several perturbed copies of an on-...
ALBIF: Active Learning with BandIt Feedbacks
Advances in Knowledge Discovery and Data MiningAbstractOnline active learning algorithms reduce human labeling costs by querying only a subset of informative incoming instances from the data stream to update the classification model. Active learning for online multiclass classification under complete ...
Comments