Abstract
Learning in evolving environments involves learning from data where the statistical characteristics can change over time. Current change detection algorithms that are used online for data streams detect whether a change has occurred in the data but there is always a detection delay. None of the existing online techniques can accurately pin-point the exact location of when the change starts to occur, which can be critical. We present a novel method Change Angle and we show, for the first time, how to pin-point online the location at which change starts to occur. We apply our Change Angle method in the application area of software revision control using Mozilla data, where it is important to detect not only the presence of change but also to pin-point accurately the location of when change starts to occur.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baena-Garcıa, M., del Campo-Ávila, J., Fidalgo, R., Bifet, A., Gavaldà, R., Morales-Bueno, R.: Early drift detection method. In: Proceedings of the 4th International Workshop on Knowledge Discovery from Data Streams, pp. 77–86 (2006)
Bifet, A., Gavaldà, R.: Learning from time-changing data with adaptive windowing. In: Proceedings of the 7th SIAM International Conference on Data Mining (SDM), pp. 443–448 (2007)
Bifet, A., Gavaldà, R.: Adaptive learning from evolving data streams. In: Adams, N.M., Robardet, C., Siebes, A., Boulicaut, J.-F. (eds.) IDA 2009. LNCS, vol. 5772, pp. 249–260. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-03915-7_22
Domingos, P., Hulten, G.: Mining high-speed data streams. In: Proceedings of the 6th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 71–80 (2000)
Gama, J., Medas, P., Castillo, G., Rodrigues, P.: Learning with drift detection. In: Bazzan, A.L.C., Labidi, S. (eds.) SBIA 2004. LNCS (LNAI), vol. 3171, pp. 286–295. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-28645-5_29
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. 46(4), 1–35 (2014)
Hoeffding, W.: Probability inequalities for sums of bounded random variables. J. Am. Stat. Assoc. 58(301), 13–30 (1963)
Huang, D.T.J., Koh, Y.S., Dobbie, G., Bifet, A.: Drift detection using stream volatility. In: Appice, A., Rodrigues, P.P., Santos Costa, V., Soares, C., Gama, J., Jorge, A. (eds.) ECML PKDD 2015. LNCS (LNAI), vol. 9284, pp. 417–432. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-23528-8_26
Huang, D.T.J., Koh, Y.S., Dobbie, G., Pears, R.: Detecting volatility shift in data streams. In: Proceedings of the 10th IEEE ICDM, pp. 863–868 (2014)
Kanji, G.K.: 100 Statistical Tests. Sage Publications, London (2006)
Page, E.S.: Continuous inspection schemes. Biometrika 41, 100–115 (1954)
Pears, R., Sakthithasan, S., Koh, Y.S.: Detecting concept change in dynamic data streams. Mach. Learn. 97(3), 259–293 (2014)
Ross, G.J., Adams, N.M., Tasoulis, D.K., Hand, D.J.: Exponentially weighted moving average charts for detecting concept drift. Pattern Recogn. Lett. 33(2), 191–198 (2012)
Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach, vol. 25 (1995)
Acknowledgment
We thank all Mozilla engineers that were involved in the development of this research project especially Kyle Lahnakoski, Joel Maher, and Jonathan Griffin for helping us acquire access to the relevant data and validate the work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Huang, D.T.J., Koh, Y.S., Dobbie, G. (2019). Interpreting Intermittent Bugs in Mozilla Applications Using Change Angle. In: Islam, R., et al. Data Mining. AusDM 2018. Communications in Computer and Information Science, vol 996. Springer, Singapore. https://doi.org/10.1007/978-981-13-6661-1_25
Download citation
DOI: https://doi.org/10.1007/978-981-13-6661-1_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6660-4
Online ISBN: 978-981-13-6661-1
eBook Packages: Computer ScienceComputer Science (R0)