ABSTRACT
Over the past few years, the volume and types of data related to software engineering has grown at an unprecedented rate and shows no sign of slowing. This turn of events has led to a veritable gold rush, as researchers attempt to mine raw data and extract nuggets of insight. A very real danger is that the landscape may become a Wild West where inexperienced software "cowboys" sell hastily generated models to unsophisticated business users, without any concern for best or safe practices. Given the current enthusiasm for data analysis in software engineering, it is time to review how we using those techniques and can we use them better. While there may be no single best "right" way to analyze software data, there are many wrong ways. As data techniques mature, we need to move to a new era where data scientists understand and share the strengths and drawbacks of the many methods that might be deployed in industry. In this highly interactive panel skilled practitioners and academics can (a) broadcast their insights and (b) hear the issues of newcomers in this field.
- Christian Bird, Nachiappan Nagappan, Premkumar Devanbu, Harald Gall, and Brendan Murphy. Does distributed development affect software quality? An empirical case study of Windows Vista. In Proc. of the 31st Intl. Conference on Software Engineering (ICSE '09), pp. 518-528. Google ScholarDigital Library
- Thomas J. Ostrand, Elaine J. Weyuker, and Robert M. Bell. 2004. Where the bugs are. In Proceedings of the 2004 ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA '04), pp. 86-96. Google ScholarDigital Library
- Ayse Tosun Misirli, Ayse Bener, and Resat Kale. AI-Based Software Defect Predictors: Applications and Benefits in a Case Study. In Proceedings of the Twenty-Second Innovative Applications of Artificial Intelligence Conference (IAAI-10), pp. 1748-1755Google Scholar
- Tim Menzies, Zach Milton, Burak Turhan, Bojan Cukic, Yue Jiang, and Ayşe Bener. Defect prediction from static code features: current results, limitations, new approaches. In Automated Software Engineering, Volume 17, Issue 4, pp 375-407. December 2010 Google ScholarDigital Library
Index Terms
- Analyzing software data: after the gold rush (a goldfish-bowl panel)
Comments