Abstract
Context and motivation: Today’s industry specifications, in particular those of the automotive industry, are complex and voluminous. At Mercedes-Benz, a specification and its referenced documents often sums up to 3,000 pages. Question/problem: A common way to ensure the quality in such natural language specifications is technical review. Given such large specifications, reviewers have major problems in finding defects, especially consistency or completeness defects, between requirements with related information, spread over the various documents. Principal ideas/results: In this paper, we investigate two specifications from Mercedes-Benz, whether requirements with related information spread over many sections of many documents can be automatically classified and extracted using text classification algorithms to support reviewers with their work. We further research enhancements to improve these classifiers. The results of this work demonstrate that an automatic classification of requirements for multiple aspects is feasible with high accuracy. Contribution: In this paper, we show how an automatic classification of requirements can be used to improve the review process. We discuss the limitations and potentials of using this approach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Houdek, F.: Challenges in Automotive Requirements Engineering. In: Industrial Presentations by Requirements Engineering: Foundation for Software Quality, Essen (2010)
Knauss, E., Houmb, S., Schneider, K., Islam, S., Jürjens, J.: Supporting Requirements Engineers in Recognising Security Issues. In: Berry, D., Franch, X. (eds.) REFSQ 2011. LNCS, vol. 6606, pp. 4–18. Springer, Heidelberg (2011)
Ott, D., Raschke, A.: Review Improvement by Requirements Classification at Mercedes-Benz: Limits of Empirical Studies in Educational Environments. In: International Workshop on Empirical Requirements Engineering (EMPIRE), Chicago (2012)
Ott, D.: Defects in Natural Language Requirement Specifications at Mercedes-Benz: An Investigation Using a Combination of Legacy Data and Expert Opinion. In: International Requirements Engineering Conference, Chicago (2012)
Hussain, I., Ormandjieva, O., Kosseim, L.: LASR: A Tool for Large Scale Annotation of Software Requirements. In: International Workshop on Empirical Requirements Engineering (EMPIRE), Chicago (2012)
Song, X., Hwong, B.: Categorizing Requirements for a Contract-Based System Integration Project. In: International Requirements Engineering Conference, Chicago (2012)
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques: Practical Machine Learning Tools and Techniques. The Morgan Kaufmann Series in Data Management Systems. Elsevier Science (2011)
Houdek, F., Peach, B.: Das Tuersteuergeraet – eine Beispielspezifikation (engl.: The doors closure module – an example specification), Fraunhofer IESE (2002)
Gnesi, S., Lami, G., Trentanni, G., Fabbrini, F., Fusani, M.: An automatic tool for the analysis of natural language requirements. International Journal of Computer Systems Science & Engineering 20, 53–62 (2005)
Moser, T., Winkler, D., Heindl, M., Biffl, S.: Requirements management with semantic technology: An empirical study on automated requirements categorization and conflict analysis. In: Mouratidis, H., Rolland, C. (eds.) CAiSE 2011. LNCS, vol. 6741, pp. 3–17. Springer, Heidelberg (2011)
Ko, Y., Park, S., Seo, J., Choi, S.: Using classification techniques for informal requirements in the requirements analysis-supporting system. Information and Software Technology 49, 1128–1140 (2007)
Runeson, P., Hoest, M.: Guidelines for conducting and reporting case study research in software engineering. Empirical Software Engineering 14, 131–164 (2009)
Fagan, M.: Design and code inspections to reduce errors in program development. IBM Journal of Research and Development 15(3), 182 (1976)
Gilb, T., Graham, D.: Software Inspection. In: Finzi, S. (ed.), Addison-Wesley (1994)
Aurum, A., Petersson, H., Wohlin, C.: State-of-the-art: Software Inspections after 25 Years. Software Testing, Verification and Reliability 12(3), 133–154 (2002)
Wilson, W., Rosenberg, L., Hyatt, L.: Automated analysis of requirement specifications. In: Proceedings of the 19th International Conference on Software Engineering (ICSE 1997), pp. 161–171. IEEE (1997)
Gervasi, V., Zowghi, D.: Reasoning about inconsistencies in natural language requirements. ACM Trans. Softw. Eng. Methodol. 14(3), 277–330 (2005)
Wang, S., Manning, C.D.: Baselines and Bigrams: Simple, Good Sentiment and Topic Classification. In: ACL vol. (2), pp. 90–94 (2012)
Hollink, V., Kamps, J., Monz, C., De Rijke, M.: Monolingual document retrieval for European languages. Information retrieval 7(1), 33–52 (2004)
Leuser, J.: Herausforderungen für halbautomatische Traceability-Erkennung (Challenges for Semi-automatic Trace Recovery). In: Systems Engineering Infrastructure Conference (2009)
Carletta, J.: Squips and Discussions Assessing agreement on classification tasks: The kappa statistic. Computional Linguistics 22(2), 249–254 (1996)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press, Addison Wesley
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Waltham (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ott, D. (2013). Automatic Requirement Categorization of Large Natural Language Specifications at Mercedes-Benz for Review Improvements. In: Doerr, J., Opdahl, A.L. (eds) Requirements Engineering: Foundation for Software Quality. REFSQ 2013. Lecture Notes in Computer Science, vol 7830. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37422-7_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-37422-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37421-0
Online ISBN: 978-3-642-37422-7
eBook Packages: Computer ScienceComputer Science (R0)