Skip to main content
Log in

TiQi: answering unstructured natural language trace queries

  • RE 2014
  • Published:
Requirements Engineering Aims and scope Submit manuscript

Abstract

Software traceability is a required element in the development and certification of safety-critical software systems. However, trace links, which are created at significant cost and effort, are often underutilized in practice due primarily to the fact that project stakeholders often lack the skills needed to formulate complex trace queries. To mitigate this problem, we present a solution which transforms spoken or written natural language queries into structured query language (SQL). TiQi includes a general database query mechanism and a domain-specific model populated with trace query concepts, project-specific terminology, token disambiguators, and query transformation rules. We report results from four different experiments exploring user preferences for natural language queries, accuracy of the generated trace queries, efficacy of the underlying disambiguators, and stability of the trace query concepts. Experiments are conducted against two different datasets and show that users have a preference for written NL queries. Queries were transformed at accuracy rates ranging from 47 to 93 %.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Ali N, Guéhéneuc Y-G, Antoniol G (2013) Trustrace: mining software repositories to improve the accuracy of requirement traceability links. IEEE Trans Softw Eng 39(5):725–741

    Article  Google Scholar 

  2. Androutsopoulos I, Ritchie G (2000) Database interfaces. In: Handbook of natural language processing, Marcel Dekker Inc. pp 209–240

  3. Cleland-Huang J, Gotel O, Hayes JH, Mäder P, Zisman A (2014) Software traceability: trends and future directions. In: Proceedings of the on future of software engineering, FOSE 2014, Hyderabad, India, May 31–June 7, 2014, pp 55–69

  4. Cleland-Huang J, Heimdahl MPE, Hayes JH, Lutz RR, Maeder P (2012) Trace queries for safety requirements in high assurance systems. In: Proceedings of requirements engineering: foundation for software quality–18th international working conference, REFSQ 2012, Essen, Germany, March 19–22, 2012, pp 179–193

  5. Collaborative lifecycle management: design, test, analyze, develop, deliver, integrated by design, Accessed 9/6/2013

  6. Cormen T, Leiserson C, Rivest R, Stein C (eds) Introduction to algorithms (2nd ed.). MIT Press and McGrawHill (2001)

  7. Czauderna A, Cleland-Huang J, Çinar M, Berenbach B (2012) Just-in-time traceability for mechatronics systems. In: Second IEEE international workshop on requirements engineering for systems, services, and systems-of-systems, RESS 2012, Chicago, IL, September 25, 2012, pp 1–9

  8. Frost RA, Amour BS, Fortier RJ (2013) An event based denotational semantics for natural language queries to data represented in triple stores. In: 2013 IEEE seventh international conference on semantic computing, Irvine, CA, September 16–18, 2013, pp 142–145

  9. Göker MH, Thompson CA, Arajärvi S, Hua K (2007) Connecting people with questions to people with answers. KI 21(4):23–26

    MATH  Google Scholar 

  10. Gotel O, Cleland-Huang J, Huffman Hayes J, Zisman A, Egyed A, Grnbacher P, Dekhtyar A, Antoniol G, Maletic J, Mder P (2012) Traceability fundamentals. In: Cleland-Huang J, Gotel O, Zisman A (eds) Software and systems traceability. Springer, London, pp 3–22

    Chapter  Google Scholar 

  11. Gotel OCZ, Finkelstein A (1994) An analysis of the requirements traceability problem. In: Proceedings of the first IEEE international conference on requirements engineering, ICRE ’94, Colorado Springs, Colorado, April 18–21, 1994, pp 94–101

  12. Gouvêa E, Moreno-Daniel A, Reddy A, Chengalvarayan R, Thomson DL, Ljolje A (2013) The at&t speech API: a study on practical challenges for customized speech to text service. In: INTERSPEECH 2013, 14th annual conference of the international speech communication association, Lyon, France, August 25–29, 2013, pp 2071–2073

  13. Guerra E, de Lara J, Kolovos D, Paige R (2010) Inter-modelling: from theory to practice. In: Petriu D, Rouquette N, Haugen Ø (eds) Model driven engineering languages and systems, volume 6394 of lecture notes in computer science. Springer, Berlin, pp 376–391. doi:10.1007/978-3-642-16145-2

    Google Scholar 

  14. Hayes JH, Dekhtyar A, Sundaram SK, Holbrook EA, Vadlamudi S, April A (2007) Requirements tracing on target (retro): improving software maintenance through traceability recovery. ISSE 3(3):193–202

    Google Scholar 

  15. Huang X, Baker J, Reddy R (2014) A historical perspective of speech recognition. Commun ACM 57(1):94–103

    Article  Google Scholar 

  16. Jaakkola H, Thalheim B (2003) Visual sql: high-quality er-based query treatment. In: Jeusfeld M, Pastor Ó (eds) Conceptual modeling for novel application domains, volume 2814 of lecture notes in computer science. Springer, Berlin, pp 129–139. doi:10.1007/978-3-540-39597-3

    Google Scholar 

  17. Jarke M, Krause J, Vassiliou Y (1986) Studies in the evaluation of a domain-independent natural language query system. In: Cooperative interfaces to information systems. Springer, pp 101–130

  18. Jarke M, Krause J, Vassiliou Y, Stohr EA, Turner JA, White NH (1985) Evaluation and assessment of a domain-independent natural language query system. IEEE Database Eng Bull 8(3):34–44

    Google Scholar 

  19. Kim H-J, Korth HF, Silberschatz A (1988) PICASSO: a graphical query language. Softw Pract Exp 18:169–203

    Article  Google Scholar 

  20. Lempia DL, Miller SP (2009) Requirements engineering management handbook. National Technical Information Service (NTIS)

  21. Li Y, Yang H, Jagadish HV (2006) Term disambiguation in natural language query for XML. In: Flexible query answering systems, 7th International Conference, FQAS 2006, Milan, Italy, June 7–10, 2006, Proceedings, pp 133–146

  22. Lin J, Lin CC, Cleland-Huang J, Settimi R, Amaya J, Bedford G, Berenbach B, Khadra OB, Duan C, Zou X (2006) Poirot: a distributed tool supporting enterprise-wide automated traceability. In: 14th IEEE international conference on requirements engineering (RE 2006), 11–15 September 2006. Minneapolis/St. Paul, Minnesota, pp 356–357

  23. Lucia AD, Fasano F, Oliveto R, Tortora G (2010) Fine-grained management of software artefacts: the ADAMS system. Softw Pract Exp 40(11):1007–1034

    Article  Google Scholar 

  24. Mäder P, Cleland-Huang J (2013) A visual language for modeling and executing traceability queries. Softw Syst Model 12(3):537–553

    Article  Google Scholar 

  25. Mäder P, Jones PL, Zhang Y, Cleland-Huang J (2013) Strategic traceability for safety-critical projects. IEEE Softw 30(3):58–66

    Article  Google Scholar 

  26. Maletic JI, Collard ML (2009) Tql: a query language to support traceability. In: TEFSE ’09: Proceedings of the 2009 ICSE workshop on traceability in emerging forms of software engineering. Washington, DC, IEEE Computer Society, pp 16–20

  27. Meng HH, Siu KC (2002) Semiautomatic acquisition of semantic structures for understanding domain-specific natural language queries. IEEE Trans Knowl Data Eng 14(1):172–181

    Article  Google Scholar 

  28. Miller GA, Fellbaum C (2007) Wordnet then and now. Lang Resour Eval 41(2):209–214

    Article  Google Scholar 

  29. Minock M (2010) C-phrase: a system for building robust natural language interfaces to databases. Data Knowl Eng 69(3):290–302

    Article  Google Scholar 

  30. Popescu A-M, Etzioni O, Kautz HA (2003) Towards a theory of natural language interfaces to databases. In: IUI, pp 149–157

  31. Post from HP Quality Center Support Forum (2009) http://h30499.www3.hp.com/t5/ITRC-Quality-Center-Forum/traceability-query/m-p/4505026., Accessed 1/18/2015

  32. Pruski P, Lohar S, Aquanette R, Ott G, Amornborvornwong S, Rasin A, Cleland-Huang J (2014) Tiqi: Towards natural language trace queries. In: IEEE 22nd international requirements engineering conference, RE 2014, Karlskrona, Sweden, August 25–29, 2014, pp 123–132

  33. Rempel P, Mäder P, Kuschke T, Cleland-Huang J (2014) Mind the gap: assessing the conformance of software traceability to relevant guidelines. In: 36th international conference on software engineering (ICSE)

  34. Rempel P, Mäder P, Kuschke T, Philippow I (2013) Requirements traceability across organizational boundaries - a survey and taxonomy. In: Proceedings of requirements engineering: foundation for software quality: 19th international working conference, REFSQ 2013, Essen, Germany, April 8–11, 2013, pp 125–140

  35. Shwartz SP (1982) Problems with domain-independent natural language database access systems. In: Proceedings of the 20th annual meeting on association for computational linguistics, ACL ’82, Stroudsburg, PA. Association for Computational Linguistics,pp 60–62

  36. Silveira N, Dozat T, de Marneffe M, Bowman SR, Connor M, Bauer J, Manning CD (2014) A gold standard dependency corpus for english. In: Proceedings of the ninth international conference on language resources and evaluation (LREC-2014), Reykjavik, Iceland, May 26–31, 2014, pp 2897–2904

  37. Störrle H (2011) VMQL: a visual language for ad-hoc model querying. J Vis Lang Comput 22(1):3–29

    Article  Google Scholar 

  38. Vassiliou Y, Jarke M, Stohr E, Turner J, White N (1983) Natural language for database queries: a laboratory study. MIS Q 7(4):47–61

    Article  Google Scholar 

  39. Winkler S, von Pilgrim J (2010) A survey of traceability in requirements engineering and model-driven development. Softw Syst Model 9(4):529–565

    Article  Google Scholar 

  40. Zhang Y, Witte R, Rilling J, Haarslev V (2006) An ontology-based approach for the recovery of traceability links. In: 3rd International workshop on metamodels, schemas, grammars, and ontologies for reverse engineering (ATEM 2006), Genoa, Italy, October 1st

  41. Zloof M. Query by example. In: Proceedings of the NCC. AFIPS

Download references

Acknowledgments

The work described in this paper was partially funded by NSF Grant CCF-1319680.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jane Cleland-Huang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pruski, P., Lohar, S., Goss, W. et al. TiQi: answering unstructured natural language trace queries. Requirements Eng 20, 215–232 (2015). https://doi.org/10.1007/s00766-015-0224-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00766-015-0224-4

Keywords

Navigation