Skip to main content

VISH: Does Your Smart Home Dialogue System Also Need Training Data?

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12128))

Abstract

The main objective of smart homes is to improve the quality of life and comfort of their inhabitants through automation systems and ambient intelligence. Voice-based interaction like dialogue systems is the current emerging trend in these systems. Natural Language Understanding (NLU) model can identify the end-users’ intentions in the utterances provided to spoken dialogue systems. The utility of dialogue systems is reliant on the quality of NLU models, which is in turn significantly dependent on the availability of a high-quality and sufficiently large corpus for training, containing diverse utterance structures. However, building such corpora is a complex task even for companies possessing significant human and infrastructure resources. On the other hand, the existing corpora for the smart home domain are either concerned with web services, focus on direct goals only, follow static command structure, or are not publicly available in English language which limits the development of goal-oriented dialogue systems for smart homes. In this paper, we propose a generic method to create training data for the NLU component using a generative grammar-based approach. Our method outputs, Voice Interaction in Smart Home (VISH) dataset consisting of five million unique utterances for the smart home. This dataset can greatly facilitate research in the area of voice-based dialogue systems for smart homes. We evaluate the approach by using VISH to train several state-of-the-art NLU models. Our experiment results demonstrate the capability of the corpus to support the development of goal-oriented voice-based dialogue systems in the context of smart homes.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://ifttt.com/.

  2. 2.

    https://nodered.org/.

  3. 3.

    https://www.w3.org/2019/wot/td.

  4. 4.

    https://bildungsportal.sachsen.de/umfragen/limesurvey/index.php/777955.

  5. 5.

    https://www.nltk.org/.

  6. 6.

    https://rasa.com/.

  7. 7.

    https://github.com/PolyAI-LDN/polyai-models#models.

  8. 8.

    https://spacy.io/.

  9. 9.

    https://vsr.informatik.tu-chemnitz.de/projects/2019/growth/.

  10. 10.

    https://github.com/rodrigopivi/Chatito.

  11. 11.

    https://github.com/SimGus/Chatette.

References

  1. International Organization for Standardization/International Electrotechnical Commission 14977:1996 information technology-syntactic meta-language-extended BNF. In: Standard. International Organization for Standardization, Geneva, CH (1996). http://standards.iso.org/ittf/PubliclyAvailableStandards/

  2. Barricelli, B.R., Valtolina, S.: Designing for end-user development in the Internet of Things. In: Díaz, P., Pipek, V., Ardito, C., Jensen, C., Aedo, I., Boden, A. (eds.) IS-EUD 2015. LNCS, vol. 9083, pp. 9–24. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18425-8_2

    Chapter  Google Scholar 

  3. Bertin, N., et al.: Voicehome-2, an extended corpus for multichannel speech processing in real homes. Speech Commun. 106, 68–78 (2019)

    Article  Google Scholar 

  4. Campagna, G., Xu, S., Moradshahi, M., Socher, R., Lam, M.S.: Genie: a generator of natural language semantic parsers for virtual assistant commands. In: Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 394–410. ACM (2019)

    Google Scholar 

  5. Catania, V., Delfa, G.C.L., Monteleone, S., Patti, D., Ventura, D., Torre, G.L.: Goose: goal oriented orchestration for smart environments. Int. J. Ad Hoc Ubiquit. Comput. 32(3), 159–170 (2019)

    Article  Google Scholar 

  6. Clark, M., Newman, M.W., Dutta, P.: Devices and data and agents, oh my: how smart home abstractions prime end-user mental models. Proc. ACM Interact. Mob. Wearable Ubiquit. Technol. 1(3), 44 (2017)

    Article  Google Scholar 

  7. Cristoforetti, L., et al.: The DIRHA simulated corpus. In: LREC, pp. 2629–2634 (2014)

    Google Scholar 

  8. Georgievski, I., Aiello, M.: Automated planning for ubiquitous computing. ACM Comput. Surv. (CSUR) 49(4), 1–46 (2016)

    Article  Google Scholar 

  9. Kollar, T., et al.: The alexa meaning representation language. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 3 (Industry Papers), pp. 177–184 (2018)

    Google Scholar 

  10. Li, T.J.-J., Labutov, I., Myers, B.A., Azaria, A., Rudnicky, A.I., Mitchell, T.M.: Teaching agents when they fail: end user development in goal-oriented conversational agents. In: Moore, R.J., Szymanski, M.H., Arar, R., Ren, G.-J. (eds.) Studies in Conversational UX Design. HIS, pp. 119–137. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-95579-7_6

    Chapter  Google Scholar 

  11. Luger, E., Sellen, A.: “Like Having a Really Bad PA” the gulf between user expectation and experience of conversational agents. In: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pp. 5286–5297 (2016)

    Google Scholar 

  12. Mayer, S., Verborgh, R., Kovatsch, M., Mattern, F.: Smart configuration of smart environments. IEEE Trans. Autom. Sci. Eng. 13(3), 1247–1255 (2016)

    Article  Google Scholar 

  13. Noura, M., Gaedke, M.: An automated cyclic planning framework based on plan-do-check-act for web of things composition. In: Proceedings of the 10th ACM Conference on Web Science, pp. 205–214 (2019)

    Google Scholar 

  14. Noura, M., Gaedke, M.: WoTDL: Web of things description language for automatic composition. In: 2019 IEEE/WIC/ACM International Conference on Web Intelligence (WI), pp. 413–417. IEEE (2019)

    Google Scholar 

  15. Noura, M., Heil, S., Gaedke, M.: GrOWTH: goal-oriented end user development for web of things devices. In: Mikkonen, T., Klamma, R., Hernández, J. (eds.) ICWE 2018. LNCS, vol. 10845, pp. 358–365. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91662-0_29

    Chapter  Google Scholar 

  16. Noura, M., Heil, S., Gaedke, M.: Webifying heterogenous Internet of Things devices. In: Bakaev, M., Frasincar, F., Ko, I.-Y. (eds.) ICWE 2019. LNCS, vol. 11496, pp. 509–513. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-19274-7_36

    Chapter  Google Scholar 

  17. Palanca, J., Val, E., Garcia-Fornes, A., Billhardt, H., Corchado, J.M., Julián, V.: Designing a goal-oriented smart-home environment. Inf. Syst. Front. 20(1), 125–142 (2016). https://doi.org/10.1007/s10796-016-9670-x

    Article  Google Scholar 

  18. Portet, F., et al.: Context-aware voice-based interaction in smart home-vocadom@ a4h corpus collection and empirical assessment of its usefulness. In: 2019 IEEE Intl Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech), pp. 811–818. IEEE (2019)

    Google Scholar 

  19. Shilin, I., Kovriguina, L., Mouromtsev, D., Wohlgenannt, G., Ivanitskiy, R.: A method for dataset creation for dialogue state classification in voice control systems for the Internet of Things. In: R. Piotrowski’s Readings in Language Engineering and Applied Linguistics, pp. 96–106 (2018)

    Google Scholar 

  20. Tahir, A.: Smart home scenarios (2019). https://doi.org/10.6084/m9.figshare.8327096.v1

  21. Tsiami, A., Rodomagoulakis, I., Giannoulis, P., Katsamanis, A., Potamianos, G., Maragos, P.: Athena: a Greek multi-sensory database for home automation control uthor: isidoros rodomagoulakis (ntua, greece). In: INTERSPEECH (2014)

    Google Scholar 

  22. Vacher, M., Lecouteux, B., Chahuara, P., Portet, F., Meillon, B., Bonnefond, N.: The sweet-home speech and multimodal corpus for home automation interaction (2014)

    Google Scholar 

  23. Wang, X., Yuan, C.: Recent advances on human-computer dialogue. CAAI Trans. Intell. Technol. 1(4), 303–312 (2016)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mahda Noura .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Noura, M., Heil, S., Gaedke, M. (2020). VISH: Does Your Smart Home Dialogue System Also Need Training Data?. In: Bielikova, M., Mikkonen, T., Pautasso, C. (eds) Web Engineering. ICWE 2020. Lecture Notes in Computer Science(), vol 12128. Springer, Cham. https://doi.org/10.1007/978-3-030-50578-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-50578-3_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-50577-6

  • Online ISBN: 978-3-030-50578-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics