VISH: Does Your Smart Home Dialogue System Also Need Training Data?

Noura, Mahda; Heil, Sebastian; Gaedke, Martin

doi:10.1007/978-3-030-50578-3_13

Conference paper
First Online: 10 June 2020

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12128)

Abstract

The main objective of smart homes is to improve the quality of life and comfort of their inhabitants through automation systems and ambient intelligence. Voice-based interaction, such as dialogue systems, is an emerging trend in these systems. A Natural Language Understanding (NLU) model identifies the end users' intentions in the utterances provided to a spoken dialogue system. The utility of a dialogue system depends on the quality of its NLU model, which in turn depends heavily on the availability of a high-quality, sufficiently large training corpus containing diverse utterance structures. However, building such corpora is a complex task even for companies with significant human and infrastructure resources. Moreover, the existing corpora for the smart home domain are either concerned with web services, focus on direct goals only, follow a static command structure, or are not publicly available in English, which limits the development of goal-oriented dialogue systems for smart homes. In this paper, we propose a generic method to create training data for the NLU component using a generative grammar-based approach. Our method outputs the Voice Interaction in Smart Home (VISH) dataset, consisting of five million unique utterances for the smart home. This dataset can greatly facilitate research in the area of voice-based dialogue systems for smart homes. We evaluate the approach by using VISH to train several state-of-the-art NLU models. Our experimental results demonstrate the capability of the corpus to support the development of goal-oriented voice-based dialogue systems in the context of smart homes.
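
To make the idea of grammar-based utterance generation more concrete, the following is a minimal sketch in Python. It is not the authors' grammar or tooling: the rule names, the vocabulary, and the `expand` helper are hypothetical and chosen only for illustration. The sketch expands a small context-free grammar exhaustively, so that a handful of rules yields every combination of optional politeness marker, action, device, and location.

```python
from itertools import product

# Hypothetical toy grammar. The rules and vocabulary are illustrative
# assumptions and are NOT taken from the VISH grammar.
# Each non-terminal maps to a list of alternatives; each alternative is a
# sequence of symbols. An empty string marks an optional slot left out.
GRAMMAR = {
    "UTTERANCE": [["POLITE", "ACTION", "DEVICE", "LOCATION"]],
    "POLITE": [[""], ["please"]],
    "ACTION": [["turn on"], ["switch off"]],
    "DEVICE": [["the light"], ["the heater"]],
    "LOCATION": [[""], ["in the kitchen"]],
}


def expand(symbol):
    """Return every string derivable from `symbol` in GRAMMAR."""
    if symbol not in GRAMMAR:
        return [symbol]  # terminal: emitted as-is
    results = []
    for alternative in GRAMMAR[symbol]:
        # Expand each symbol of the alternative, then combine all choices.
        parts = [expand(s) for s in alternative]
        for combo in product(*parts):
            # Drop empty optional slots before joining into one utterance.
            results.append(" ".join(p for p in combo if p))
    return results


# De-duplicate so the corpus contains unique utterances only.
utterances = sorted(set(expand("UTTERANCE")))

if __name__ == "__main__":
    for utterance in utterances:
        print(utterance)
```

Because the number of generated utterances grows multiplicatively with the number of alternatives per slot, even a modest grammar can produce a very large corpus, which is plausibly how a grammar-based method can reach millions of unique utterances.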

Author information

Authors and Affiliations

Technische Universität Chemnitz, Chemnitz, Germany
Mahda Noura, Sebastian Heil & Martin Gaedke

Corresponding author

Correspondence to Mahda Noura.

Editor information

Editors and Affiliations

Slovak University of Technology, Bratislava, Slovakia
Maria Bielikova
University of Helsinki, Helsinki, Finland
Tommi Mikkonen
University of Lugano (USI), Lugano, Switzerland
Cesare Pautasso

About this paper

Cite this paper

Noura, M., Heil, S., Gaedke, M. (2020). VISH: Does Your Smart Home Dialogue System Also Need Training Data? In: Bielikova, M., Mikkonen, T., Pautasso, C. (eds) Web Engineering. ICWE 2020. Lecture Notes in Computer Science, vol 12128. Springer, Cham. https://doi.org/10.1007/978-3-030-50578-3_13

DOI: https://doi.org/10.1007/978-3-030-50578-3_13
Published: 10 June 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-50577-6
Online ISBN: 978-3-030-50578-3
eBook Packages: Computer Science, Computer Science (R0)