Abstract
The main objective of smart homes is to improve the quality of life and comfort of their inhabitants through automation systems and ambient intelligence. Voice-based interaction like dialogue systems is the current emerging trend in these systems. Natural Language Understanding (NLU) model can identify the end-users’ intentions in the utterances provided to spoken dialogue systems. The utility of dialogue systems is reliant on the quality of NLU models, which is in turn significantly dependent on the availability of a high-quality and sufficiently large corpus for training, containing diverse utterance structures. However, building such corpora is a complex task even for companies possessing significant human and infrastructure resources. On the other hand, the existing corpora for the smart home domain are either concerned with web services, focus on direct goals only, follow static command structure, or are not publicly available in English language which limits the development of goal-oriented dialogue systems for smart homes. In this paper, we propose a generic method to create training data for the NLU component using a generative grammar-based approach. Our method outputs, Voice Interaction in Smart Home (VISH) dataset consisting of five million unique utterances for the smart home. This dataset can greatly facilitate research in the area of voice-based dialogue systems for smart homes. We evaluate the approach by using VISH to train several state-of-the-art NLU models. Our experiment results demonstrate the capability of the corpus to support the development of goal-oriented voice-based dialogue systems in the context of smart homes.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
References
International Organization for Standardization/International Electrotechnical Commission 14977:1996 information technology-syntactic meta-language-extended BNF. In: Standard. International Organization for Standardization, Geneva, CH (1996). http://standards.iso.org/ittf/PubliclyAvailableStandards/
Barricelli, B.R., Valtolina, S.: Designing for end-user development in the Internet of Things. In: Díaz, P., Pipek, V., Ardito, C., Jensen, C., Aedo, I., Boden, A. (eds.) IS-EUD 2015. LNCS, vol. 9083, pp. 9–24. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18425-8_2
Bertin, N., et al.: Voicehome-2, an extended corpus for multichannel speech processing in real homes. Speech Commun. 106, 68–78 (2019)
Campagna, G., Xu, S., Moradshahi, M., Socher, R., Lam, M.S.: Genie: a generator of natural language semantic parsers for virtual assistant commands. In: Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 394–410. ACM (2019)
Catania, V., Delfa, G.C.L., Monteleone, S., Patti, D., Ventura, D., Torre, G.L.: Goose: goal oriented orchestration for smart environments. Int. J. Ad Hoc Ubiquit. Comput. 32(3), 159–170 (2019)
Clark, M., Newman, M.W., Dutta, P.: Devices and data and agents, oh my: how smart home abstractions prime end-user mental models. Proc. ACM Interact. Mob. Wearable Ubiquit. Technol. 1(3), 44 (2017)
Cristoforetti, L., et al.: The DIRHA simulated corpus. In: LREC, pp. 2629–2634 (2014)
Georgievski, I., Aiello, M.: Automated planning for ubiquitous computing. ACM Comput. Surv. (CSUR) 49(4), 1–46 (2016)
Kollar, T., et al.: The alexa meaning representation language. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 3 (Industry Papers), pp. 177–184 (2018)
Li, T.J.-J., Labutov, I., Myers, B.A., Azaria, A., Rudnicky, A.I., Mitchell, T.M.: Teaching agents when they fail: end user development in goal-oriented conversational agents. In: Moore, R.J., Szymanski, M.H., Arar, R., Ren, G.-J. (eds.) Studies in Conversational UX Design. HIS, pp. 119–137. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-95579-7_6
Luger, E., Sellen, A.: “Like Having a Really Bad PA” the gulf between user expectation and experience of conversational agents. In: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pp. 5286–5297 (2016)
Mayer, S., Verborgh, R., Kovatsch, M., Mattern, F.: Smart configuration of smart environments. IEEE Trans. Autom. Sci. Eng. 13(3), 1247–1255 (2016)
Noura, M., Gaedke, M.: An automated cyclic planning framework based on plan-do-check-act for web of things composition. In: Proceedings of the 10th ACM Conference on Web Science, pp. 205–214 (2019)
Noura, M., Gaedke, M.: WoTDL: Web of things description language for automatic composition. In: 2019 IEEE/WIC/ACM International Conference on Web Intelligence (WI), pp. 413–417. IEEE (2019)
Noura, M., Heil, S., Gaedke, M.: GrOWTH: goal-oriented end user development for web of things devices. In: Mikkonen, T., Klamma, R., Hernández, J. (eds.) ICWE 2018. LNCS, vol. 10845, pp. 358–365. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91662-0_29
Noura, M., Heil, S., Gaedke, M.: Webifying heterogenous Internet of Things devices. In: Bakaev, M., Frasincar, F., Ko, I.-Y. (eds.) ICWE 2019. LNCS, vol. 11496, pp. 509–513. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-19274-7_36
Palanca, J., Val, E., Garcia-Fornes, A., Billhardt, H., Corchado, J.M., Julián, V.: Designing a goal-oriented smart-home environment. Inf. Syst. Front. 20(1), 125–142 (2016). https://doi.org/10.1007/s10796-016-9670-x
Portet, F., et al.: Context-aware voice-based interaction in smart home-vocadom@ a4h corpus collection and empirical assessment of its usefulness. In: 2019 IEEE Intl Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech), pp. 811–818. IEEE (2019)
Shilin, I., Kovriguina, L., Mouromtsev, D., Wohlgenannt, G., Ivanitskiy, R.: A method for dataset creation for dialogue state classification in voice control systems for the Internet of Things. In: R. Piotrowski’s Readings in Language Engineering and Applied Linguistics, pp. 96–106 (2018)
Tahir, A.: Smart home scenarios (2019). https://doi.org/10.6084/m9.figshare.8327096.v1
Tsiami, A., Rodomagoulakis, I., Giannoulis, P., Katsamanis, A., Potamianos, G., Maragos, P.: Athena: a Greek multi-sensory database for home automation control uthor: isidoros rodomagoulakis (ntua, greece). In: INTERSPEECH (2014)
Vacher, M., Lecouteux, B., Chahuara, P., Portet, F., Meillon, B., Bonnefond, N.: The sweet-home speech and multimodal corpus for home automation interaction (2014)
Wang, X., Yuan, C.: Recent advances on human-computer dialogue. CAAI Trans. Intell. Technol. 1(4), 303–312 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Noura, M., Heil, S., Gaedke, M. (2020). VISH: Does Your Smart Home Dialogue System Also Need Training Data?. In: Bielikova, M., Mikkonen, T., Pautasso, C. (eds) Web Engineering. ICWE 2020. Lecture Notes in Computer Science(), vol 12128. Springer, Cham. https://doi.org/10.1007/978-3-030-50578-3_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-50578-3_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-50577-6
Online ISBN: 978-3-030-50578-3
eBook Packages: Computer ScienceComputer Science (R0)