Notes
The Ginga-NCL extension implementation presented in this work can be accessed at https://github.com/marinaivanov/ginga-mulsemedia/tree/multimodal
A video of the application running is available at https://1drv.ms/u/s!AvKXFJ6ipz46hqgcEWo39TPNhFC4JQ?e=xyetJ1
NCL applications and the script can be accessed at http://bit.do/fHiw4
Acknowledgments
The authors would like to thank CAPES, CAPES PRINT, CNPq, INCT-MACC and FAPERJ for the partial financial support of this work.
Cite this article
Barreto, F., de Abreu, R., Josué, M.I.P. et al. Providing multimodal and multi-user interactions for digital tv applications. Multimed Tools Appl 82, 4821–4846 (2023). https://doi.org/10.1007/s11042-021-11847-3