Providing multimodal and multi-user interactions for digital tv applications

Barreto, Fábio; de Abreu, Raphael S.; Josué, Marina I. P.; Montevecchi, Eyre Brasil B.; Valentim, Pedro Alves; Muchaluat-Saade, Débora C.

doi:10.1007/s11042-021-11847-3

Providing multimodal and multi-user interactions for digital tv applications

1215: Multimodal Interaction and IoT Applications
Published: 18 July 2022

Volume 82, pages 4821–4846, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Fábio Barreto ORCID: orcid.org/0000-0003-4842-5238^1,2,
Raphael S. de Abreu^1,2,
Marina I. P. Josué¹,
Eyre Brasil B. Montevecchi¹,
Pedro Alves Valentim¹ &
…
Débora C. Muchaluat-Saade¹

235 Accesses
1 Altmetric
Explore all metrics

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

https://assistant.google.com/platforms/tv/
The Ginga-NCL extension implementation presented in this work can be accessed at https://github.com/marinaivanov/ginga-mulsemedia/tree/multimodal
https://github.com/TeleMidia/ginga
https://gaming.tobii.com/tobii-eye-tracker-4c/
https://help.tobii.com/hc/en-us/articles/213414285-Specifications-for-the-Tobii-Eye-Tracker-4C
Video of the application running at: https://1drv.ms/u/s!AvKXFJ6ipz46hqgcEWo39TPNhFC4JQ?e=xyetJ1
NCL applications and the script can be accessed at http://bit.do/fHiw4
https://forumsbtvd.org.br/tv3_0/

References

ABNT (2021) Digital terrestrial television - Data coding and transmission specification for digital broadcasting - Part 2: Ginga-NCL for fixed and mobile receivers - XML application language for application coding (In portuguese)
Barreto F, de Abreu RS, Montevecchi EBB, Josué MI, Valentim PA, Muchaluat-Saade DC (2020) Extending ginga-ncl to specify multimodal interactions with multiple users (in portuguese). In: Proceedings of the brazilian symposium on multimedia and the web, pp. 281–288
Bolt RA (1980) “Put-that-there” voice and gesture at the graphics interface. In: Proceedings of the 7th annual conference on Computer graphics and interactive techniques, pp. 262–270
Card S, MORAN T, Newell A (1986) The model human processor- an engineering model of human performance. Handbook of perception and human performance 2(45–1)
Carvalho LAMC, Guimarães AP, Macêdo HT (2008) Architectures for interactive vocal environment to brazilian digital tv middleware. In: Proceedings of the 2008 euro american conference on telematics and information systems, EATIS ’08. ACM, New York, NY, USA
Casanova MA, Tucherman L, Lima MJD, Rangel Netto JL, Rodriquez N, Soares LF (1991) The nested context model for hyperdocuments. In: Proceedings of the third annual ACM conference on Hypertext, pp. 193–201
Costa RMdR, Moreno MF, Soares LFG (2009) Ginga-ncl: Supporting multiple devices. In: Proceedings of the XV brazilian symposium on multimedia and the web, webmedia ’09. association for computing machinery, New York, NY, USA
Danesh A, Saini M, El Saddik A (2015) A proxemic multimedia interaction over the internet of things International conference on multimedia modeling, pp. 1–12. Springer
de Farias BC, de Lima Filho EB, Maia OB, Souto E (2020) Extensions to middleware ginga for integration with iot environments. In: 2020 IEEE International conference on consumer electronics (ICCE), pp. 1–5. IEEE
de Lima ES, Feijó B, Barbosa S, da Silva FG, Furtado AL, Ciarlini AE, Pozzer CT (2011) Multimodal, multi-user and adaptive interaction for interactive storytelling applications. In: 2011 Brazilian symposium on games and digital entertainment, pp. 206–214. IEEE
Feit AM, Williams S, Toledo A, Paradiso A, Kulkarni H, Kane S, Morris MR (2017) Toward everyday gaze input: Accuracy and precision of eye tracking and implications for design. In: Proceedings of the 2017 chi conference on human factors in computing systems, chi ’17, p. 1118–1130. association for computing machinery, New York, NY, USA. https://doi.org/10.1145/3025453.3025599
Furht B (2008) Encyclopedia of multimedia Springer Science & Business Media
Gowing M, Ahmadi A, Destelle F, Monaghan DS, O’Connor NE, Moran K (2014) Kinect vs. low-cost inertial sensing for gesture recognition. In: International conference on multimedia modeling, pp. 484–495. Springer
Guedes ÁLV, de Albuquerque Azevedo RG, Barbosa SDJ (2017) Extending multimedia languages to support multimodal user interactions. Multimedia Tools and Applications 76(4):5691–5720
Article Google Scholar
Hunkeler U, Truong HL, Stanford-Clark A (2008) Mqtt-s—a publish/subscribe protocol for wireless sensor networks. In: 2008 3Rd international conference on communication systems software and middleware and workshops (COMSWARE’08), pp. 791–798. IEEE
Ierusalimschy R (2006) Programming in lua Roberto Ierusalimschy
ITU (2009) Nested context language (ncl) and ginga-ncl for iptv services. http://www.itu.int/rec/T-REC-H.761-200904-S. ITU-T Rec. H.761
Klyne G, Carroll JJ (2006) Resource description framework (rdf): Concepts and abstract syntax
Luque FP, Galloso I, Feijoo C, Martín CA, Cisneros G (2014) Integration of multisensorial stimuli and multimodal interaction in a hybrid 3dtv system. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 11(1s):1–22
Article Google Scholar
Mo S, Yang W, Wang G, Liao Q (2020) Emotion recognition with facial landmark heatmaps. In: International conference on multimedia modeling, pp. 278–289. Springer
Montevecchi EBB, Josué MI, Barreto F, de Abreu RS, Muchaluat-Saade DC (2020) Providing eye gaze interaction for ginga-ncl applications (in portuguese). In: Proceedings of the brazilian symposium on multimedia and the web, pp. 297–303
Muchaluat-Saade DC, Soares LFG (2002) Xconnector & xtemplate: Improving the expressiveness and reuse in web authoring languages. New Rev Hypermedia Multimed J 8(1):139–169
Article Google Scholar
Pedrosa D, Martins Jr JAC, Melo EL, Teixeira CA (2011) A multimodal interaction component for digital television. In: Proceedings of the 2011 ACM Symposium on Applied Computing, pp. 1253–1258
Pereira DMG, e Silva FJdS, Carlos de Salles SN, dos Santos DV, Coutinho LR, Guedes ÁL (2021) An ontology-based approach to integrate tv and iot middlewares. Multimed Tools Appl 80(2):1813–1837
Article Google Scholar
Silva ECO, dos Santos JA, Muchaluat-Saade DC (2013) Jns: an alternative authoring language for specifying ncl multimedia documents. In: 2013 IEEE International conference on multimedia and expo workshops (ICMEW), pp. 1–6. IEEE
Soares LFG, Rodrigues RF (2005) Nested context model 3.0: Part 1–ncm core. Technical Report of the Informatics Department of PUC-Rio 12
Soares LFG, Rodrigues RF, Moreno MF (2007) Ginga-ncl: the declarative environment of the brazilian digital tv system. J Brazilian Comput Soc 12 (4):37–46
Article Google Scholar
Soares LFG, Rodrigues RF, Muchaluat-Saade DC (2000) Modeling, authoring and formatting hypermedia documents in the hyperprop system. Multimedia systems 8(2):118–134
Article MATH Google Scholar
Tobii (2020) This is eye tracking. https://www.tobii.com/group/about/this-is-eye-tracking. Acessed in: May 2020
Turabzadeh S, Meng H, Swash RM, Pleva M, Juhar J (2018) Facial expression emotion detection for real-time embedded systems. Technologies 6(1):17
Article Google Scholar
Turk M (2014) Multimodal interaction: a review. Pattern Recogn Lett 36:189–195
Article Google Scholar
Turunen M, Kallinen A, Sànchez I., Riekki J, Hella J, Olsson T, Melto A, Rajaniemi JP, Hakulinen J, Mäkinen E et al (2009) Multimodal interaction with speech and physical touch interface in a media center application. In: Proceedings of the international conference on advances in computer enterntainment technology, pp. 19–26
Union IT (2009) Itu-r h761: Nested context language (ncl) and ginga-ncl for iptv services
W3C (2008) Synchronized multimedia integration language - smil 3.0 specification. http://www.w3c.org/TR/SMIL3. World-Wide Web Consortium Rec.
W3C (2014) Html5: A vocabulary and associated apis for html and xhtml. https://www.w3.org/TR/html5/. World-Wide Web Consortium Rec.

Download references

Acknowledgments

The authors would like to thank CAPES, CAPES PRINT, CNPq, INCT-MACC and FAPERJ for the partial financial support of this work.

Author information

Authors and Affiliations

MídiaCom Lab, Fluminense Federal University, Niterói, Brazil
Fábio Barreto, Raphael S. de Abreu, Marina I. P. Josué, Eyre Brasil B. Montevecchi, Pedro Alves Valentim & Débora C. Muchaluat-Saade
Unilasalle-RJ, Niterói, Brazil
Fábio Barreto & Raphael S. de Abreu

Authors

Fábio Barreto
View author publications
You can also search for this author in PubMed Google Scholar
Raphael S. de Abreu
View author publications
You can also search for this author in PubMed Google Scholar
Marina I. P. Josué
View author publications
You can also search for this author in PubMed Google Scholar
Eyre Brasil B. Montevecchi
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Alves Valentim
View author publications
You can also search for this author in PubMed Google Scholar
Débora C. Muchaluat-Saade
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fábio Barreto.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Barreto, F., de Abreu, R., Josué, M.I.P. et al. Providing multimodal and multi-user interactions for digital tv applications. Multimed Tools Appl 82, 4821–4846 (2023). https://doi.org/10.1007/s11042-021-11847-3

Download citation

Received: 30 December 2020
Revised: 25 August 2021
Accepted: 23 December 2021
Published: 18 July 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s11042-021-11847-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Providing multimodal and multi-user interactions for digital tv applications

Access this article

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation