Skip to main content
Log in

Extending MPEG-7 for efficient annotation of complex web 3D scenes

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In this paper, we propose an annotation scheme for web-3D scenes based on the MPEG-7 standard. We focus on the annotation of 3D scenes that are encoded with the X3D modeling language which is the descendant of VRML. X3D has been adopted by the web service industry as the appropriate framework for developing internet friendly and flexible 3D visualization applications. We introduce MPEG-7 extensions that are necessary in order to fulfill the requirements of the X3D scene structure and we adapt the MPEG-7 schema encoding accordingly. In the annotation scheme, we consider animation and interactivity issues along with geometrical and appearance characteristics of the 3D content providing a more efficient description of the scene. Thus, the extensions proposed in this paper cover all the information required for a complete and efficient description on the position and relative size of 3D objects, specific characteristics such as object type, curvature properties and available textures, combined with the objects’ innate animation properties and its interactions with other objects in the scene or with the end user. The extensions are MPEG-7 Visual and Metadata Descriptors, which fully conform to the standardization restrictions, and we also provide the modifications to the corresponding schema of the ISO 15938 standard that are essential for validating against the proposed MPEG-7 implementation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Attene M, Robbiano F, Spagnuolo M, Falcidieno B (2007) “Semantic annotation of 3D surface meshes based on feature characterization”, semantic multimedia, lecture notes in computer science, vol. 4816. Springer, Berlin, pp 126–139. doi:10.1007/978-3-540-77051-0_15

    Google Scholar 

  2. Bilasco IM, Genzel J, Villanova M, Martin H (2005) “On indexing of 3D scenes using MPEG-7”. In: Proceedings of the 13th annual ACM international conference on multimedia (MULTIMEDIA ’05), Singapore, pp. 471–474, ACM, New York, doi:10.1145/1101149.1101254

  3. Bilasco IM, Genzel J, Villanova MO, Martin H (2006) “An MPEG-7 framework enhancing the reuse of 3D models”. In: Proceedings of the eleventh international conference on 3D web technology (Web3D ’06), Columbia, Maryland, ACM, New York, pp. 65–74, doi:10.1145/1122591.1122601

  4. Bilasco IM, Genzel J, Villanova MO, Martin H (2007) “Semantic-based rules for 3D scene adaptation”. In: Proceedings of the twelfth international conference on 3D web technology (Web3D ’07), Perugia, Italy, pp. 97–100, ACM, New York. doi:10.1145/1229390.1229406

  5. Chmielewski J (2008) “Interaction descriptor for 3D objects”. In: Proceedings of the international conference on human system interaction (HSI’08), Krakow, Poland, pp.18–23, May 25–27, doi:10.1109/HSI.2008.4581401

  6. Chmielewski J (2008) “Interaction interfaces for unrestricted multimedia interaction descriptions”. In: Proceedings of the 6th international conference on advances in mobile computing and multimedia (MoMM ’08), ACM, New York, pp. 397–400, doi:10.1145/1497185.1497270

  7. COLLADA Homepage. http://www.khronos.org/collada/

  8. Dasiopoulou S, Tzouvaras V, Kompatsiaris I, Strintzis MG (2010) Enquiring MPEG-7 based multimedia ontologies. Multimed Tools Appl 46(2–3):331–370

    Article  Google Scholar 

  9. Doller M, Kosch H (2008) The MPEG-7 multimedia database system (MPEG-7 MMDB). J Syst Softw 81(9):1559–1580

    Article  Google Scholar 

  10. Doulamis N, Ceacero C, Collantes L, Tektonidis D (2006) “DESYME: development system for mobile services”, 15th IST mobile & wireless communications summit, 4–8 June, Mykonos, Greece

  11. FOCUS K3D Homepage, http://www.focusk3d.eu/

  12. Glantz A, Krutz A, Sikora T, Nunes P, Pereira F (2010) Automatic MPEG-4 sprite coding—comparison of integrated object segmentation algorithms. J Multimed Tools Appl 49(3):483–512

    Article  Google Scholar 

  13. Grana C, Cucchiara R (2006) “Performance of the MPEG-7 shape spectrum descriptor for 3D objects retrieval”. In: Second Italian research conference on digital library management systems, IRCDL, Italy

  14. Halabala P (2003) “Semantic metadata creation.” In: Proceedings of CESCG 2003: 7th central European seminar on computer graphics, pp. 15–25

  15. Halkos D, Doulamis N, Doulamis A (2009) A secure framework exploiting content guided and automated algorithms for real time video searching. Multimed Tools Appl 42:343–375

    Article  Google Scholar 

  16. Hejazi MR, Ho Y-S (2007) An efficient approach to texture-based image retrieval. Int J Imaging Syst Technol 17(5):295–302. doi:10.1002/ima.20120, Wiley Subscription Services

    Article  Google Scholar 

  17. Hejazi MR, Ho Y-S (2007) Efficient approach to extraction of texture browsing descriptor in MPEG-7. Electron Lett 43(13):709–711. doi:10.1049/el:20070208

    Article  Google Scholar 

  18. ISO 15938:3 Multimedia content description interface—part 3: visual. May, 2002

  19. ISO 15938:5 multimedia content description interface—part 5: multimedia description schemes. May, 2003

  20. ISO/IEC 14496-2:2001 coding of audio-video objects—part 2: visual. December, 2001

  21. Kapetanakis K, Spala P, Sympa P, Mamakis G, Malamos AG (2010) “A novel approach in converting SVG architectural data to X3D worlds.” In: Proceedings of the international conference on telecommunications and multimedia, TEMU 2010, Chania, Greece, July 14-16

  22. Koller D, Frischer B, Humphreys G (2009) “Research challenges for digital archives of 3D cultural heritage models,” journal on computing and cultural heritage (JOCCH), vol. 2, issue. 3, article 7. ACM, New York. doi:10.1145/1658346.1658347

    Google Scholar 

  23. Lee K-L, Chen L-H (2005) An efficient computation method for the texture browsing descriptor of MPEG-7. Image Vis Comput 23(5):479–489. doi:10.1016/j.imavis.2004.12.002

    Article  Google Scholar 

  24. Loewenstein Y, Raimondo D, Redfern OC, Watson J, Frishman D, Linial M, Orengo C, Thornton J, Tramontano A (2009) Protein function annotation by homology-based inference. Genome Biol 10(2):207. doi:10.1186/gb-2009-10-2-207

    Article  Google Scholar 

  25. Malamos AG, Mamakis G, Sympa P, Tsirakis M, Piperidis G, Karakechagias J, Mavraganis K, Kaliakatsos Y (2006) “VCLASS-3D: a multimedia educational collaboration platform with 3D virtual workspace support”. In: Proceedings of the 5th IASTED international conference on web-based education (WBE’06), Puerto Vallarta, pp. 19–24, ACTA Press, Anaheim, CA, USA, 23–25 January

  26. Malamos AG, Mamakis G et al. (2009) “Extending X3D-based educational platform for mathematics with multicast networking capabilities”, WBE09, 644–038, Phuket, Thailand, March 16–12

  27. Mikolajczyk K, Zisserman A, Schmid C (2003) “Shape recognition with edge-based features”. In: Proceedings of the 13th British machine vision conference (BMVC’03), pp. 779–788, Norwich, U.K

  28. Min P, Kazhdan M, Funkhouser T (2004) “A comparison of text and shape matching for retrieval of online 3D models”, research and advanced technology for digital libraries, lecture notes in computer science, vol. 3232. Springer, Berlin/Heidelberg, pp 209–220. doi:10.1007/978-3-540-30230-8_20

    Google Scholar 

  29. MPEG-7 Homepage. http://mpeg.chiariglione.org/standards/mpeg-7/mpeg-7.htm

  30. MPEG Homepage. http://mpeg.chiariglione.org/

  31. Panagiotakis C, Doulamis A, Tziritas G (2009) Equivalent key frames selection based on iso-content principle. IEEE Trans Circuits Syst Video Technol 19(3):447–451. doi:10.1109/TCSVT.2009.2013517, Circuits and Systems for Video Technology

    Article  Google Scholar 

  32. Papaleo L, Floriani L (2009) “Semantic-based segmentation and annotation of 3D models.” In: Proceedings of the 15th international conference on image analysis and processing (ICIAP ’09), Springer-Verlag, Berlin/Heidelberg, pp. 103–112, doi:10.1007/978-3-642-04146-4_13

  33. Pavlopoulos GA, Wegener A-L, Schneider R (2008) A survey of visualization tools for biological network analysis. BioData Min 1:12. doi:10.1186/1756-0381-1-12

    Article  Google Scholar 

  34. Pein RP, Amador M, Lu J, Wolfgang R (2008) “Using CBIR and semantics in 3D-model retrieval”, 8th IEEE international conference on computer and information technology, CIT 2008, pp. 173–178, 8–11, doi:10.1109/CIT.2008.4594669

  35. Pitarello F, de Faveri A “Semantic description of 3D environments: a proposal based on web standards.” In: Proceedings of the eleventh international conference on 3D web technology (Web3D ’06), Columbia, Maryland, pp. 85–95, ACM, New York, 2006, doi:10.1145/1122591.1122603

  36. Ro YM, Kim M, Kang HK, Manjunath BS, Kim J (2001) MPEG-7 homogeneous texture descriptor. ETRI J 23(2):41–51. doi:10.4218/etrij.01.0101.0201

    Article  Google Scholar 

  37. Shen Y, Ong SK, Nee AYC (2008) Product information visualization and augmentation in collaborative design. Comput-Aided Des 40(9):963–974. doi:10.1016/j.cad.2008.07.003

    Article  Google Scholar 

  38. Sikora T (2001) The MPEG-7 visual standard for content description—an overview. IEEE Trans Circuits Syst Video Technol 11(6):696–702. doi:10.1109/76.927422

    Article  MathSciNet  Google Scholar 

  39. Sylaiou S, Liarokapis F, Kotsakis K, Patias P (2009) Virtual museums, a survey and some issues for consideration. J Cult Herit 10(4):520–528. doi:10.1016/j.culher.2009.03.003

    Article  Google Scholar 

  40. Tangelder JWH, Veltkamp RC (2008) A survey of content based 3D shape retrieval methods. Multimed Tools Appl 39(3):441–471

    Article  Google Scholar 

  41. Walczak K (2008) “Flex-VR: configurable 3D web applications”. In: Proceedings of the 2008 international conference on human system interactions (HSI’08), Krakow, Poland, pp. 135–140, May 25–27, doi:10.1109/HSI.2008.4581455

  42. WEB3D CONSORTIUM, Extensible 3D (X3D) ISO/IEC 19775:2004. http://www.web3d.org/x3d/specifications/ISOIEC-19775-X3DAbstractSpecification/, 2004

  43. Yang N-C, Chang W-H, Kuo C-M, Li T-H (2008) A fast MPEG-7 dominant color extraction with new similarity measure for image retrieval. J Vis Commun Image Represent 19(2):92–105. doi:10.1016/j.jvcir.2007.05.003

    Article  Google Scholar 

  44. Zaharia T, Preteux F (2001) “3D Shape-based retrieval within the MPEG-7 framework”. In: Proceedings of the SPIE/EI conference on nonlinear image processing, SPIE/EI 2001 Zaharia T, Prêteux F

  45. Zhang L, Ma J, Xu X, Yuan B (2007) “Rotation invariant image classification based on MPEG-7 homogeneous texture descriptor”, eighth ACIS international conference on software engineering, artificial intelligence, networking, and parallel/distributed computing, 2007(SNPD 2007), 3: 798–803, July 30–Aug. 1, doi:10.1109/SNPD.2007.302

  46. Zhou NN, Deng Y-L (2009) “Virtual reality: a state-of-the-art survey”, international journal of automation and computing, institute of automation, Chinese academy of sciences, co-published with Springer-Verlag GmbH, Vol. 6, issue 4, pp. 319–325, doi:10.1007/s11633-009-0319-9

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Athanasios G. Malamos.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Spala, P., Malamos, A.G., Doulamis, A. et al. Extending MPEG-7 for efficient annotation of complex web 3D scenes. Multimed Tools Appl 59, 463–504 (2012). https://doi.org/10.1007/s11042-011-0790-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-011-0790-5

Keywords

Navigation