Skip to main content

Part of the book series: Studies in Computational Intelligence ((SCI,volume 324))

  • 679 Accesses

Abstract

Virtual world and other 3D Web content has been treated as a separate domain from the traditional 2D Web, but is increasingly being integrated with broader web content. However, search engines do not have the ability to crawl virtual world content directly, making it difficult for users to find relevant content. We present an intelligent agent crawler designed to collect user-generated content in the Second Life and related virtual worlds. The agents navigate autonomously through the world to discover regions, parcels of land within regions, user-created objects, and other users. The agents also interact with objects through movement or ‘touch’ to trigger scripts that present dynamic content in the form of note cards, chat text, landmark links, and web URLs. The collection service includes a focused HTML crawler for collecting linked web content. The experiments we performed are the first which focus on the content of a large virtual world. Our results show that virtual worlds can be effectively crawled using autonomous agent crawlers that emulate normal user behavior. Additionally, we find that the collection of interactive content enhances our ability to identify dynamic, immersive environments within the world.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Bowman, C.: The Harvest Information Discovery and Access System. Computer Networks and ISDN Systems 28, 119–125 (1995)

    Article  Google Scholar 

  2. Pinkerton, B.: Finding What People Want: Experiences with the WebCrawler. In: Proc. 2nd Intl. WWW Conf. (1994)

    Google Scholar 

  3. McBryan, O.: GENVL and WWWW: Tools for Taming the Web. In: Proc. 1st Intl. WWW Conf. (1994)

    Google Scholar 

  4. Brin, S., Page, L.: The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems 30, 107–117 (1998)

    Article  Google Scholar 

  5. Heydon, A., Najork, M.: Mercator: A Scalable, Extensible Web Crawler. In: World Wide Web, vol. 2, pp. 219–229 (2004)

    Google Scholar 

  6. Boldi, P., Codenotti, B., Santini, M., Vigna, S.: UbiCrawler: A Scalable Fully Distributed Web Crawler. Software: Practice and Experience 34, 711–726 (2004)

    Article  Google Scholar 

  7. Cho, J., Garcia-Molina, H.: Proc. 26th Intl. Conf. on Very Large Databases, pp. 200–209 (2000)

    Google Scholar 

  8. Hsin-Tsang, L., Leonard, D., Wang, X., Loguinov, D.: IRLbot: Scaling to 6 Billion Pages and Beyond. ACM Transactions on the Web 3(3) (2009)

    Google Scholar 

  9. Adobe Systems Inc.: Adobe Advances Rich Media Search on the Web (2008), http://www.adobe.com/aboutadobe/pressroom/pressreleases/pdfs/200806/070108AdobeRichMediaSearch.pdf (accessed January 14, 2010)

  10. Raghavan, S., Garcia-Molina, H.: Crawling the Hidden Web. In: Proc. 27th Intl. Conf. on Very Large Databases, pp. 129–138 (2001)

    Google Scholar 

  11. Wu, W., Yu, C., Doan, A., Meng, W.: An Interactive Clustering-Based Approach to Integrating Source Query Interfaces on the Deep Web. In: Proc. 2004 ACM SIGMOD Intl. Conf. on Management of Data, pp. 95–106 (2004)

    Google Scholar 

  12. He, H., Meng, W., Yu, C., Wu, Z.: Wise-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce. In: Proc. 29th Intl. Conf. on Very Large Databases, pp. 357–368 (2003)

    Google Scholar 

  13. Gauch, S., Wang, G., Gomez, M.: Profusion: Intelligent Fusion from Multiple, Distributed Search Engines. Journal of Universal Computer Science 2(9), 637–649 (1997)

    Google Scholar 

  14. Ntoulas, A., Zerfos, P., Cho, J.: Downloading Textual Hidden Web Content through Keyword Queries. In: Proc. 5th ACM/IEEE-CS Join Conference on Digital Libraries, pp. 100–109 (2005)

    Google Scholar 

  15. Yang, Z., Kuo, C.: Survey on Image Content Analysis, Indexing, and Retrieval Techniques and Status Report on MPEG-7. Tamkang Journal of Sci. and Eng. 2(3), 101–118 (1999)

    MathSciNet  Google Scholar 

  16. Lew, M., Sebe, N., Djeraba, C., Jain, R.: Content-based Multimedia Information Retrieval: State of the Art and Challenges. ACM Trans. on Multimedia Computing, Communications and Applications 2(1), 1–19 (2006)

    Article  Google Scholar 

  17. Datta, R., Joshi, D., Li, J., Wang, J.: Image Retrieval: Ideas, Influences, and Trends of the New Age. ACM Computing Surveys 40(2), 1–60 (2008)

    Article  Google Scholar 

  18. Niblack, W., et al.: QBIC Project: Querying Images by Content, Using Color, Texture, and Shape. In: Proc. SPIE (2004), doi:10.1117/12.143648

    Google Scholar 

  19. Smith, J., Chang, S.: VisualSEEK: A Fully Automated Content-Based Image Query System. In: Proc. 4th ACM Intl. Conf. on Multimedia, pp. 87–98 (1997)

    Google Scholar 

  20. Ortega-Binderberger, M., Mehrotra, S., Chakrabarti, K., Porkaew, K.: WebMARS: A Multimedia Search Engine. In: Proc Intl. Society for Optics and Photonics, vol. 3964, pp. 314–321 (1999)

    Google Scholar 

  21. Yan, R., Hauptmann, A., Jin, R.: Multimedia Search with Pseudo-Relevance Feedback. In: Proc. 2nd Intl. Conf. on Image and Video Retrieval, pp. 238–247 (2003)

    Google Scholar 

  22. Iyer, M., Jayanti, S., Lou, K., Kalyanaraman, Y., et al.: Three Dimensional Shape Searching: State-of-the-Art Reviews and Future Trends. Computer Aided Design 5(15), 509–530 (2005)

    Article  Google Scholar 

  23. Tangelder, J., Veltkamp, R.: A Survey of Content-Based 3D Shape Retrieval Methods. Multimedia Tools and Applications 39(3), 441–471 (2007)

    Article  Google Scholar 

  24. Rodriguez-Echavarria, K., Morris, D., Arnold, D.: Web Based Presentation of Semantically Tagged 3D Content for Public Sculptures and Monuments in the UK. In: Proc. 14th Intl. Conf. on 3D Web Technology, pp. 119–126 (2009)

    Google Scholar 

  25. Au, W.: New World Notes: Second Life Concurrency Exceeds 70K – Is SL’s User Growth Plateau at an End, Too? New World Notes (2008), http://nwn.blogs.com/nwn/2008/09/second-life-con.html (accessed January 15, 2010)

  26. OpenSimulator.org, OpenSim.Region.DataSnapshot. OpenSimulator (2009), http://opensimulator.org/wiki/OpenSim.Region.DataSnapshot (accessed January 15, 2010)

  27. La, C., Pietro, M.: Characterizing User Mobility in Second Life. Institute Eurocom Technical Report RR-08-212 (2008)

    Google Scholar 

  28. Varvello, M., Picconi, F., Diot, C., Biersack, E.: Is There Life in Second Life? Thomson Technical Report CR-PRL-2008-07-0002 (2008)

    Google Scholar 

  29. Gayle, R., Manocha, D.: Navigating Virtual Agents in Online Virtual Worlds. In: Proc. 13th Intl. Symposium on 3D Web Technology, pp. 53–56 (2008)

    Google Scholar 

  30. Sud, A., Anderson, E., Curtis, S., Lin, M., Manocha, D.: Real-Time PAth Planning in Dynamic Virtual Environments Using Multiagent Navigation Graphs. IEEE Transactions on Visualization and Computer Graphics 14, 526–538 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Eno, J., Gauch, S., Thompson, C.W. (2010). Agent-Based Search and Retrieval in Virtual World Environments. In: Soro, A., Vargiu, E., Armano, G., Paddeu, G. (eds) Information Retrieval and Mining in Distributed Environments. Studies in Computational Intelligence, vol 324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16089-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16089-9_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16088-2

  • Online ISBN: 978-3-642-16089-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics