Abstract
Virtual world and other 3D Web content has been treated as a separate domain from the traditional 2D Web, but is increasingly being integrated with broader web content. However, search engines do not have the ability to crawl virtual world content directly, making it difficult for users to find relevant content. We present an intelligent agent crawler designed to collect user-generated content in the Second Life and related virtual worlds. The agents navigate autonomously through the world to discover regions, parcels of land within regions, user-created objects, and other users. The agents also interact with objects through movement or ‘touch’ to trigger scripts that present dynamic content in the form of note cards, chat text, landmark links, and web URLs. The collection service includes a focused HTML crawler for collecting linked web content. The experiments we performed are the first which focus on the content of a large virtual world. Our results show that virtual worlds can be effectively crawled using autonomous agent crawlers that emulate normal user behavior. Additionally, we find that the collection of interactive content enhances our ability to identify dynamic, immersive environments within the world.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bowman, C.: The Harvest Information Discovery and Access System. Computer Networks and ISDN Systems 28, 119–125 (1995)
Pinkerton, B.: Finding What People Want: Experiences with the WebCrawler. In: Proc. 2nd Intl. WWW Conf. (1994)
McBryan, O.: GENVL and WWWW: Tools for Taming the Web. In: Proc. 1st Intl. WWW Conf. (1994)
Brin, S., Page, L.: The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems 30, 107–117 (1998)
Heydon, A., Najork, M.: Mercator: A Scalable, Extensible Web Crawler. In: World Wide Web, vol. 2, pp. 219–229 (2004)
Boldi, P., Codenotti, B., Santini, M., Vigna, S.: UbiCrawler: A Scalable Fully Distributed Web Crawler. Software: Practice and Experience 34, 711–726 (2004)
Cho, J., Garcia-Molina, H.: Proc. 26th Intl. Conf. on Very Large Databases, pp. 200–209 (2000)
Hsin-Tsang, L., Leonard, D., Wang, X., Loguinov, D.: IRLbot: Scaling to 6 Billion Pages and Beyond. ACM Transactions on the Web 3(3) (2009)
Adobe Systems Inc.: Adobe Advances Rich Media Search on the Web (2008), http://www.adobe.com/aboutadobe/pressroom/pressreleases/pdfs/200806/070108AdobeRichMediaSearch.pdf (accessed January 14, 2010)
Raghavan, S., Garcia-Molina, H.: Crawling the Hidden Web. In: Proc. 27th Intl. Conf. on Very Large Databases, pp. 129–138 (2001)
Wu, W., Yu, C., Doan, A., Meng, W.: An Interactive Clustering-Based Approach to Integrating Source Query Interfaces on the Deep Web. In: Proc. 2004 ACM SIGMOD Intl. Conf. on Management of Data, pp. 95–106 (2004)
He, H., Meng, W., Yu, C., Wu, Z.: Wise-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce. In: Proc. 29th Intl. Conf. on Very Large Databases, pp. 357–368 (2003)
Gauch, S., Wang, G., Gomez, M.: Profusion: Intelligent Fusion from Multiple, Distributed Search Engines. Journal of Universal Computer Science 2(9), 637–649 (1997)
Ntoulas, A., Zerfos, P., Cho, J.: Downloading Textual Hidden Web Content through Keyword Queries. In: Proc. 5th ACM/IEEE-CS Join Conference on Digital Libraries, pp. 100–109 (2005)
Yang, Z., Kuo, C.: Survey on Image Content Analysis, Indexing, and Retrieval Techniques and Status Report on MPEG-7. Tamkang Journal of Sci. and Eng. 2(3), 101–118 (1999)
Lew, M., Sebe, N., Djeraba, C., Jain, R.: Content-based Multimedia Information Retrieval: State of the Art and Challenges. ACM Trans. on Multimedia Computing, Communications and Applications 2(1), 1–19 (2006)
Datta, R., Joshi, D., Li, J., Wang, J.: Image Retrieval: Ideas, Influences, and Trends of the New Age. ACM Computing Surveys 40(2), 1–60 (2008)
Niblack, W., et al.: QBIC Project: Querying Images by Content, Using Color, Texture, and Shape. In: Proc. SPIE (2004), doi:10.1117/12.143648
Smith, J., Chang, S.: VisualSEEK: A Fully Automated Content-Based Image Query System. In: Proc. 4th ACM Intl. Conf. on Multimedia, pp. 87–98 (1997)
Ortega-Binderberger, M., Mehrotra, S., Chakrabarti, K., Porkaew, K.: WebMARS: A Multimedia Search Engine. In: Proc Intl. Society for Optics and Photonics, vol. 3964, pp. 314–321 (1999)
Yan, R., Hauptmann, A., Jin, R.: Multimedia Search with Pseudo-Relevance Feedback. In: Proc. 2nd Intl. Conf. on Image and Video Retrieval, pp. 238–247 (2003)
Iyer, M., Jayanti, S., Lou, K., Kalyanaraman, Y., et al.: Three Dimensional Shape Searching: State-of-the-Art Reviews and Future Trends. Computer Aided Design 5(15), 509–530 (2005)
Tangelder, J., Veltkamp, R.: A Survey of Content-Based 3D Shape Retrieval Methods. Multimedia Tools and Applications 39(3), 441–471 (2007)
Rodriguez-Echavarria, K., Morris, D., Arnold, D.: Web Based Presentation of Semantically Tagged 3D Content for Public Sculptures and Monuments in the UK. In: Proc. 14th Intl. Conf. on 3D Web Technology, pp. 119–126 (2009)
Au, W.: New World Notes: Second Life Concurrency Exceeds 70K – Is SL’s User Growth Plateau at an End, Too? New World Notes (2008), http://nwn.blogs.com/nwn/2008/09/second-life-con.html (accessed January 15, 2010)
OpenSimulator.org, OpenSim.Region.DataSnapshot. OpenSimulator (2009), http://opensimulator.org/wiki/OpenSim.Region.DataSnapshot (accessed January 15, 2010)
La, C., Pietro, M.: Characterizing User Mobility in Second Life. Institute Eurocom Technical Report RR-08-212 (2008)
Varvello, M., Picconi, F., Diot, C., Biersack, E.: Is There Life in Second Life? Thomson Technical Report CR-PRL-2008-07-0002 (2008)
Gayle, R., Manocha, D.: Navigating Virtual Agents in Online Virtual Worlds. In: Proc. 13th Intl. Symposium on 3D Web Technology, pp. 53–56 (2008)
Sud, A., Anderson, E., Curtis, S., Lin, M., Manocha, D.: Real-Time PAth Planning in Dynamic Virtual Environments Using Multiagent Navigation Graphs. IEEE Transactions on Visualization and Computer Graphics 14, 526–538 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Eno, J., Gauch, S., Thompson, C.W. (2010). Agent-Based Search and Retrieval in Virtual World Environments. In: Soro, A., Vargiu, E., Armano, G., Paddeu, G. (eds) Information Retrieval and Mining in Distributed Environments. Studies in Computational Intelligence, vol 324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16089-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-16089-9_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16088-2
Online ISBN: 978-3-642-16089-9
eBook Packages: EngineeringEngineering (R0)