ABSTRACT
We present a framework for the automatic generation of links based on salient semantic structures extracted from homogeneous web repositories, and discuss an imple-mentation of the framework. For this study, we consider homogeneous the repositories of the eClass, an instrumented environment that automatically captures details of a lecture and provides effective multimedia-enhanced web-based in-terfaces for users to review the lecture, and the CoWeb, a web-based service for collaborative authoring of web-based material. We exploited Latent Semantic Analysis over data indexed by a general public license search engine. We exper-imented our service with data from a graduate course sup-ported by both eClass and CoWeb repositories. We present the results of the Latent Semantic Analysis linking service in the light of results previously obtained with our previous works.
- 1.G. Abowd. Classroom 2000: an experience with the instrumentation of a living educational environment. IBM Systems Journal, 38:508 - 530, 1999. Google ScholarDigital Library
- 2.G. D. Abowd, C. G. Atkeson, J. A. Brotherton, T. Enqvist, P. A. Gully, and J. Lemon. Investigating the capture, integration and access problem of ubiquitous computing in an educational setting. In Proceedings of the ACM CHI'98, pages 440 - 447, 1998. Google ScholarDigital Library
- 3.G. D. Abowd, M. G. C. Pimentel, B. Kerimbaev, Y. Ishiguro, and M. Guzdial. Anchoring discussion in lecture: an approach to collaboratively extending classroom digital media. In Proceedings of the Computer Support for Collaborative Learning (CSCL) Conference, pages 11 - 19, Stanford University, 1999. Google ScholarDigital Library
- 4.J. Allan. Automatic hypertext link typing. In Proceedings of the Seventh ACM Conference on Hypertext, pages 42 - 52, 1996. Google ScholarDigital Library
- 5.K. Borner. Extracting and visualizing semantic structures in retrieval results for browsing. In Proceedings of the Fifth ACM Conference on ACM 2000 Digital Libraries, pages 234 - 235, 2000. Google ScholarDigital Library
- 6.J. A. Brotherton, J. Bhalodia, and G. D. Abowd. Automated capture, integration, and visualization of multiple media streams. In Proceedings of the IEEE Multimedia'98, pages 54 - 63, 1998. Google ScholarDigital Library
- 7.W. W. W. Consortium. Resource description framework (RDF) model and syntax specification. Internet, Feb 1999. http://www.w3.org/TR/REC-rdf-syntax.Google Scholar
- 8.F. Crestani. Exploiting the similarity of nonmatching terms at retrieval time. Journal of Information Retrieval, 2:25 - 45, 2000. Google ScholarDigital Library
- 9.S. T. Dumais, G. W. Furnas, T. K. Landauer, S. Deerwester, and R. Harshman. Using latent semantic analysis to improve access to textual information. In Conference Proceedings on Human Factors in Computing Systems, pages 281 - 285, 1998. Google ScholarDigital Library
- 10.R. F. S. Filho, A. J. M. Traina, C. T. Jr., and C. Faloutsos. Similarity search without tears: The omni-family of all-purpose access methods. 17th IEEE Intl. Conference on Data Engineering, pages 623 - 630, 2001. Google ScholarDigital Library
- 11.G. W. Furnas, S. Deerwester, S. T. Dumais, T. K. Landauer, R. A. Harshman, L. A. Streeter, and K. E. Lochbaum. Information retrieval using a singular value decomposition model of latent semantic structure. In Proceedings of the Eleventh International Conference on Research & Development in Information Retrieval, pages 465 - 480, 1988. Google ScholarDigital Library
- 12.G. Golovchinsky. What the query told the link: The integrations of hypertext and information retrieval. In Proceedings of the ACM Conference on Hypertext'97, pages 30 - 39, 1997. Google ScholarDigital Library
- 13.M. Group. Mnogosearchtm web search engine software. Internet, 2001. URL: http://www.mnogosearch.ru.Google Scholar
- 14.M. Guzdial. Supporting learners as users. The Journal of Computer Documentation, 23(2):3 - 13, 1999. Google ScholarDigital Library
- 15.C. H. Papadimitriou, H. Tamaki, P. Raghavan, and S. Vempala. Latent semantic indexing: a probabilistic analysis. In Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pages 159 - 168, 1998. Google ScholarDigital Library
- 16.M. G. C. Pimentel, Y. I. B. Kerimbaev, G. D. Abowd, and M. Guzdial. Supporting long-term educational activities through dynamic web interfaces. Interacting With Computers Journal, 13:353 - 374, 2001.Google Scholar
- 17.M. G. C. Pimentel, A. A. Macedo, and G. D. Abowd. Linking homogeneous web-based repositories. In Proceedings of International Workshop on Information Integration on the Web, pages 35 - 42, Rio de Janeiro-Brazil, 2001. URL: http://www.cos.ufrj.br/wiiw/schedule.html.Google Scholar
- 18.M. N. Price, G. Golovchinsky, and B. N. Schilit. Linking by inking: trailblazing in a paper-like hypertext. In Proceedings of ACM Conference on Hypertext'98, pages 30 - 39, 1998. Google ScholarDigital Library
- 19.G. Salton. A blueprint for automatic indexing. ACM SIGIR Forum, 16(2):22 - 38, 1981. Google ScholarDigital Library
- 20.G. Salton. Another look at automatic text-retrieval systems. Commun. ACM 29, 7:648 - 656, 1986. Google ScholarDigital Library
- 21.G. Salton and J. Allan. Selective text utilization and text transversal. In Proceedings of the ACM Conference on Hypertext'93, pages 131 - 144, 1993. Google ScholarDigital Library
- 22.I. Silva, B. Ribeiro-Neto, P. Calado, E. Moura, and N. Ziviani. Link-based and content-based evidential information in a belief network model. In Proceedings of ACM SIGIR'00, pages 96 - 103, 2000. Google ScholarDigital Library
- 23.R. Soto. Learning and performing by exploration: label quality measured by latent semantic analysis. In Proceedings of the Conference on Human Factors in Computing Systems, pages 418 - 425, 1999. Google ScholarDigital Library
Index Terms
- Latent semantic linking over homogeneous repositories
Recommendations
An infrastructure for open latent semantic linking
HYPERTEXT '02: Proceedings of the thirteenth ACM conference on Hypertext and hypermediaThe more the web grows, the harder it is for users to find the information they need. As a result, it is even more difficult to identify when documents are related. To find out that two or more documents are in fact related, users have to navigate by ...
A look at some issues during textual linking of homogeneous web repositories
DocEng '04: Proceedings of the 2004 ACM symposium on Document engineeringInteracting with services that create links automatically via Web users are able to identify relationships among documents stored in different repositories. The fact that automatic linking services do not use queries performed by a human user has impact ...
Incremental probabilistic Latent Semantic Analysis for video retrieval
Recent research trends in Content-based Video Retrieval have shown topic models as an effective tool to deal with the semantic gap challenge. In this scenario, this paper has a dual target: (1) it is aimed at studying how the use of different topic ...
Comments