Finding Hidden Semantics Behind Reference Linkages : An Ontological Approach for Scientific Digital Libraries

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3453)

Abstract

The contents and topologies of inter-document linkages, such as citations and references among scientific papers, have received increasing research interest in recent years. Several techniques that exploit this information have been studied and applied to improve the organization, analysis, and evaluation of scientific digital libraries. In this paper, we present a CiteSeer-like system that provides access to scientific papers in the computer science discipline through reference linking. Moreover, the implicit semantics behind reference indices are mined and organized to improve the accessibility of scientific papers. To model scientific literature and its interlinked relationships, we develop a domain-specific ontology for analyzing the contents and citation anchor contexts of scientific papers. In contrast to the abstract of a paper, which is written by its own authors, we introduce an automatic summary generation algorithm that creates objective descriptions from other scholars’ perspectives based on the ontology. Semantic queries can also be posed to discover interesting patterns in scientific libraries and thereby provide comprehensive and meaningful guidance for users.
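
The idea of describing a paper through the sentences in which other scholars cite it can be illustrated with a small sketch. This is not the paper's ontology-based algorithm, which the abstract does not specify; it is a minimal stand-in that assumes citation anchors appear as `[key]` markers and ranks the citing sentences by simple term frequency. The corpus, the marker format, the stopword list, and the function names `anchor_contexts`, `tokens`, and `summarize` are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical toy corpus: citing-paper id -> full text. The [key] anchor
# format is an assumption made for this illustration only.
CITING_TEXTS = {
    "paperA": "Link analysis is widely used. HITS [kleinberg98] ranks pages "
              "by hubs and authorities. We extend it.",
    "paperB": "Early work [kleinberg98] introduced hubs and authorities "
              "for web search. Our data differs.",
}

# Small illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"the", "a", "an", "and", "by", "for", "is", "it", "of",
             "we", "our", "in", "to"}


def anchor_contexts(texts, key):
    """Return every sentence that contains the citation anchor [key]."""
    marker = f"[{key}]"
    contexts = []
    for text in texts.values():
        # Split on whitespace that follows sentence-ending punctuation.
        for sentence in re.split(r"(?<=[.!?])\s+", text):
            if marker in sentence:
                contexts.append(sentence.strip())
    return contexts


def tokens(sentence):
    """Lowercase content words of a sentence, with anchors removed."""
    without_anchors = re.sub(r"\[[^\]]*\]", " ", sentence)
    words = re.findall(r"[a-z]+", without_anchors.lower())
    return [w for w in words if w not in STOPWORDS]


def summarize(texts, key, top_n=1):
    """Pick the citing sentences whose words are most shared across contexts."""
    contexts = anchor_contexts(texts, key)
    freq = Counter(w for s in contexts for w in tokens(s))
    ranked = sorted(
        contexts,
        key=lambda s: sum(freq[w] for w in set(tokens(s))),
        reverse=True,
    )
    return ranked[:top_n]


if __name__ == "__main__":
    for sentence in summarize(CITING_TEXTS, "kleinberg98"):
        print(sentence)
```

Summing raw frequencies favors longer sentences; a real system would normalize the score and, as the abstract describes, draw on a domain ontology rather than surface word counts.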

This research is funded in part by NSFC grant 90412010 as well as grant 60221120144.

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhao, P., Zhang, M., Yang, D., Tang, S. (2005). Finding Hidden Semantics Behind Reference Linkages : An Ontological Approach for Scientific Digital Libraries. In: Zhou, L., Ooi, B.C., Meng, X. (eds) Database Systems for Advanced Applications. DASFAA 2005. Lecture Notes in Computer Science, vol 3453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408079_64

  • DOI: https://doi.org/10.1007/11408079_64

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25334-1

  • Online ISBN: 978-3-540-32005-0

  • eBook Packages: Computer Science, Computer Science (R0)
