Abstract
The University of Trier maintains the DBLP (Digital Bibliography & Library Project) Computer Science Bibliography, which offers bibliographic information about more than 870,000 scientific publications. This paper describes the DBLP WebCrawler, a meta search engine that searches the web for a full-text PDF of the publication behind each DBLP entry. Various search engines, such as Google and Yahoo, are used as data sources. The retrieved documents are then analysed and ranked according to their relevance. The proposed system differs from systems like CiteSeer in that the DBLP WebCrawler builds upon metadata and tries to find relevant full texts, whereas CiteSeer mainly starts with full texts and extracts metadata.
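The abstract does not say how relevance is computed, so the following is only a minimal sketch of the metadata-first idea: hits from several engines are merged by URL, and each candidate PDF is scored against the known metadata. The `Candidate` type, the scoring functions, the 0.7 title weight, and the use of Levenshtein distance for title matching are all assumptions made for illustration, not the authors' method.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical search hit; field names are illustrative only."""
    url: str        # location of a PDF reported by a search engine
    engine: str     # label of the engine that returned the hit
    text_head: str  # leading text extracted from the PDF (assumed available)


def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insert, delete, and substitute."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def title_similarity(title: str, text: str) -> float:
    """Similarity in [0, 1] between the title and its best-matching text window."""
    title = title.lower().strip()
    text = " ".join(text.lower().split())  # collapse whitespace and newlines
    if not title:
        return 0.0
    if len(text) <= len(title):
        windows = [text]
    else:
        span = len(title)
        windows = [text[i:i + span] for i in range(len(text) - span + 1)]
    best = min(levenshtein(title, window) for window in windows)
    return 1.0 - best / len(title)


def author_coverage(surnames: list[str], text: str) -> float:
    """Fraction of the entry's author surnames that occur in the text."""
    if not surnames:
        return 0.0
    lowered = text.lower()
    return sum(1 for name in surnames if name.lower() in lowered) / len(surnames)


def rank(title: str, surnames: list[str], candidates: list[Candidate],
         title_weight: float = 0.7) -> list[tuple[float, Candidate]]:
    """Merge duplicate URLs across engines, then sort by descending score."""
    unique: dict[str, Candidate] = {}
    for candidate in candidates:
        unique.setdefault(candidate.url, candidate)  # keep first hit per URL
    scored = [
        (title_weight * title_similarity(title, c.text_head)
         + (1.0 - title_weight) * author_coverage(surnames, c.text_head), c)
        for c in unique.values()
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored
```

Calling `rank(title, surnames, hits)` with the title and author surnames of a bibliographic entry returns `(score, candidate)` pairs, best first. A real system would also have to fetch the PDFs and extract their text, which this sketch takes as given.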
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gajek, A., Klink, S., Reuther, P., Walter, B., Weber, A. (2007). Bibliographical Meta Search Engine for the Retrieval of Scientific Articles. In: Kovács, L., Fuhr, N., Meghini, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2007. Lecture Notes in Computer Science, vol 4675. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74851-9_42
DOI: https://doi.org/10.1007/978-3-540-74851-9_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74850-2
Online ISBN: 978-3-540-74851-9
eBook Packages: Computer Science, Computer Science (R0)