Abstract:
Integration of the bibliographical information of scholarly publications available on the Internet is an important task in academic research. Integrating information from heterogeneous reference sources in turn requires accurate reference metadata extraction. In this paper, we propose a knowledge-based approach to literature mining and focus on reference metadata extraction methods for scholarly publications. We adopt an ontological knowledge representation framework called INFOMAP to automatically extract the reference metadata. The experimental results show that, by using INFOMAP, we can extract author, title, journal, volume, number (issue), year, and page information from different reference styles with a high degree of accuracy. The overall average field accuracy of citation extraction on a bioinformatics dataset is 97.87% across six reference styles.
Published in: IRI-2005 IEEE International Conference on Information Reuse and Integration, Conf, 2005.
Date of Conference: 15-17 August 2005
Date Added to IEEE Xplore: 12 September 2005
Print ISBN: 0-7803-9093-8