Abstract
The number of electronic journal articles is growing faster than ever, and information is being generated faster than people can process it. To address this problem, many electronic periodical databases offer keyword search to reduce the time and effort users spend searching journal archives. However, users must still sift through a huge number of search results. How to provide an efficient search, i.e., one that presents the search results in categories, has therefore become an important research issue. If search results are classified and displayed by topic, users can find papers of interest quickly. Traditional topic detection methods, however, use only word frequencies and ignore the importance of semantics. In addition, the bibliographic structures of an article (e.g., Title, Keyword, and Abstract) carry particular importance. This paper therefore describes a topic detection method that uses bibliographic structures and semantic properties to extract important words and cluster the scholarly literature. Experimental results show that the proposed method outperforms the traditional method.
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, HC., Huang, TH., Guo, JL., Li, SC. (2009). Journal Article Topic Detection Based on Semantic Features. In: Chien, BC., Hong, TP., Chen, SM., Ali, M. (eds) Next-Generation Applied Intelligence. IEA/AIE 2009. Lecture Notes in Computer Science, vol 5579. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02568-6_65
DOI: https://doi.org/10.1007/978-3-642-02568-6_65
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02567-9
Online ISBN: 978-3-642-02568-6
eBook Packages: Computer Science, Computer Science (R0)