ABSTRACT
Citation classification is the task of assigning a category to a reference or citation. The current sets of categories or classes proposed in the literature vary in size and they are based on the analysis of a small sample of citation sentences. We are developing a process to automatically generate such categories and base them on the analysis of a large corpus of papers. Part of the generation process involves selecting the main verb relevant to the reference being cited in the sentence. In this paper we present our recently developed technique that automatically identifies the relevant verb in a citation sentence. The technique uses heuristic rules, which are dependent on the results of a semantic role labeler. Four test sets were collected, and the common annotations of the test sets annotated by three people were used to assess the accuracy of the rules. Through experimentation we show that the average accuracy achieved using our technique that automatically extracts verbs from citation sentences across the four test sets is reasonable at 75%.
- A. Bjorkelund, B. Bohnet, L. Hafdell, and P. Nugues. A high-performance syntactic and semantic dependency parser. In Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations, pages 33--3 Association for Computational Linguistics, 2010. Google ScholarDigital Library
- C. Cortes and V. Vapnik. Support-vector networks. Machine Learning, 20:273--297, 1995. Google ScholarDigital Library
- M. Davies and J. L. Fleiss. Measuring agreement for multinomial data. Biometrics, pages 1047--1051, 1982.Google Scholar
- A. M. Green. Kappa statistics for multiple raters using categorical classifications. In Proceedings of the 22nd annual SAS User Group International conference, pages 1110--1115, 1997.Google Scholar
- X. Liu, B. Han, K. Li, S. H. Stiller, and M. Zhou. Srl-based verb selection for esl. In Proceedings of the 2010 conference on empirical methods in natural language processing}, pages 1068--1076. Association for Computational Linguistics, 2010. Google ScholarDigital Library
- M. Palmer, D. Gildea, and N. Xue. Semantic Role Labeling. Synthesis Lectures on Human Language Technologies Series. Morgan & Claypool, 2010. Google ScholarDigital Library
- D. Shen and M. Lapata. Using semantic roles to improve question answering. In Proceedings of EMNLP-CoNLL, pages 12--21, 2007.Google Scholar
- D. Wang, T. Li, S. Zhu, and C. Ding. Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, pages 307--314, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
- D. Wu and P. Fung. Can semantic role labeling improve smt. In Proceedings of the 13th Annual Conference of the EAMT, pages 218--225, 2009.Google Scholar
Index Terms
- Verb selection using semantic role labeling for citation classification
Recommendations
Journal self-citation study for semiconductor literature: synchronous and diachronous approach
Special issue: InformetricsThe present study investigates the self-citations of the most productive semiconductor journals by synchronous (self-citing rate) and diachronous (self-cited rate) approaches. Journal's productivity of 100 most productive semiconductor journals was ...
Leveraging full-text article exploration for citation analysis
AbstractScientific articles often include in-text citations quoting from external sources. When the cited source is an article, the citation context can be analyzed by exploring the article full-text. To quickly access the key information, researchers are ...
Citation success of different publication types: a case study on all references in psychology publications from the German-speaking countries (D---A---CH---L---L) in 2009, 2010, and 2011
Scientometric data on the citation success of different publication types and publication genres in psychology publications are presented. Data refer to references that are cited in these scientific publications and that are documented in PSYNDEX, the ...
Comments