Editorial Notes
NOTICE OF CONCERN: ACM has received evidence that casts doubt on the integrity of the peer review process for the ICIA 2016 Conference. As a result, ACM is issuing a Notice of Concern for all papers published and strongly suggests that the papers from this Conference not be cited in the literature until ACM's investigation has concluded and final decisions have been made regarding the integrity of the peer review process for this Conference.
ABSTRACT
Summarization is the process of reducing a text document to create a summary that retains the most important points of the original document. Extractive summarizers work on the given text to extract sentences that best convey the message hidden in the text. Most extractive summarization techniques revolve around the concept of finding keywords and extracting sentences that have more keywords than the rest. Keyword extraction usually is done by extracting relevant words having a higher frequency than others, with stress on important ones'. Manual extraction or annotation of keywords is a tedious process brimming with errors involving lots of manual effort and time. In this paper, we proposed an algorithm to extract keyword automatically for text summarization in e-newspaper datasets. The proposed algorithm is compared with the experimental result of articles having the similar title in four different e-Newspapers to check the similarity and consistency in summarized results.
- M. Banko and R. C. Moore. Part of speech tagging in context. In Proceedings of the 20th International Conference on Computational Linguistics, COLING '04, Stroudsburg, PA, USA, 2004. Association for Computational Linguistics. Google ScholarDigital Library
- R. Barzilay and M. Elhadad. Using lexical chains for text summarization. Advances in automatic text summarization, pages 111--121, 1999.Google Scholar
- S. O. CATEGORIZED. Keyword extraction based summarization of categorized kannada text documents. 2011.Google Scholar
- L.-F. Chien. Pat-tree-based keyword extraction for chinese information retrieval. In ACM SIGIR Forum, volume 31, pages 50--58. ACM, 1997. Google ScholarDigital Library
- J. D. Cohen et al. Highlights: Language- and domain-independent automatic indexing terms for abstracting. JASIS, 46(3):162--174, 1995. Google ScholarDigital Library
- J. M. Conroy and D. P. O'leary. Text summarization via hidden markov models. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 406--407. ACM, 2001. Google ScholarDigital Library
- G. Ercan and I. Cicekli. Using lexical chains for keyword extraction. Information Processing & Management, 43(6):1705--1714, 2007. Google ScholarDigital Library
- G. Erkan and D. R. Radev. Lexrank: graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research, pages 457--479, 2004. Google ScholarDigital Library
- E. Frank, G. W. Paynter, I. H. Witten, C. Gutwin, and C. G. Nevill-Manning. Domain-specific keyphrase extraction. 1999.Google Scholar
- M. J. Giarlo. A comparative analysis of keyword extraction techniques. 2005.Google Scholar
- E. Hovy and C.-Y. Lin. Automated text summarization and the summarist system. In Proceedings of a workshop on held at Baltimore, Maryland: October 13-15, 1998, pages 197--214. Association for Computational Linguistics, 1998. Google ScholarDigital Library
- A. Hulth. Improved automatic keyword extraction given more linguistic knowledge. In Proceedings of the 2003 conference on Empirical methods in natural language processing, pages 216--223. Association for Computational Linguistics, 2003. Google ScholarDigital Library
- J. K. Humphreys. Phraserate: An html keyphrase extractor. Dept. of Computer Science, University of California, Riverside, California, USA, Tech. Rep, 2002.Google Scholar
- S. Lappin and H. J. Leass. An algorithm for pronominal anaphora resolution. Computational linguistics, 20(4):535--561, 1994. Google ScholarDigital Library
- M. Litvak and M. Last. Graph-based keyword extraction for single-document summarization. In Proceedings of the workshop on Multi-source Multilingual Information Extraction and Summarization, pages 17--24. Association for Computational Linguistics, 2008. Google ScholarDigital Library
- H. P. Luhn. A statistical approach to mechanized encoding and searching of literary information. IBM Journal of research and development, 1(4):309--317, 1957. Google ScholarDigital Library
- I. Mani and M. T. Maybury. Advances in automatic text summarization, volume 293. MIT Press, 1999. Google ScholarDigital Library
- M. P. Marcus, M. A. Marcinkiewicz, and B. Santorini. Building a large annotated corpus of english: The penn treebank. Computational linguistics, 19(2):1993, pp. 313--330. Google ScholarDigital Library
- Y. Matsuo and M. Ishizuka. Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools, 13(01):157--169, 2004.Google ScholarCross Ref
- A. K. Mondal and D. K. Maji. Improved algorithms for keyword extraction and headline generation from unstructured text. First Journal publication from SIMPLE groups, CLEAR Journal, 2013.Google Scholar
- J. Ramos. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, 2003.Google Scholar
- G. Salton, A. Singhal, M. Mitra, and C. Buckley. Automatic text structuring and summarization. Information Processing & Management, 33(2): 193--207, 1997. Google ScholarDigital Library
- P. Turney. Learning to extract keyphrases from text. 1999.Google Scholar
- L. van der Plas, V. Pallotta, M. Rajman, and H. Ghorbel. Automatic keyword extraction from spoken text. a comparison of two lexical resources: the edr and wordnet. arXiv preprint cs/0410062, 2004.Google Scholar
- I. H. Witten, G. W. Paynter, E. Frank, C. Gutwin, and C. G. Nevill-Manning. Kea: Practical automatic keyphrase extraction. In Proceedings of the fourth ACM conference on Digital libraries, pages 254--255. ACM, 1999. Google ScholarDigital Library
- C. Zhang. Automatic keyword extraction from documents using conditional random fields. Journal of Computational Information Systems, 4(3):1169--1180, 2008.Google Scholar
- K. Zhang, H. Xu, J. Tang, and J. Li. Keyword extraction using support vector machine. In Advances in Web-Age Information Management, pages 85--96. Springer, 2006.Google ScholarDigital Library
Recommendations
Recent automatic text summarization techniques: a survey
As information is available in abundance for every topic on internet, condensing the important information in the form of summary would benefit a number of users. Hence, there is growing interest among the research community for developing new ...
Multi-document Hyperedge-based Ranking for Text Summarization
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementIn a multi-document settings, graph-based extractive summarization approaches build a similarity graph out of sentences in each cluster of documents then use graph centrality approaches to measure the importance of sentences. The similarity is computed ...
Sentiment diversification for short review summarization
WI '17: Proceedings of the International Conference on Web IntelligenceWith the abundance of reviews published on the Web about a given product, consumers are looking for ways to view major opinions that can be presented in a quick and succinct way. Reviews contain many different opinions, making the ability to show a ...
Comments