Sumdoc: A Unified Approach for Automatic Text Summarization

Mudasir Mohd, Muzaffar Bashir Shah, Shabir Ahmad Bhat, Ummer Bashir Kawa, Hilal Ahmad Khanday, Abid Hussain Wani, Mohsin Altaf Wani, Rana Hashmy

doi:10.1007/978-981-10-0448-3_27

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 436)

Abstract

In this paper, we focus on the task of automatic text summarization. A lot of work has already been carried out on automatic text summarization, though most of it concerns extractive summaries. We have developed a tool that summarizes a given text, using several NLP features and machine learning techniques. We also show how WordNet can be used to obtain abstractive summarization. Our approach first extracts sentences from the given text using a ranking algorithm that ranks each sentence on the basis of many features, comprising some classical features as well as some novel ones. Then, after extracting candidate sentences, we examine some of the words and phrases and transform them into their respective simpler substitutes, so that the final summary is produced by a hybrid summarization technique.
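
The abstract does not list the ranking features or the substitution rules, so the following is only a minimal sketch of the two-stage idea (rank and extract sentences, then replace words with simpler substitutes), not the authors' implementation. The two features used here (average term frequency and sentence position), their weights, the stopword list, and the small SIMPLER table standing in for a WordNet lookup are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stand-in for a WordNet lookup: word -> simpler substitute.
SIMPLER = {"utilize": "use", "commence": "begin", "numerous": "many"}

# Small illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"the", "a", "an", "of", "to", "and", "in", "is", "are", "for", "on", "that", "it"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s.strip()]


def tokenize(sentence):
    """Lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", sentence.lower())


def rank_sentences(sentences, w_freq=1.0, w_pos=0.5):
    """Score each sentence with two assumed features:
    average document frequency of its content words, and position
    (earlier sentences score higher)."""
    counts = Counter(t for s in sentences for t in tokenize(s) if t not in STOPWORDS)
    n = len(sentences)
    scores = []
    for i, s in enumerate(sentences):
        content = [t for t in tokenize(s) if t not in STOPWORDS]
        freq = sum(counts[t] for t in content) / len(content) if content else 0.0
        pos = 1.0 - i / n
        scores.append(w_freq * freq + w_pos * pos)
    return scores


def simplify(sentence, table=SIMPLER):
    """Replace whole words found in the table, preserving initial capitalisation."""
    def repl(match):
        word = match.group(0)
        sub = table.get(word.lower())
        if sub is None:
            return word
        return sub.capitalize() if word[0].isupper() else sub
    return re.sub(r"[A-Za-z]+", repl, sentence)


def summarize(text, k=2):
    """Extract the k highest-scoring sentences (kept in original order),
    then apply word substitution to each."""
    sentences = split_sentences(text)
    scores = rank_sentences(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return " ".join(simplify(sentences[i]) for i in sorted(top))
```

Calling summarize(text, k=2) on a short passage returns the two top-ranked sentences in their original order with any words from the table replaced. A real system would presumably draw substitutes from WordNet itself and use a richer feature set than the two assumed here.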

Author information

Authors and Affiliations

Department of Computer Sciences, University of Kashmir, Srinagar, India
Mudasir Mohd, Muzaffar Bashir Shah, Shabir Ahmad Bhat, Ummer Bashir Kawa, Hilal Ahmad Khanday, Abid Hussain Wani, Mohsin Altaf Wani & Rana Hashmy

Corresponding author

Correspondence to Mudasir Mohd.

Editor information

Editors and Affiliations

Dept of Applied Sci & Eng, Indian Instit of Tech Roorkee, Roorkee, India
Millie Pant
Department of Mathematics, Indian Inst of Tech Roorkee, Roorkee, India
Kusum Deep
Chankyapuri, Rm 327, South Asian Univ, Akbar Bhawan, New Delhi, India
Jagdish Chand Bansal
Department of Mathematics and Comp Sci, Liverpool Hope University, LIVERPOOL, United Kingdom
Atulya Nagar
Department of Mathematics, National Inst of Tech Silchar, Silchar, Assam, India
Kedar Nath Das

About this paper

Cite this paper

Mudasir Mohd et al. (2016). Sumdoc: A Unified Approach for Automatic Text Summarization. In: Pant, M., Deep, K., Bansal, J., Nagar, A., Das, K. (eds) Proceedings of Fifth International Conference on Soft Computing for Problem Solving. Advances in Intelligent Systems and Computing, vol 436. Springer, Singapore. https://doi.org/10.1007/978-981-10-0448-3_27

DOI: https://doi.org/10.1007/978-981-10-0448-3_27
Published: 15 March 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0447-6
Online ISBN: 978-981-10-0448-3
eBook Packages: Engineering, Engineering (R0)
