Abstract
Document similarity calculation and summarization is a challenging task. Not many works have been done in this field for Bangla Language. Similarity calculation and summarization is more challenging for Bangla Language as Bangla grammar works differently than that of English. This paper proposes a way to calculate similarity between Bangla news and apply summarization on Bangla news documents taken from popular news portals by applying various data mining techniques as accurately as possible.
References
Uddin, M.N., Khan, S.A.: A study on text summarization techniques and implement few of them for Bangla language. In: 2007 10th International Conference on Computer and Information Technology (2007). doi:10.1109/ICCITECHN.2007.4579374
Saharia, N., Sharma, U., Kalita, J.: Stemming resource-poor Indian languages. ACM Trans. Asian Lang. Inf. Process. 13(3), 1–26 (2014)
Urmi, T.T., Jammy, J.J., Ismail, S.: A corpus based unsupervised Bangla word stemming using N-gram language model. In: 2016 5th International Conference on Informatics, Electronics and Vision (ICIEV) (2016). doi:10.1109/ICIEV.2016.7760117
Dave, H., Jaswal, S.: Multiple text document summarization system using hybrid summarization technique. In: 2015 1st International Conference on Next Generation Computing Technologies (NGCT) (2015). doi:10.1109/NGCT.2015.7375231
Baralis, E., Cagliero, L., Cerquitelli, T.: Supporting stock trading in multiple foreign markets. In: Proceedings of 2nd International Workshop on Data Science for Macro-Modeling – DSMM 2016 (2016). doi:10.1145/2951894.2951897
Dsouza, K.J., Ansari, Z.A.: A novel data mining approach for multi variant text classification. In: 2015 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM) (2015)
Bangla Word List [PDF] West Bengal Bangla Academy, Kolkata (n.d.)
List of Regular Expressions - Libreoffice Help. Help.libreoffice.org. N.p. (2017). Web: 5 May 2017
Tan, P., Steinbach, M., Kumar, V.: Introduction to Data Mining. Dorling Kindersley, Pearson, London (2015)
“আকস্মিক দেশের পথে মাশরাফি । খেলাধুলা । The Daily Ittefaq” Ittefaq.com.bd. N.p. (2017). Web: 3 May 2017
Ferreira, R., et al.: Assessing sentence scoring techniques for extractive text summarization. Expert Syst. Appl. 40(14), 5755–5764 (2013)
Ramos, J.: Using TF-IDF to determine word relevance in document queries. Technical report, Department of Computer Science, Rutgers University (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Paul, A. et al. (2017). Bangla News Summarization. In: Nguyen, N., Papadopoulos, G., Jędrzejowicz, P., Trawiński, B., Vossen, G. (eds) Computational Collective Intelligence. ICCCI 2017. Lecture Notes in Computer Science(), vol 10449. Springer, Cham. https://doi.org/10.1007/978-3-319-67077-5_46
Download citation
DOI: https://doi.org/10.1007/978-3-319-67077-5_46
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67076-8
Online ISBN: 978-3-319-67077-5
eBook Packages: Computer ScienceComputer Science (R0)