skip to main content
10.1145/3345252.3345275acmotherconferencesArticle/Chapter ViewAbstractPublication PagescompsystechConference Proceedingsconference-collections
research-article

A Novel Document Summarization System for Albanian Language

Authors Info & Claims
Published:21 June 2019Publication History

ABSTRACT

Summarization is a Natural Language Processing application that may seem trivial to a person, but in a time where the quantity of information provided is continuously growing, the possibility of implementing a "helper" in order to summarize it, has become a necessity. Most of the existing scientific studies in automatic text summarization has been paying attention primarily to English with only some recent attempts in other major languages. To the best of our knowledge, no prior approaches handle automatic summarization for Albanian documents. This paper is proposed to fill this gap by implementing a novel extractive summarization system, designed specifically for Albanian Language. We showed experimentally that the enrichment of the summarization system with language-dependent elements improves the systems' performance and the compression rate.

References

  1. D. Reinsel, J. Gantz, and J. Rydning: The Digitization of the World: From Edge to Core, Data Age 2020, An IDC whitepaper, November 2018.Google ScholarGoogle Scholar
  2. M. Allahvari, S. Pouriyeh, M. Assefi, S. Safaei, E.D. Trippe, J.B. Gutierrez, K. Kochut: Text Summarization Techniques: A Brief Survey, 2017.Google ScholarGoogle Scholar
  3. Y. J. Kumar et al. / Journal of Computer Science 2016, 12 (4): 178.190Google ScholarGoogle Scholar
  4. M. Last and M. Litvak (2019). Language-independent Techniques for Automated Text Summarization.Google ScholarGoogle Scholar
  5. H.P. Luhn (1958). The automatic creation of literature abstracts, IBM Journal of research and development. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H.P. Edmundson (1969). New methods in automatic extracting, Journal of the ACM (JACM). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. G. DeJong (1982). An overview of the FRUMP system. In: Lehnert, W., Ringle, M. (eds.) Strategies for Natural Language Processing, pp. 149--176. Lawrence Erlbaum Associates, Hillsdale.Google ScholarGoogle Scholar
  8. Ouyang Y, Li W, Li S, Lu Q (2011). Applying regression models to query-focused multi-document summarization. Inf Process Manag 47:227--237 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M.A. Fattah (2014). A hybrid machine learning model for multi-document summarization. 592--600. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. R.Mihalcea (2005). Language Independent Extractive Summarization, American Association for Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Mihalcea and P. Tarau (2004). TextRank: Bringing Order into Texts In Proceedings of the Conference on Empirical Methods in Natural Language Processing.Google ScholarGoogle Scholar
  12. U. Hahn and I. Mani (2000). The Challenges of Automatic Summarization Computer, 33 (11) (2000), pp. 29-36 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. René Arnulfo García-Hernández and Y. Ledeneva (2009). Word Sequence Models for Single Text Summarization, Second International Conferences on Advances in Computer-Human Interactions. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. L.M. Al Qassem, D. Wang, Z. Al Mahmoud, H. Barada, A. Al-Rubaiea, N.I. Almoosa (2017). Automatic Arabic Summarization: A survey of methodologies and systems, 3rd International Conference on Arabic Computational Linguistics.Google ScholarGoogle ScholarCross RefCross Ref
  15. R. Bois, J. Leveling, L. Goeuriot, G.J.F. Jones, L. Kelly (2014). Porting a Summarizer to the French Language, 21ème Traitement Automatique des Langues Naturelles.Google ScholarGoogle Scholar
  16. A. G. Malamos, G. Mamakis and J. A. Ware (2019). Applying Statistic-based Algorithms for Automated Content Summarization in Greek language.Google ScholarGoogle Scholar
  17. L.H.M. Rino at al. (2004). A Comparison of Automatic Summarizers of Texts in Brazilian Portuguese. A.L.C. Bazzan and S. Labidi (Eds.): SBIA 2004, LNAI 3171, pp. 235--244, 2004.Google ScholarGoogle Scholar
  18. E.P. Hamp (2016). Albanian Language, Encyclopedia Britannica.Google ScholarGoogle Scholar
  19. E. Trandafili, E. K. Mece, K. Kica, H. Paci (2017). A Novel Question Answering System for Albanian Language.Google ScholarGoogle Scholar
  20. J. Sadiku and M. Biba (2012) Automatic stemming of Albanian through a rule-based approach. J. Int. Res. Publ. Lang. Individuals Soc. 6. ISSN-1313-2547Google ScholarGoogle Scholar
  21. D. Jurafsky and J.H. Martin (2018). Speech and Language Processing An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Third Edition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. Ramos (2003). Using TF-IDF to Determine Word Relevance in Document Queries.Google ScholarGoogle Scholar
  23. R. Mihalcea and H. Ceylan (2007). Explorations in Automatic Book Summarization, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 380--389.Google ScholarGoogle Scholar
  24. J. Steinberger and K. Ježek (2009). Evaluation measures for text summarization. Computing and Informatics. 28. 251--275.Google ScholarGoogle Scholar

Index Terms

  1. A Novel Document Summarization System for Albanian Language

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      CompSysTech '19: Proceedings of the 20th International Conference on Computer Systems and Technologies
      June 2019
      365 pages
      ISBN:9781450371490
      DOI:10.1145/3345252

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 21 June 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate241of492submissions,49%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader