skip to main content
10.1145/2967878.2967919acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicccntConference Proceedingsconference-collections
research-article

Query-oriented Unsupervised Multi-document Summarization on Big Data

Authors Info & Claims
Published:06 July 2016Publication History

ABSTRACT

Real time document summarization is a critical need nowadays, owing to the large volume of information available for our reading, and our inability to deal with this entirely due to limitations of time and resources. Oftentimes, information is available in multiple sources, offering multiple contexts and viewpoints on a single topic of interest. Automated multi-document summarization (MDS) techniques aim to address this problem. However, current techniques for automated MDS suffer from low precision and accuracy with reference to a given subject matter, when compared to those summaries prepared by humans and takes large time to create the summary when the input given is too huge. In this paper, we propose a hybrid MDS technique combining feature based algorithms and dynamic programming for generating a summary from multiple documents based on user provided query. Further, in real-world scenarios, Web search serves up a large number of URLs to users, and the work of making sense of these with reference to a particular query is left to the user. In this context, an efficient parallelized MDS technique based on Hadoop is also presented, for serving a concise summary of multiple Webpage contents for a given user query in reduced time duration.

References

  1. K.S. Jones, Automatic summarizing: the state of the art, Inf. Process. Manage.43 (6) (2007) 1449--1481. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. I. Mani, M.T. Maybury, Advances in Automatic Text Summarization, MIT Press,Cambridge, 1999, pp. 442. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Tang, L. Yao, D. Chen, Multi-topic based query-oriented summarization, in:Proceedings of the 9th SIAM International Conference on Data Mining, Nevada, USA, 30 April-2 May, 2009, pp. 1148--1159.Google ScholarGoogle Scholar
  4. X. Cai, W. Li, A spectral analysis approach to document summarization: clustering and ranking sentences simultaneously, Inf. Sci. 181 (18) (2011) 3816--3827.Google ScholarGoogle ScholarCross RefCross Ref
  5. M. Kutlu, C. Cigir, I. Cicekli, Generic text summarization for Turkish, Comput. J.53 (8) (2010) 1315--1323. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Carbonell, J. Goldstein, The use of MMR, diversity-based re-ranking for reordering documents and producing summaries, in: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 24-28 August, 1998, pp. 335--336. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. Ouyang, W. Li, S. Li, Q. Lu, Intertopic information mining for query-based summarization, J. Am. Soc. Inf. Sci. Technol. 61 (5) (2010) 1062--1072. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Kutlu, C. Cigir, I. Cicekli, Generic text summarization for Turkish, Comput. J. 53 (8) (2010) 1315--1323. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Mihalcea, Rada. "Graph-based ranking algorithms for sentence extraction, applied to text summarization." Proceedings of the ACL 2004 on Interactive poster and demonstration sessions. Association for Computational Linguistics, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kyoomarsi, Farshad, et al. "Optimizing text summarization based on fuzzy logic." Seventh IEEE/ACIS International Conference on Computer and Information Science. IEEE, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. K. Kaikhah, "Automatic Text Summarization with Neural Networks", Second IEEE International Conference on Intelligent Systems, JUNE 2004, pp. 40--44Google ScholarGoogle Scholar
  12. Amini, Massih R., Nicolas Usunier, and Patrick Gallinari. "Automatic text summarization based on word-clusters and ranking algorithms." Advances in Information Retrieval. Springer Berlin Heidelberg, 2005. 142--156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Luo, Yihui, and Shuchu Xiong. "An improvement on approximate dynamic programming for multi-document summarization." Security, Pattern Analysis, and Cybernetics (SPAC), 2014 International Conference on. IEEE, 2014.Google ScholarGoogle Scholar
  14. Zhong, Sheng-hua, et al. "Query-oriented unsupervised multi-document summarization via deep learning model." Expert Systems with Applications 42.21 (2015): 8146--8155. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lee, Hyeokju, Joon Her, and Sung-Ryul Kim. "Implementation of a large-scalable social data analysis system based on Map Reduce." Computers, Networks, Systems and Industrial Engineering (CNSI), 2011 ACIS/JNU International Conference on. IEEE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Query-oriented Unsupervised Multi-document Summarization on Big Data

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICCCNT '16: Proceedings of the 7th International Conference on Computing Communication and Networking Technologies
        July 2016
        262 pages
        ISBN:9781450341790
        DOI:10.1145/2967878

        Copyright © 2016 ACM

        © 2016 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 6 July 2016

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        ICCCNT '16 Paper Acceptance Rate48of101submissions,48%Overall Acceptance Rate48of101submissions,48%
      • Article Metrics

        • Downloads (Last 12 months)1
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader