Skip to main content

Semantic Diversification of Text Search Results

  • Conference paper
  • First Online:
  • 2021 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9876))

Abstract

Search engines are getting faster and more feature-rich year by year, striving to bring their users the information they need as fast as possible. Bringing relevant information to the user in an effortless manner is no easy task. The search feature set is where search engines compete to win their users and it usually describes in what manner a search engine may be different from others. One of the most challenging features in a search engine is to diversify the search results in a way that each result has different meaning or different content from others. The goal is to free the user from the burden of separating redundant results. What is redundant for the user is the key challenge of this feature and may have different meanings depending on the format of the information. For text searches, user’s input is commonly used for diversification of results [1]. This input may include information like the topic of search, previous searches or supplementary parameters asked by the engine. This paper describes a different approach on text diversification, based on text semantics analysis and combined with clustering algorithms. The aim is to explore how similarities from the semantic point of view can be used to eliminate redundant texts.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    ROME Tools - http://rometools.github.io/rome/.

  2. 2.

    Open Calais - How Does Calais Work? - http://www.opencalais.com/about.

  3. 3.

    Apache Jena - https://jena.apache.org/.

References

  1. Welch, M.J., Cho, J., Olston, C.: Search result diversity for informational queries. In: Twentieth International World Wide Web Conference, Hyderabad, India, March (2011)

    Google Scholar 

  2. Marina, D., Evaggelia, P.: Search result diversification. SIGMOD Rec. 39(1), 41–47 (2010). ACM, New York, NY, USA

    Article  Google Scholar 

  3. Vallet, D., Castell, P.: Personalized diversification of search result. In: SIGIR 2012, August 12–16, Portland, Oregon, USA (2012)

    Google Scholar 

  4. Van, D., Bruce, C.W.: Term Level Search Result Diversification, SIGIR’13, July 28–August 1. ACM, Dublin, Ireland (2013)

    Google Scholar 

  5. Iftene, A., Alboaie, L.: Diversification in an image retrieval system based on text and image processing. Comput. Sci. J. Moldova 22(66), 339–348 (2014)

    MathSciNet  MATH  Google Scholar 

  6. Iftene, A., Alboaie, L.: Diversification in an image retrieval system, IMCS-50. In: The Third Conference of Mathematical Society of the Republic of Moldova Dedicated to the 50th Anniversary of the Foundation of the Institute of Mathematics and Computer Science, pp. 521–524, August 19–23, Chisinau, Republic of Moldova (2014)

    Google Scholar 

  7. Manning, C., Schutze, H.: Foundations of Statistical NLP ch. 14. MIT Press, Cambridge (2002)

    Google Scholar 

  8. Mitchell, T.: Machine Learning ch. 6.12. McGRAW Hill, Boston (1997)

    Google Scholar 

Download references

Acknowledgments

This work is supported by the PRIVATESKY project (Experimental development in public-private partnership for creating native Cloud platform with advanced features for data protection), from POC 2014-2020, Action 1.2.3, Partnerships for knowledge transfer.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adrian Iftene .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Micu, A., Iftene, A. (2016). Semantic Diversification of Text Search Results. In: Nguyen, N., Iliadis, L., Manolopoulos, Y., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2016. Lecture Notes in Computer Science(), vol 9876. Springer, Cham. https://doi.org/10.1007/978-3-319-45246-3_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45246-3_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45245-6

  • Online ISBN: 978-3-319-45246-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics