Semantic Diversification of Text Search Results

Micu, Andrei; Iftene, Adrian

doi:10.1007/978-3-319-45246-3_8

Semantic Diversification of Text Search Results

Andrei Micu¹⁷ &
Adrian Iftene¹⁷

Conference paper
First Online: 20 September 2016

2021 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9876))

Abstract

Search engines are getting faster and more feature-rich year by year, striving to bring their users the information they need as fast as possible. Bringing relevant information to the user in an effortless manner is no easy task. The search feature set is where search engines compete to win their users and it usually describes in what manner a search engine may be different from others. One of the most challenging features in a search engine is to diversify the search results in a way that each result has different meaning or different content from others. The goal is to free the user from the burden of separating redundant results. What is redundant for the user is the key challenge of this feature and may have different meanings depending on the format of the information. For text searches, user’s input is commonly used for diversification of results [1]. This input may include information like the topic of search, previous searches or supplementary parameters asked by the engine. This paper describes a different approach on text diversification, based on text semantics analysis and combined with clustering algorithms. The aim is to explore how similarities from the semantic point of view can be used to eliminate redundant texts.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
ROME Tools - http://rometools.github.io/rome/.
2.
Open Calais - How Does Calais Work? - http://www.opencalais.com/about.
3.
Apache Jena - https://jena.apache.org/.

References

Welch, M.J., Cho, J., Olston, C.: Search result diversity for informational queries. In: Twentieth International World Wide Web Conference, Hyderabad, India, March (2011)
Google Scholar
Marina, D., Evaggelia, P.: Search result diversification. SIGMOD Rec. 39(1), 41–47 (2010). ACM, New York, NY, USA
Article Google Scholar
Vallet, D., Castell, P.: Personalized diversification of search result. In: SIGIR 2012, August 12–16, Portland, Oregon, USA (2012)
Google Scholar
Van, D., Bruce, C.W.: Term Level Search Result Diversification, SIGIR’13, July 28–August 1. ACM, Dublin, Ireland (2013)
Google Scholar
Iftene, A., Alboaie, L.: Diversification in an image retrieval system based on text and image processing. Comput. Sci. J. Moldova 22(66), 339–348 (2014)
MathSciNet MATH Google Scholar
Iftene, A., Alboaie, L.: Diversification in an image retrieval system, IMCS-50. In: The Third Conference of Mathematical Society of the Republic of Moldova Dedicated to the 50th Anniversary of the Foundation of the Institute of Mathematics and Computer Science, pp. 521–524, August 19–23, Chisinau, Republic of Moldova (2014)
Google Scholar
Manning, C., Schutze, H.: Foundations of Statistical NLP ch. 14. MIT Press, Cambridge (2002)
Google Scholar
Mitchell, T.: Machine Learning ch. 6.12. McGRAW Hill, Boston (1997)
Google Scholar

Download references

Acknowledgments

This work is supported by the PRIVATESKY project (Experimental development in public-private partnership for creating native Cloud platform with advanced features for data protection), from POC 2014-2020, Action 1.2.3, Partnerships for knowledge transfer.

Author information

Authors and Affiliations

Faculty of Computer Science, Alexandru Ioan Cuza University, Berthelot 16, 700483, Iasi, Romania
Andrei Micu & Adrian Iftene

Authors

Andrei Micu
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Iftene
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Adrian Iftene .

Editor information

Editors and Affiliations

Wroclaw University of Technology , Wroclaw, Poland
Ngoc Thanh Nguyen
Aristotle University of Thessaloniki , Thessaloniki, Greece
Lazaros Iliadis
Department of Forestry and Manageme, Democritus University of Thrace Department of Forestry and Manageme, Orestiada Thrace, Greece
Yannis Manolopoulos
Wrocław University of Technology , Wrocław, Poland
Bogdan Trawiński

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Micu, A., Iftene, A. (2016). Semantic Diversification of Text Search Results. In: Nguyen, N., Iliadis, L., Manolopoulos, Y., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2016. Lecture Notes in Computer Science(), vol 9876. Springer, Cham. https://doi.org/10.1007/978-3-319-45246-3_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-45246-3_8
Published: 20 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45245-6
Online ISBN: 978-3-319-45246-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics