Abstract
We design an information retrieval algorithm that mimics the stochastic behavior of decision-makers (DMs) when evaluating the alternatives displayed by an online search engine. The algorithm consists of a decision tree that incorporates all the 1024 decision nodes that may arise from the information retrieval process of DMs. We calibrate the behavior of the algorithm to the one observed from online users and run several sets of 1,000,000 queries. Each query lets DMs decide which subset of the ten alternatives composing the initial page of results to click, allowing us to evaluate their behavior as ranking reliability is assumed to decrease when DMs decide not to click on an alternative. We compare the click-through rates (CTRs) obtained when modifying the degree of ranking reliability derived from the alternatives displayed on the first page of search results. We illustrate how the stability of the CTR prevails among the top-ranked alternatives within relatively reliable scenarios while it drops when imposing large initial decrements in reliability. The resulting consequences regarding the importance of relative ranking positions are analyzed, the top three alternatives exhibiting a generally contained decrease in their CTRs that contrasts with the cumulative pattern arising from the fourth position onwards.
Similar content being viewed by others
References
Epstein R, Robertson RE (2015) The Search Engine Manipulation Effect (SEME) and its possible impact on the outcomes of elections. Proc Natl Acad Sci USA 112:E4512–E4521
Jansen MBJ, Spink A, Saracevic T (2000) Real life, real users, and real needs: a study and analysis of user queries on the Web. Inf Process Manage 36:207–227
Luo W, Cook D, Karson EJ (2011) Search advertising placement strategy: exploring the efficacy of the conventional wisdom. Inf Manag 48:404–411
Lewandowski D, Kammerer Y (2020) Factors influencing viewing behaviour on search engine results pages: a review of eye-tracking research. Behav Inf Technol. https://doi.org/10.1080/0144929X.2020.1761450
Lorigo L, Haridasan M, Brynjarsdóttir H, Xia L, Joachims T, Gay G, Granka L, Pellacini F, Pan B (2008) Eye tracking and online search: lessons learned and challenges ahead. J Am Soc Inf Sci Technol 59:1041–1052
Chitika (2013) The value of Google result positioning. Chitika Insights, June 7, 2013. Chitika, Westborough. perma.cc/7AGC-HTDH
Dean B (2019) We analyzed 5 million Google search results. Here’s what we learned about organic click through rate. https://backlinko.com/google-ctr-stats. Accessed 23 Sep 2021
Baeza-Yates R (2005) Applications of web query mining. In: Losada DE, Fernández-Luna JM (eds) Advances in information retrieval. information systems and applications, incl. Internet/Web, and HCI, vol 3408. Springer, Berlin, pp 7–22
Jansen MBJ, Spink A, Bateman J, Saracevic T (1998) Real life information retrieval: a study of user queries on the web. ACM SIGIR Forum 32:5–17
European Commission (2016) Online platforms. Special Eurobarometer No. 447, April 2016. European Union
Hendahewa C, Shah C (2017) Evaluating user search trails in exploratory search tasks. Inf Process Manage 53:905–922
Yu H, Jatowt A, Blanco R, Joho H, Jose JM (2017) Decoding multi-click search behavior based on marginal utility. Inf Retrieval J 20:25–52
Simon HA (1997) Administrative behavior. The Free Press, New York
Schneider M, Deck C, Shor M, Besedeš T, Sarangi S (2019) Optimizing choice architectures. Decis Anal 16:2–30
Speier-Pero C (2019) Using aggregated data under time pressure: a mechanism for coping with information overload. J Decis Syst 28:82–100
Victorelli EZ, Dos Reis JC, Hornung H, Prado AB (2020) Understanding human-data interaction: literature review and recommendations for design. Int J Hum Comput Stud 134:13–32
Preibusch S, Kübler D, Beresford AR (2013) Price versus privacy: an experiment into the competitive advantage of collecting less personal information. Electron Commer Res 13:423–455
Sun H, Fan M, Tan Y (2020) An empirical analysis of seller advertising strategies in an online marketplace. Inf Syst Res 31:37–56
Yoo B, Jeon S, Han T (2016) An analysis of popularity information effects: field experiments in an online marketplace. Electron Commer Res Appl 17:87–98
Bell D, Mgbemena C (2018) Data-driven agent-based exploration of customer behavior. SIMULATION 94:195–212
Dunke F, Nickel S (2020) Neural networks for the metamodeling of simulation models with online decision making. Simul Model Pract Theory 99:102016
Mahony C, Sammon D, Heavin C (2016) Design guidelines for online resources: a longitudinal analysis of information processing. J Decis Syst 25:329–342
Zhang J, Adomavicius G, Gupta A, Ketter W (2020) Consumption and performance: understanding longitudinal dynamics of recommender systems via an agent-based simulation framework. Inf Syst Res 31:76–101
Unkel J, Haas A (2017) The effects of credibility cues on the selection of search engine results. J Am Soc Inf Sci 68:1850–1862
Wang J, Yang Z, Brocato ED (2018) An investigation into the antecedents of prepurchase online search. Inf Manag 55:285–293
Power DJ, Cyphert D, Roth RM (2019) Analytics, bias, and evidence: the quest for rational decision making. J Decis Syst 28:120–137
Yi C, Jiang Z, Li X, Lu X (2019) Leveraging user-generated content for product promotion: the effects of firm-highlighted reviews. Inf Syst Res 30:711–725
Li H, Duan H, Zheng Y, Wang Q, Wang Y (2020) A CTR prediction model based on user interest via attention mechanism. Appl Intell 50:1192–1203
Wang Q, Liu F, Xing S, Zhao X (2019) Research on CTR prediction based on stacked autoencoder. Appl Intell 49:2970–2981
Qin J, Zhang W, Wu X, Jin J, Fang Y, Yu Y (2020) User behavior retrieval for Click-Through Rate prediction. In: Proceedings of the 43rd international ACM SIGIR Conference on research and development in information retrieval (SIGIR’ 20). Association for Computing Machinery, New York, pp 2347–2356
Nelson P (1970) Information and consumer behavior. J Polit Econ 78:311–329
Tavana M, Di Caprio D, Santos Arteaga FJ, O’Connor A (2015) A novel entropy-based decision support framework for uncertainty resolution in the initial subjective evaluations of experts: the NATO enlargement problem. Decis Support Syst 74:135–149
Herrmann JW (2015) Engineering decision making and risk management. Wiley, Hoboken
Pei S, Hu Q (2018) Partially monotonic decision trees. Inf Sci 424:104–117
Sagi O, Rokach L (2020) Explainable decision forest: transforming a decision forest into an interpretable tree. Inf Fusion 61:124–138
Herce-Zelaya J, Porcel C, Bernabé-Moreno J, Tejeda-Lorente A, Herrera-Viedma E (2020) New technique to alleviate the cold start problem in recommender systems using information from social media and random decision forests. Inf Sci 536:156–170
Rokach L (2016) Decision forest: twenty years of research. Inf Fusion 27:111–125
Lim S (2013) College students’ credibility judgments and heuristics concerning Wikipedia. Inf Process Manage 49:405–419
Tavana M, Santos-Arteaga FJ, Di Caprio D, Tierney K (2016) Modeling signal-based decisions in online search environments: a non-recursive forward-looking approach. Information & Management 53:207–226
Schulz F (2008) Trees with exponentially growing costs. Inf Comput 206:569–578
Gao R, Shah C (2020) Toward creating a fairer ranking in search engine results. Inf Process Manag 57:102138
Pan B, Hembrooke H, Joachims T, Lorigo L, Gay G, Granka L (2007) In Google we trust: users’ decisions on rank, position, and relevance. J Comput Mediated Commun 12:801–823
Brihaye T, Geeraerts G, Hallet M, Le Roux S (2020) On the termination of dynamics in sequential games. Inf Comput 272:104505
van Benthem J (2018) Computation as social agency: what, how and who. Inf Comput 261:519–535
Acknowledgements
Dr. Madjid Tavana is grateful for the partial support he received from the Czech Science Foundation (GAČR19-13946S) for this research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declared that they have no conflict of interest.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Di Caprio, D., Santos-Arteaga, F.J. & Tavana, M. A new algorithm for modeling online search behavior and studying ranking reliability variations. Appl Intell 52, 7529–7549 (2022). https://doi.org/10.1007/s10489-021-02856-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02856-8