research-article

CloseUp—A Community-Driven Live Online Search Engine

Authors:

Christian Von Der Weth,

Abhinav R. Kashyap,

Mohan S. Kankanhalli

ACM Transactions on Internet Technology (TOIT), Volume 19, Issue 3

Article No.: 39, Pages 1 - 21

https://doi.org/10.1145/3301442

Published: 27 August 2019

Abstract

Search engines are still the most common way of finding information on the Web. However, they are largely unable to provide satisfactory answers to time- and location-specific queries. Such queries can best, and often only, be answered by people who are currently on-site. Although online platforms for community question answering are very popular, very few of them take users’ current physical locations into account. In this article, we present CloseUp, our prototype for the seamless integration of community-driven live search into a Google-like search experience. Our efforts focus on overcoming the defining differences between traditional Web search and community question answering, namely the formulation of search requests (keyword-based queries vs. well-formed questions) and the expected response times (milliseconds vs. minutes/hours). To this end, the system features a deep learning pipeline that analyzes submitted queries and translates relevant queries into questions. Searching users can submit the suggested questions to a community of mobile users. CloseUp provides a stand-alone mobile application for submitting, browsing, and replying to questions. Replies from mobile users are presented as live results in the search interface. Using a field study, we evaluated the feasibility and practicability of our approach.
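
The flow described in the abstract (analyze a query, translate a relevant query into a question, post it to the community, collect replies as live results) can be illustrated with a minimal sketch. It is not the CloseUp implementation: the keyword check and the fixed template are simple stand-ins for the deep learning pipeline, and every name here (`LIVE_CUES`, `is_live_query`, `to_question`, `Community`, `search`) is hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical cue words; the article describes a learned classifier instead.
LIVE_CUES = {"now", "currently", "today", "queue", "crowded", "open"}


def is_live_query(query: str) -> bool:
    """Stand-in for query analysis: flag queries that look time-specific."""
    return any(token in LIVE_CUES for token in query.lower().split())


def to_question(query: str) -> str:
    """Stand-in for query-to-question translation: a fixed template."""
    return f"Can someone on-site tell me about: {query.strip()}?"


@dataclass
class Question:
    text: str
    replies: List[str] = field(default_factory=list)


@dataclass
class Community:
    """In-memory stand-in for the backend shared with the mobile app."""
    questions: List[Question] = field(default_factory=list)

    def submit(self, text: str) -> Question:
        question = Question(text)
        self.questions.append(question)
        return question


def search(query: str, community: Community) -> Optional[Question]:
    """Post a question for live queries; return None for ordinary ones."""
    if not is_live_query(query):
        return None
    return community.submit(to_question(query))


# Usage: a live query is posted, and a mobile user's reply becomes a live result.
community = Community()
posted = search("is the pool crowded now", community)
if posted is not None:
    posted.replies.append("Only a few people in the water right now.")
```

In this sketch an ordinary query returns `None` and would fall through to regular Web results, which mirrors the split between millisecond-scale search and the slower community answers that the abstract describes.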


Index Terms

CloseUp—A Community-Driven Live Online Search Engine
1. Human-centered computing
  1. Collaborative and social computing
2. Information systems
  1. World Wide Web
    1. Web applications
      1. Crowdsourcing
    2. Web searching and information discovery


Published In

ACM Transactions on Internet Technology Volume 19, Issue 3

Special Section on Advances in Internet-Based Collaborative Technologies

August 2019

289 pages

ISSN:1533-5399

EISSN:1557-6051

DOI:10.1145/3329912

Editor:
Ling Liu
Georgia Institute of Technology, USA

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 August 2019

Accepted: 01 November 2018

Revised: 01 October 2018

Received: 01 January 2018

Published in TOIT Volume 19, Issue 3

Qualifiers

Research-article
Research
Refereed

Funding Sources

National Research Foundation, Prime Minister's Office, Singapore, under its Strategic Capability Research Centres Funding Initiative
