skip to main content
10.1145/3342558.3345390acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
research-article

Searching and ranking questionnaires: an approach to calculate similarity between questionnaires

Published: 23 September 2019 Publication History

Abstract

Questionnaires are useful tools for research purposes and are generally used for collecting information about a population of interest, by focusing on different intentions. During the questionnaire project, or for sharing data purposes, it may be useful to check if there is already a questionnaire with the same intention as that being carried out. Well-designed questions can induce respondents to provide better answers. However, examining research questionnaires is not a trivial task since a question can be structured in different ways. In this paper, we propose a similarity measure to match questionnaires that are characterized by the heterogeneity of their questions and to provide a ranking method based on variations of a given query. In determining the effectiveness of this approach, we evaluated it through an experimental study, using recall, precision, f-value, MAP and NDGC, and this enabled us to obtain more effective results than other proposals.

References

[1]
2018. Apache Lucene. https://lucene.apache.org/. (Accessed: 02-2018).
[2]
Stephen Avard, Edgar Manton, Donald English, and Janet Walker. 2005. The Financial Knowledge of College Freshmen. College Student Journal 39, 2 (2005).
[3]
Ricardo Baeza-Yates and Berthier Ribeiro-Neto. 2011. Modern Information Retrieval-the concepts and technology behind search.
[4]
Markus Borg, Per Runeson, Jens Johansson, and Mika V Mäntylä. 2014. A replicated study on duplicate detection: Using Apache Lucene to search among Android defects. In Proceedings of the 8th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. ACM, 8.
[5]
Chris Buckley and Ellen M Voorhees. 2000. Evaluating evaluation measure stability. In Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 33--40.
[6]
Hung Chim and Xiaotie Deng. 2007. A new suffix tree similarity measure for document clustering. In Proceedings of the 16th international conference on World Wide Web. ACM, 121--130.
[7]
Laurence Hirsch, M Saeedi, and R Hirsch. 2007. Evolving Lucene search queries for text classification. (2007).
[8]
John A Hoxmeier and Chris DiCesare. 2000. System response time and user satisfaction: An experimental study of browser-based applications. AMCIS 2000 Proceedings (2000), 347.
[9]
Md Islam et al. 2011. Rural Women's Empowerment through Self-income Generating Activities: A Study on NGO Development Programs in Bangladesh. Journal of Global Citizenship & Equity Education 1, 1 (2011), pp-96.
[10]
Eva Maria Lakatos and Marina de Andrade Marconi. 2010. Fundamentos da metodologia científica. In Fundamentos da metodologia científica. Altas.
[11]
Christopher D Manning and Hinrich Schütze. 1999. Foundations of statistical natural language processing. MIT press.
[12]
George A Miller. 1995. WordNet: a lexical database for English. Commun. ACM 38, 11 (1995), 39--41.
[13]
Mateusz Pawlik and Nikolaus Augsten. 2011. RTED: a robust algorithm for the tree edit distance. Proceedings of the VLDB Endowment 5, 4 (2011), 334--345.
[14]
Mateusz Pawlik and Nikolaus Augsten. 2012. A Robust Algorithm for the Tree Edit Distance. (2012).
[15]
Claude François Picard. 1980. Graphs and questionnaires. Vol. 32. Elsevier.
[16]
Paul B Sheatsley, Peter H Rossi, James D Wright, and Andy B Anderson. 1983. Questionnaire construction and item writing. Handbook of survey research (1983), 195--230.
[17]
Mikhail Sheremet, Dmitry Tishkovsky, Frank Wolter, and Michael Zakharyaschev. 2005. Comparative similarity, tree automata, and diophantine equations. In International Conference on Logic for Programming Artificial Intelligence and Reasoning. Springer, 651--665.
[18]
Richard Henrique de Souza and Carina Friedrich Dorneles. 2017. QSMatching: an Approach to Calculate Similarity Between Questionnaires. The 19th International Conference on Information Integration and Web-based Applications & Services. Short paper (Dec. 2017), 5.
[19]
Rui Yang, Panos Kalnis, and Anthony KH Tung. 2005. Similarity evaluation on tree-structured data. In Proceedings of the 2005 ACM SIGMOD international conference on Management of data. ACM, 754--765.
[20]
Yuejie Zhang, Tao Zhang, and Shijie Chen. 2005. Research on Lucene-based English-Chinese Cross-Language Information Retrieval. Journal of Chinese Language and Computing 15, 1 (2005), 25--32.
[21]
Zhenjie Zhang, Marios Hadjieleftheriou, Beng Chin Ooi, and Divesh Srivastava. 2010. Bed-tree: an all-purpose index structure for string similarity search based on edit distance. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. ACM, 915--926.

Cited By

View all
  • (2024)Developing Assessment for Key Stakeholders in Pediatric Congenital Heart Disease: A Qualitative Pilot Study to Inform Designing a Medical Education Toy (Preprint)JMIR Formative Research10.2196/63818Online publication date: 30-Jun-2024

Index Terms

  1. Searching and ranking questionnaires: an approach to calculate similarity between questionnaires

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DocEng '19: Proceedings of the ACM Symposium on Document Engineering 2019
    September 2019
    254 pages
    ISBN:9781450368872
    DOI:10.1145/3342558
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 September 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Questionnaire
    2. ranking
    3. searching
    4. similarity

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    DocEng '19
    Sponsor:
    DocEng '19: ACM Symposium on Document Engineering 2019
    September 23 - 26, 2019
    Berlin, Germany

    Acceptance Rates

    DocEng '19 Paper Acceptance Rate 30 of 77 submissions, 39%;
    Overall Acceptance Rate 194 of 564 submissions, 34%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Developing Assessment for Key Stakeholders in Pediatric Congenital Heart Disease: A Qualitative Pilot Study to Inform Designing a Medical Education Toy (Preprint)JMIR Formative Research10.2196/63818Online publication date: 30-Jun-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media