Skip to main content
Log in

Ranking answers of comparative questions using heterogeneous information organization from social media

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

One of the most typical subjective questions in CQA Web sites is comparative question. Comparative question answering is quite different from the factual question answering. First, it aims to generate opinions, conclusions, reasons, etc, rather than only facts. Second, the answers are usually summarized from different domains (or aspects). Despite the success of the previous answer ranking approaches, none of them can be adaptive well to ranking comparative question answers. They overlook the external knowledge as semantic expansions, mining the related domain terms of the questions as clues for ranking comparative question answers. In this paper, we employ the heterogeneous information organization technique as the external knowledge generator and propose a domain space model-based ranking scheme to integrate the domain terms in questions for answer re-ranking. Through empirical comparisons, the proposed approach significantly outperforms the state-of-the-art (SOTA) approaches. Meanwhile, we also verify that our proposed approach is robust to the noise in domain term set.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. http://answers.yahoo.com/.

  2. http://wiki.answers.com/.

  3. http://www.quora.com/.

  4. http://www.bing.com/.

  5. http://wordnet.princeton.edu/.

  6. http://dumps.wikimedia.org/.

  7. http://www.google.com/blogsearch.

References

  1. Agichtein, E., Castillo, C., Donato, D., Gionis, A., Mishne, G.: Finding high-quality content in social media. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, WSDM ’08, pp. 183–194. ACM, New York, NY, USA (2008). https://doi.org/10.1145/1341531.1341557

  2. Bian, J., Liu, Y., Agichtein, E., Zha, H.: Finding the right facts in the crowd: factoid question answering over social media. In: Proceedings of the 17th International Conference on World Wide Web, WWW ’08, pp. 467–476. ACM, New York, NY, USA (2008). https://doi.org/10.1145/1367497.1367561

  3. Bian, J., Liu, Y., Zhou, D., Agichtein, E., Zha, H.: Learning to recognize reliable users and content in social media with coupled mutual reinforcement. In: Proceedings of the 18th International Conference on World wide web, WWW ’09, pp. 51–60. ACM, New York, NY, USA (2009). https://doi.org/10.1145/1526709.1526717

  4. Bilotti, M.W., Ogilvie, P., Callan, J., Nyberg, E.: Structured retrieval for question answering. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’07, pp. 351–358. ACM, New York, NY, USA (2007). https://doi.org/10.1145/1277741.1277802

  5. Blooma, M.J., Chua, A.Y.K., Goh, D.H.L.: A predictive framework for retrieving the best answer. In: Proceedings of the 2008 ACM Symposium on Applied computing, SAC ’08, pp. 1107–1111. ACM, New York, NY, USA (2008). https://doi.org/10.1145/1363686.1363944

  6. Chen, B.C., Dasgupta, A., Wang, X., Yang, J.: Vote calibration in community question-answering systems. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’12, pp. 781–790. ACM, New York, NY, USA (2012). https://doi.org/10.1145/2348283.2348388

  7. Demner-Fushman, D., Lin, J.: Answering clinical questions with knowledge-based and statistical techniques. Comput. Linguist. 33(1), 63–103 (2007). https://doi.org/10.1162/coli.2007.33.1.63

    Article  Google Scholar 

  8. Ferrucci, D., Brown, E., Chu-Carroll, J., Fan, J., Gondek, D., Kalyanpur, A.A., Lally, A., Murdock, J.W., Nyberg, E., Prager, J., Schlaefer, N., Welty, C.: Answering clinical questions with knowledge-based and statistical techniques. AI Magazine, pp. 59–79 (2010)

  9. Harabagiu, S., Moldovan, D., Pasca, M., Surdeanu, M., Mihalcea, R., Lacatusu, F., Girju, R., Rus, V., Morarescu, P., Bunescu, R.: Answering complex list and context questions with lccs question-answering server. In: Proceeding of TREC (2001)

  10. Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th conference on Computational linguistics—vol. 2, COLING ’92, pp. 539–545. Association for Computational Linguistics, Stroudsburg, PA, USA (1992). https://doi.org/10.3115/992133.992154

  11. Jeon, J., Croft, W.B., Lee, J.H., Park, S.: A framework to predict the quality of answers with non-textual features. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’06, pp. 228–235. ACM, New York, NY, USA (2006). https://doi.org/10.1145/1148170.1148212

  12. Jindal, N., Liu, B.: Mining comparative sentences and relations. In: AAAI, pp. 1331–1336 (2006)

  13. Jurczyk, P., Agichtein, E.: Discovering authorities in question answer communities by using link analysis. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, CIKM ’07, pp. 919–922. ACM, New York, NY, USA (2007). https://doi.org/10.1145/1321440.1321575

  14. Ko, J., Nyberg, E., Si, L.: A probabilistic graphical model for joint answer ranking in question answering. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’07, pp. 343–350. ACM, New York, NY, USA (2007). https://doi.org/10.1145/1277741.1277801

  15. Ko, J., Si, L., Nyberg, E.: A probabilistic framework for answer selection in question answering. In: HLT-NAACL, pp. 524–531 (2007)

  16. Li, B., Liu, Y., Ram, A., Garcia, E.V., Agichtein, E.: Exploring question subjectivity prediction in community qa. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’08, pp. 735–736. ACM, New York, NY, USA (2008). https://doi.org/10.1145/1390334.1390477

  17. Li, S., Lin, C.Y., Song, Y.I., Li, Z.: Comparable entity mining from comparative questions. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL ’10, pp. 650–658. Association for Computational Linguistics, Stroudsburg, PA, USA (2010). http://dl.acm.org/citation.cfm?id=1858681.1858748

  18. Lin, J., Katz, B.: Question answering from the web using knowledge annotation and knowledge mining techniques. In: Proceedings of the Twelfth International Conference on Information and Knowledge Management, CIKM ’03, pp. 116–123. ACM, New York, NY, USA (2003). https://doi.org/10.1145/956863.956886

  19. Ming, Z.Y., Wang, K., Chua, T.S.: Prototype hierarchy based clustering for the categorization and navigation of web collections. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’10, pp. 2–9. ACM, New York, NY, USA (2010). https://doi.org/10.1145/1835449.1835453

  20. Snow, R., Jurafsky, D., Ng, A.Y.: Semantic taxonomy induction from heterogenous evidence. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, ACL-44, pp. 801–808. Association for Computational Linguistics, Stroudsburg, PA, USA (2006). https://doi.org/10.3115/1220175.1220276

  21. Soricut, R., Brill, E.: Automatic question answering: beyond the factoid. In: Proceedings of HLT-NAACL, pp. 191–206 (2004)

  22. Su, Q., Chen, H.K.Y., Huang, C.R.: Incorporate credibility into context for the best social media answers. In: PACLIC, pp. 535–541 (2010)

  23. Wang, F., Qi, S., Gao, G., Zhao, S., Wang, X.: Logo information recognition in large-scale social media data. Multimed. Syst. 22(1), 63–73 (2016)

    Article  Google Scholar 

  24. Yang, H., Callan, J.: A metric-based framework for automatic taxonomy induction. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1—Volume 1, ACL ’09, pp. 271–279. Association for Computational Linguistics, Stroudsburg, PA, USA (2009). http://dl.acm.org/citation.cfm?id=1687878.1687918

  25. Yu, J., Zha, Z.J., Wang, M., Wang, K., Chua, T.S.: Domain-assisted product aspect hierarchy generation: towards hierarchical organization of unstructured consumer reviews. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pp. 140–150. Association for Computational Linguistics, Stroudsburg, PA, USA (2011). http://dl.acm.org/citation.cfm?id=2145432.2145449

  26. Zhang, W.N., Ming, Z.Y., Zhang, Y., Nie, L., Liu, T., Chua, T.S.: The use of dependency relation graph to enhance the term weighting in question retrieval. In: The 24th International Conference on Computational Linguistics, COLING ’12, pp. 3105–3120. Indian Institute of Technology Bombay, Mumbai, India (2012). http://aclweb.org/anthology/C/C12/C12-1190.pdf

  27. Zhao, S., Ding, G., Gao, Y., Han, J.: Approximating discrete probability distribution of image emotions by multi-modal features fusion. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 4669–4675. AAAI Press (2017)

  28. Zhao, S., Gao, Y., Ding, G., Chua, T.S.: Real-time multimedia social event detection in microblog. IEEE Trans. Cybern. 99, 1–14 (2017)

    Google Scholar 

  29. Zhao, S., Yao, H., Gao, Y., Ding, G., Chua, T.-S.: Predicting personalized image emotion perceptions in social networks. IEEE Trans. Affect. Comput. 4, 526–540 (2018)

    Article  Google Scholar 

  30. Zhao, S., Yao, H., Gao, Y., Ji, R., Ding, G.: Continuous probability distribution prediction of image emotions via multitask shared sparse regression. IEEE Trans. Multimed. 19(3), 632–645 (2017)

    Article  Google Scholar 

  31. Zhao, S., Zhao, X., Ding, G., Keutzer, K.: Emotiongan: unsupervised domain adaptation for learning discrete probability distributions of image emotions. In: 2018 ACM Multimedia Conference on Multimedia Conference, pp. 1319–1327. ACM (2018)

  32. Zhou, T.C., Si, X., Chang, E.Y., King, I., Lyu, M.R.: A data-driven approach to question subjectivity identification in community question answering (2012)

  33. Zhou, Z.M., Lan, M., Niu, Z.Y., Lu, Y.: Exploiting user profile information for answer ranking in cqa. In: Proceedings of the 21st International Conference Companion on World Wide Web, WWW ’12 Companion, pp. 767–774. ACM, New York, NY, USA (2012). https://doi.org/10.1145/2187980.2188199

  34. Zhu, X., Ming, Z.Y., Zhu, X., Chua, T.S.: Topic hierarchy construction for the organization of multi-source user generated contents. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 233–242. ACM (2013)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nana Zhu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhu, N., Zhang, Z. & Ma, H. Ranking answers of comparative questions using heterogeneous information organization from social media. SIViP 13, 1267–1274 (2019). https://doi.org/10.1007/s11760-019-01465-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-019-01465-w

Keywords

Navigation