Skip to main content

What Matters for Shoppers: Investigating Key Attributes for Online Product Comparison

  • Conference paper
  • First Online:
Advances in Information Retrieval (ECIR 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13186))

Included in the following conference series:

  • 2444 Accesses

Abstract

Before making high-consideration purchase decisions, shoppers generally need to identify and evaluate products’ key differentiating features or attributes. Many customers, however, lack the knowledge required to do so for all product domains. In this work, we investigate and analyze alternatives for identifying important product attributes, which customers can then use to compare candidate products. We propose an unsupervised attribute-ranking approach ReBARC, that combines both objective data from structured product catalogs, and subjective information from unstructured customer reviews, to suggest to the shopper the most important attributes to consider. Our detailed analysis of product attribute importance across various domains on a shopping website shows that ReBARC significantly outperforms prior efforts judged by both automated and human evaluation metrics. We also analyze the correlation and overlap between key product attributes detected by ReBARC, and those visible to customers during online product search.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bing, L., Wong, T., Lam, W.: Unsupervised extraction of popular product attributes from e-commerce web sites by considering customer reviews. ACM TOIT (2016)

    Google Scholar 

  2. Buhrmester, M., Kwang, T., Gosling, S.: Amazon’s mechanical Turk: a new source of inexpensive, yet high-quality data? (2016)

    Google Scholar 

  3. Cai, F., de Rijke, M.: A survey of query auto completion in information retrieval. Found. Trends Inf. Retrieval (2016)

    Google Scholar 

  4. Campos, R., Mangaravite, V., Pasquali, A., Jorge, A., Nunes, C., Jatowt, A.: Yake! keyword extraction from single documents using multiple local features. Inf. Sci. (2020)

    Google Scholar 

  5. Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: ACM SIGIR (1998)

    Google Scholar 

  6. Carmel, D., Lewin-Eytan, L., Maarek, Y.: Product question answering using customer generated content-research challenges. In: ACM SIGIR (2018)

    Google Scholar 

  7. Chen, G., Tian, Y., Song, Y.: Joint aspect extraction and sentiment analysis with directional graph convolutional networks. In: COLING (2020)

    Google Scholar 

  8. Chen, S., Li, C., Ji, F., Zhou, W., Chen, H.: Driven answer generation for product-related questions in e-commerce. In: ACM WSDM (2019)

    Google Scholar 

  9. Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20(1), 37–46 (1960)

    Article  Google Scholar 

  10. Da’u, A., Salim, N.: Aspect extraction on user textual reviews using multi-channel convolutional neural network. PeerJ Comput. Sci. 5 (2019)

    Google Scholar 

  11. Ghani, R., Probst, K., Liu, Y., Krema, M., Fano, A.: Text mining for product attribute extraction. ACM SIGKDD Explor. Newsl. (2006)

    Google Scholar 

  12. Giannakopoulos, A., Musat, C., Hossmann, A., Baeriswyl, M.: Unsupervised aspect term extraction with B-LSTM & CRF using automatically labelled datasets. arXiv preprint arXiv:1709.05094 (2017)

  13. He, R., Lee, W.S., Ng, H.T., Dahlmeier, D.: An unsupervised neural attention model for aspect extraction. In: ACL (2017)

    Google Scholar 

  14. Hirschmeier, S., Egger, M.: Social product search-enhancing product search with mined (sparse) product features (2018)

    Google Scholar 

  15. Huynh, V.P., Papotti, P.: A benchmark for fact checking algorithms built on knowledge bases. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 689–698 (2019)

    Google Scholar 

  16. Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of IR techniques. TOIS (2002)

    Google Scholar 

  17. Kozareva, Z., Li, Q., Zhai, K., Guo, W.: Recognizing salient entities in shopping queries. In: ACL (2016)

    Google Scholar 

  18. Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data (2001)

    Google Scholar 

  19. Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: WWW (2005)

    Google Scholar 

  20. Liu, Y., et al.: Roberta: a robustly optimized Bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019)

  21. Luo, L., et al.: Unsupervised neural aspect extraction with sememes. In: IJCAI (2019)

    Google Scholar 

  22. More, A.: Attribute extraction from product titles in ecommerce. arXiv preprint arXiv:1608.04670 (2016)

  23. Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: EMNLP-IJCNLP (2019)

    Google Scholar 

  24. Petrovski, P., Bizer, C.: Extracting attribute-value pairs from product specifications on the web. In: International Conference on Web Intelligence (2017)

    Google Scholar 

  25. Pontiki, M., et al.: SemEval-2016 task 5: aspect based sentiment analysis. In: Proceedings of the 10th International Workshop on Semantic Evaluation (2016)

    Google Scholar 

  26. Pontiki, M., Galanis, D., Pavlopoulos, J., Papageorgiou, H., Androutsopoulos, I., Manandhar, S.: SemEval-2014 task 4: aspect based sentiment analysis. In: Proceedings of the 8th International Workshop on Semantic Evaluation (2014)

    Google Scholar 

  27. Popescu, A., Etzioni, O.: Extracting product features and opinions from reviews. In: Kao, A., Poteet, S.R. (eds.) Natural Language Processing and Text Mining. Springer, London (2007). https://doi.org/10.1007/978-1-84628-754-1_2

    Chapter  Google Scholar 

  28. Probst, K., Ghani, R., Krema, M., Fano, A., Liu, Y.: Semi-supervised learning of attribute-value pairs from product descriptions. In: IJCAI (2007)

    Google Scholar 

  29. Putthividhya, D., Hu, J.: Bootstrapped named entity recognition for product attribute extraction. In: EMNLP (2011)

    Google Scholar 

  30. Reimers, N., Gurevych, I.: Sentence-Bert: sentence embeddings using Siamese Bert-networks. In: EMNLP-IJCNLP (2019)

    Google Scholar 

  31. Retail, T.: They say they want a revolution - price water house (2016). https://www.pwc.es/es/publicaciones/retail-y-consumo/assets/total-retail-2016.pdf

  32. Socher, R., et al.: Recursive deep models for semantic compositionality over a sentiment treebank. In: EMNLP (2013)

    Google Scholar 

  33. Thorne, J., Vlachos, A.: Evidence-based factual error correction. arXiv preprint arXiv:2106.01072 (2021)

  34. Tulkens, S., van Cranenburgh, A.: Embarrassingly simple unsupervised aspect extraction. ArXiv abs/2004.13580 (2020)

    Google Scholar 

  35. Vedula, N., Parthasarathy, S.: Face-keg: fact checking explained using knowledge graphs. In: Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 526–534 (2021)

    Google Scholar 

  36. Wu, B., Cheng, X., Wang, Y., Guo, Y., Song, L.: Simultaneous product attribute name and value extraction from web pages. In: ACM WI-IAT (2009)

    Google Scholar 

  37. Xu, H., Liu, B., Shu, L., Yu, P.S.: Double embeddings and CNN-based sequence labeling for aspect extraction. ArXiv abs/1805.04601 (2018)

    Google Scholar 

  38. Yang, Y., Chen, W., Li, Z., He, Z., Zhang, M.: Distantly supervised NER with partial annotation learning and reinforcement learning. In: International Conference on Computational Linguistics (2018)

    Google Scholar 

  39. Zheng, G., Mukherjee, S., Dong, X., Li, F.: OpenTag: open attribute value extraction from product profiles. In: ACM SIGKDD (2018)

    Google Scholar 

  40. Zhu, M.: Recall, precision and average precision. Department of Statistics and Actuarial Science, University of Waterloo, Waterloo (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nikhita Vedula .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vedula, N., Collins, M., Agichtein, E., Rokhlenko, O. (2022). What Matters for Shoppers: Investigating Key Attributes for Online Product Comparison. In: Hagen, M., et al. Advances in Information Retrieval. ECIR 2022. Lecture Notes in Computer Science, vol 13186. Springer, Cham. https://doi.org/10.1007/978-3-030-99739-7_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-99739-7_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-99738-0

  • Online ISBN: 978-3-030-99739-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics