research-article

Descriptions from the Customers: Comparative Analysis of Review-based Product Description Generation Methods

Authors:

Slava Novgorodov,

Kira Radinsky

ACM Transactions on Internet Technology (TOIT), Volume 20, Issue 4

Article No.: 44, Pages 1 - 31

https://doi.org/10.1145/3418202

Published: 06 October 2020

Abstract

Product descriptions play an important role in the e-commerce ecosystem. Yet, on leading e-commerce websites, product descriptions are often lacking or missing. In this work, we propose to overcome these issues by generating product descriptions from user reviews. We identify the set of candidates using a supervised approach that extracts review sentences in their original form, diversifies them, and selects the top candidates. We present extensive analyses of the generated descriptions, including a comparison to the original descriptions and an examination of review coverage. We also perform an A/B test that demonstrates the impact of presenting our descriptions on user traffic.
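
The extract, diversify, and select steps named in the abstract can be illustrated with a minimal sketch. The abstract does not specify the scoring model or the diversification rule, so everything here is an assumption: `toy_score` is a hypothetical stand-in for a trained supervised scorer, and the diversification uses a greedy, MMR-style penalty based on Jaccard word overlap, which is a swapped-in technique and not necessarily the one used in the article.

```python
from typing import Callable, List, Set


def tokens(sentence: str) -> Set[str]:
    """Lowercased word set; a deliberately crude stand-in for real text features."""
    return {w.strip(".,!?").lower() for w in sentence.split() if w.strip(".,!?")}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Word-overlap similarity in [0, 1]."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def select_description(
    sentences: List[str],
    score: Callable[[str], float],
    k: int = 3,
    trade_off: float = 0.7,
) -> List[str]:
    """Greedily pick up to k sentences, trading score against redundancy.

    Sentences are returned verbatim (extractive, not rewritten).
    The MMR-style objective is an assumption made for illustration.
    """
    remaining = list(dict.fromkeys(sentences))  # drop exact duplicates, keep order
    chosen: List[str] = []
    while remaining and len(chosen) < k:

        def mmr(s: str) -> float:
            # Redundancy = highest overlap with any already chosen sentence.
            redundancy = max(
                (jaccard(tokens(s), tokens(c)) for c in chosen), default=0.0
            )
            return trade_off * score(s) - (1.0 - trade_off) * redundancy

        best = max(remaining, key=mmr)
        chosen.append(best)
        remaining.remove(best)
    return chosen


def toy_score(sentence: str) -> float:
    """Hypothetical scorer: share of words that are product-descriptive cue words."""
    cues = {"battery", "screen", "lightweight", "durable", "fits", "waterproof"}
    words = tokens(sentence)
    return len(words & cues) / len(words) if words else 0.0


if __name__ == "__main__":
    reviews = [
        "The battery lasts two days.",
        "The battery lasts two days!",
        "Shipping was slow.",
        "Lightweight and durable case.",
    ]
    for line in select_description(reviews, toy_score, k=2):
        print(line)
```

In a realistic setting the scorer would be a classifier trained on labeled review sentences, and the similarity measure would likely be embedding-based; the greedy loop itself would stay the same.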

Published In

ACM Transactions on Internet Technology Volume 20, Issue 4

November 2020

391 pages

ISSN:1533-5399

EISSN:1557-6051

DOI:10.1145/3427795

Editor:
Ling Liu
Georgia Institute of Technology, USA

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 October 2020

Accepted: 01 July 2020

Revised: 01 June 2020

Received: 01 December 2019

Published in TOIT Volume 20, Issue 4

Qualifiers

Research-article
Research
Refereed
