skip to main content
research-article

Descriptions from the Customers: Comparative Analysis of Review-based Product Description Generation Methods

Published: 06 October 2020 Publication History

Abstract

Product descriptions play an important role in the e-commerce ecosystem. Yet, on leading e-commerce websites product descriptions are often lacking or missing. In this work, we suggest to overcome these issues by generating product descriptions from user reviews. We identify the set of candidates using a supervised approach that extracts review sentences in their original form, diversifies them, and selects the top candidates. We present extensive analyses of the generated descriptions, including a comparison to the original descriptions and examination of review coverage. We also perform an A/B test that demonstrates the impact of presenting our descriptions on user traffic.

References

[1]
Palakorn Achananuparp, Xiaohua Hu, and Xiajiong Shen. 2008. The evaluation of sentence similarity measures. In Proceedings of the DaWaK. 305--316.
[2]
Mathieu Acher, Anthony Cleve, Gilles Perrouin, Patrick Heymans, Charles Vanbeneden, Philippe Collet, and Philippe Lahire. 2012. On extracting feature models from product descriptions. In Proceedings of the VaMoS. 45--54.
[3]
Sanjeev Arora, Yingyu Liang, and Tengyu Ma. 2017. A simple but tough-to-beat baseline for sentence embeddings. In Proceedings of the ICLR.
[4]
Adam Berger and John Lafferty. 1999. Information retrieval as statistical translation. In Proceedings of the SIGIR. 222--229.
[5]
Avi Bleiweiss. 2019. LSTM neural networks for transfer learning in online moderation of abuse context. In Proceedings of the ICAART. SciTePress, 112--122.
[6]
Rich Caruana. 1998. Multitask learning. In Learning to Learn. Springer, 95--133.
[7]
Y. Chae, M. Nakazawa, and B. Stenger. 2018. Enhancing product images for click-through rate improvement. In Proceedings of the ICIP. 1428--1432.
[8]
Tianqi Chen and Carlos Guestrin. 2016. XGBoost: A scalable tree boosting system. In Proceedings of the KDD. 785--794.
[9]
Judith A. Chevalier and Dina Mayzlin. 2006. The effect of word of mouth on sales: Online book reviews. J. Market. Res. 43, 3 (2006), 345--354.
[10]
Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20, 1 (1960), 37--46.
[11]
Jacob Cohen. 1968. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychol. Bull. 70, 4 (1968), 213--220.
[12]
Horatiu Dumitru, Marek Gibiec, Negar Hariri, Jane Cleland-Huang, Bamshad Mobasher, Carlos Castro-Herrera, and Mehdi Mirakhorli. 2011. On-demand feature recommendations derived from mining public product descriptions. In Proceedings of the ICSE. 181--190.
[13]
Guy Elad, Ido Guy, Slava Novgorodov, Benny Kimelfeld, and Kira Radinsky. 2019. Learning to generate personalized product descriptions. In Proceedings of the CIKM. 389--398.
[14]
Günes Erkan and Dragomir R. Radev. 2004. Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22 (2004), 457--479.
[15]
Kavita Ganesan, ChengXiang Zhai, and Jiawei Han. 2010. Opinosis: A graph-based approach to abstractive summarization of highly redundant opinions. In Proceedings of the COLING. 340--348.
[16]
Shima Gerani, Yashar Mehdad, Giuseppe Carenini, Raymond T. Ng, and Bita Nejat. 2014. Abstractive summarization of product reviews using discourse structure. In Proceedings of the EMNLP. 1602--1613.
[17]
Anindya Ghose and Panagiotis G. Ipeirotis. 2011. Estimating the helpfulness and economic impact of product reviews: Mining text and reviewer characteristics. IEEE Trans. Knowl. Data Eng. 23, 10 (2011), 1498--1512.
[18]
Ross Girshick. 2015. Fast R-CNN. In Proceedings of the ICCV. 1440--1448.
[19]
Xavier Glorot, Antoine Bordes, and Yoshua Bengio. 2011. Domain adaptation for large-scale sentiment classification: A deep learning approach. In Proceedings of the ICML. 513--520.
[20]
Jade Goldstein, Mark Kantrowitz, Vibhu Mittal, and Jaime Carbonell. 1999. Summarizing text documents: Sentence selection and evaluation metrics. In Proceedings of the SIGIR. 121--128.
[21]
Vishrawas Gopalakrishnan, Suresh Parthasarathy Iyengar, Amit Madaan, Rajeev Rastogi, and Srinivasan Sengamedu. 2012. Matching product titles using web-based enrichment. In Proceedings of the CIKM. 605--614.
[22]
Anjan Goswami, Naren Chittar, and Chung H. Sung. 2011. A study on the impact of product images on user clicks for online shopping. In Proceedings of the WWW. 45--46.
[23]
Ido Guy, Avihai Mejer, Alexander Nus, and Fiana Raiber. 2017. Extracting and ranking travel tips from user-generated reviews. In Proceedings of the WWW. 987--996.
[24]
Ido Guy and Bracha Shapira. 2018. From royals to vegans: Characterizing question trolling on a community question answering website. In Proceedings of the SIGIR. 835--844.
[25]
Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, and Richard Socher. 2016. A joint many-task model: Growing a neural network for multiple NLP tasks. arXiv preprint abs/1611.01587 (2016).
[26]
Ruining He and Julian McAuley. 2016. Ups and downs: Modeling the visual evolution of fashion trends with one-class collaborative filtering. In Proceedings of the WWW. 507--517.
[27]
Sharon Hirsch, Ido Guy, Alexander Nus, Arnon Dagan, and Oren Kurland. 2020. Query reformulation in E-commerce search. In Proceedings of the SIGIR. 1319--1328.
[28]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Comput. 9, 8 (1997), 1735--1780.
[29]
Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the KDD. 168--177.
[30]
Nan Hu, Paul A. Pavlou, and Jennifer Zhang. 2006. Can online reviews reveal a product’s true quality?: Empirical findings and analytical modeling of online word-of-mouth communication. In Proceedings of the EC. 324--330.
[31]
Alice Jiang, Zhilin Yang, and Minjoon Jun. 2013. Measuring consumer perceptions of online shopping convenience. J. Serv. Manag. 24, 2 (2013), 191--214.
[32]
Gagandeep Kaur and Gagandeep Kaur. 2016. Mobile applications are major players in the world of e-commerce. Int. J. Adv. Res. IT Eng. 5, 2 (2016), 13--21.
[33]
Zehra Kavasoğlu and Şule Gündüz Öğüdücü. 2013. Personalized summarization of customer reviews based on user’s browsing history. IADIS Int. J. Comput. Sci. Inf. Syst. 8, 2 (2013), 147--158.
[34]
H. Khalid, E. Shihab, M. Nagappan, and A. E. Hassan. 2015. What do mobile app users complain about? IEEE Softw. 32, 3 (2015), 70--77.
[35]
Hyun Duk Kim, Kavita Ganesan, Parikshit Sondhi, and Chengxiang Zhai. 2011. Comprehensive review of opinion summarization. UIUC Technical Report. University of Illinois Urbana-Champaign. https://core.ac.uk/download/pdf/4827130.pdf.
[36]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint abs/1412.6980 (2014).
[37]
Ron Kohavi and Roger Longbotham. 2017. Online controlled experiments and A/B testing. Encyclopedia of Machine Learning and Data Mining 7, 8 (2017), 922--929.
[38]
Siwei Lai, Liheng Xu, Kang Liu, and Jun Zhao. 2015. Recurrent convolutional neural networks for text classification. In Proceedings of the AAAI. 2267--2273.
[39]
Eun-Ju Lee and Soo Yun Shin. 2014. When do consumers buy online product reviews? Effects of review quality, product type, and reviewer’s photo. Comput. Hum. Behav. 31 (2014), 356--366.
[40]
Beibei Li, Anindya Ghose, and Panagiotis G. Ipeirotis. 2011. Towards a theory model for product search. In Proceedings of the WWW. 327--336.
[41]
Fangtao Li, Chao Han, Minlie Huang, Xiaoyan Zhu, Ying-Ju Xia, Shu Zhang, and Hao Yu. 2010. Structure-aware review mining and summarization. In Proceedings of the COLING. 653--661.
[42]
Xinxin Li and Lorin M. Hitt. 2008. Self-selection and information role of online product reviews. Inf. Syst. Res. 19, 4 (2008), 456--474.
[43]
Moez Limayem, Mohamed Khalifa, and A. Frini. 2000. What makes consumers buy from Internet? A longitudinal study of online shopping. IEEE Trans. Syst. Man Cyber. Part A 30, 4 (2000), 421--432.
[44]
Chin-Yew Lin and Eduard Hovy. 2002. From single to multi-document summarization: A prototype system and its evaluation. In Proceedings of the ACL. 457--464.
[45]
Bing Liu. 2012. Sentiment Analysis and Opinion Mining. Morgan 8 Claypool Publishers.
[46]
Qian Liu, Zhiqiang Gao, Bing Liu, and Yuanlin Zhang. 2015. Automated rule selection for aspect extraction in opinion mining. In Proceedings of the IJCAI. 1291--1297.
[47]
Roque Enrique Lpez Condori and Thiago Alexandre Salgueiro Pardo. 2017. Opinion summarization methods. Exp. Syst. Appl. 78, C (July 2017), 124--134.
[48]
Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective approaches to attention-based neural machine translation. arXiv preprint abs/1508.04025 (2015).
[49]
Duy Khang Ly, Kazunari Sugiyama, Ziheng Lin, and Min-Yen Kan. 2011. Product review summarization from a deeper perspective. In Proceedings of the JCDL. 311--314.
[50]
Dehong Ma, Sujian Li, Xiaodong Zhang, and Houfeng Wang. 2017. Interactive attention networks for aspect-level sentiment classification. In Proceedings of the IJCAI. 4068--4074.
[51]
James MacQueen et al. 1967. Some methods for classification and analysis of multivariate observations. In Proceedings of the Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1. 281--297.
[52]
Deborah Brown McCabe and Stephen M. Nowlis. 2003. The effect of examining actual products or product descriptions on consumer preference. J. Consum. Psychol. 13, 4 (2003), 431--439.
[53]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint abs/1301.37810 (2013).
[54]
Hye-Jin Min and Jong C. Park. 2012. Identifying helpful reviews based on customer’s mentions about experiences. Exp. Syst. Applic. 39, 15 (2012), 11830--11838.
[55]
Samaneh Moghaddam and Martin Ester. 2012. On the design of LDA models for aspect-based opinion mining. In Proceedings of the CIKM. 803--812.
[56]
Ajinkya More. 2016. Attribute extraction from product titles in eCommerce. arXiv preprint abs/1608.04670 (2016).
[57]
Lili Mou, Zhao Meng, Rui Yan, Ge Li, Yan Xu, Lu Zhang, and Zhi Jin. 2016. How transferable are neural networks in NLP applications? arXiv preprint arXiv:1603.06111 (2016).
[58]
Quang Nguyen. 2012. Detecting Experience Revealing Sentences in Product Reviews. Ph.D. Dissertation. University of Amsterdam.
[59]
Slava Novgorodov, Ido Guy, Guy Elad, and Kira Radinsky. 2019. Generating product descriptions from user reviews. In Proceedings of the WWW. 1354--1364.
[60]
Bo Pang and Lillian Lee. 2008. Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2, 1--2 (Jan. 2008), 1--135.
[61]
Eun Joo Park, Eun Young Kim, Venessa Martin Funches, and William Foxx. 2012. Apparel product attributes, web browsing, and e-impulse buying on shopping websites. J. Bus. Res. 65, 11 (2012), 1583--1589.
[62]
A. M. Popescu and Oren Etzioni. 2005. Extracting product features and opinions from reviews. In Proceedings of the HLT. 339--346.
[63]
Katharina Probst, Rayid Ghani, Marko Krema, Andrew Fano, and Yan Liu. 2007. Semi-supervised learning of attribute-value pairs from product descriptions. In Proceedings of the IJCAI. 2838--2843.
[64]
Reid Pryzant, Young-Joo Chung, and Dan Jurafsky. 2017. Predicting sales from the language of product descriptions. In Proceedings of the ECOM (SIGIR Workshops).
[65]
Pradeep Racherla, Munir Mandviwalla, and Daniel J. Connolly. 2012. Factors affecting consumers’ trust in online product reviews. J. Consum. Behav. 11, 2 (2012), 94--104.
[66]
Irina Rish. 2001. An empirical study of the naive Bayes classifier. In IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, Vol. 3. 41--46.
[67]
Sebastian Ruder. 2017. An overview of multi-task learning in deep neural networks. arXiv preprint abs/1706.05098 (2017).
[68]
Andrew I. Schein, Alexandrin Popescul, Lyle H. Ungar, and David M. Pennock. 2002. Methods and metrics for cold-start recommendations. In Proceedings of the SIGIR. 253--260.
[69]
Keiji Shinzato and Satoshi Sekine. 2013. Unsupervised extraction of attributes and their values from product description. In Proceedings of the ACL. 1339--1347.
[70]
Anders Søgaard and Yoav Goldberg. 2016. Deep multi-task learning with low level tasks supervised at lower layers. In Proceedings of the ACL, Vol. 2. 231--235.
[71]
Krysta M. Svore, Lucy Vanderwende, and Chris J. C. Burges. 2007. Enhancing single-document summarization by combining RankNet and third-party sources. In Proceedings of the EMNLP-CoNLL.
[72]
Tijmen Tieleman and Geoffrey Hinton. 2012. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neur. Netw. Mach. Learn. 4, 2 (2012), 26--31.
[73]
Hen Tzaban, Ido Guy, Asnat Greenstein-Messica, Arnon Dagan, Lior Rokach, and Bracha Shapira. 2020. Product bundle identification using semi-supervised learning. In Proceedings of the SIGIR. 791--800.
[74]
Damir Vandic, Flavius Frasincar, and Uzay Kaymak. 2018. A framework for product description classification in e-commerce. J. Web Eng. 17, 1--2 (2018), 001--027.
[75]
Shuai Wang, Zhiyuan Chen, and Bing Liu. 2016. Mining aspect-specific opinion using a holistic lifelong topic model. In Proceedings of the WWW. 167--176.
[76]
Wenya Wang, Sinno Jialin Pan, Daniel Dahlmeier, and Xiaokui Xiao. 2017. Coupled multi-layer attentions for co-extraction of aspect and opinion terms. In Proceedings of the AAAI. 3316--3322.
[77]
Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, and Eduard Hovy. 2016. Hierarchical attention networks for document classification. In Proceedings of the NAACL. 1480--1489.
[78]
Koji Yatani, Michael Novati, Andrew Trusty, and Khai N. Truong. 2011. Analysis of adjective-noun word pair extraction methods for online review summarization. In Proceedings of the IJACI. 2771--2776.
[79]
Naitong Yu, Minlie Huang, Yuanyuan Shi, et al. 2016. Product review summarization by exploiting phrase properties. In Proceedings of the COLING. 1113--1124.
[80]
Di Zhu, Theodoros Lappas, and Juheng Zhang. 2018. Unsupervised tip-mining from customer reviews. Dec. Supp. Syst. 107 (2018), 116--124.
[81]
Li Zhuang, Feng Jing, and Xiao-Yan Zhu. 2006. Movie review mining and summarization. In Proceedings of the CIKM. 43--50.
[82]
Barret Zoph, Deniz Yuret, Jonathan May, and Kevin Knight. 2016. Transfer learning for low-resource neural machine translation. arXiv preprint arXiv:1604.02201 (2016).

Cited By

View all
  • (2024)Encoding consumer interests into product snippets with a multi-criteria genetic optimization approachInformation & Management10.1016/j.im.2024.10405161:8(104051)Online publication date: Dec-2024
  • (2023)ReqGen: Keywords-Driven Software Requirements GenerationMathematics10.3390/math1102033211:2(332)Online publication date: 9-Jan-2023
  • (2023)Graph-Based Extractive Text Summarization Sentence Scoring Scheme for Big Data ApplicationsInformation10.3390/info1409047214:9(472)Online publication date: 22-Aug-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Internet Technology
ACM Transactions on Internet Technology  Volume 20, Issue 4
November 2020
391 pages
ISSN:1533-5399
EISSN:1557-6051
DOI:10.1145/3427795
  • Editor:
  • Ling Liu
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 October 2020
Accepted: 01 July 2020
Revised: 01 June 2020
Received: 01 December 2019
Published in TOIT Volume 20, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Deep multi-task leaning
  2. electronic commerce
  3. language generation
  4. user-generated content

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)62
  • Downloads (Last 6 weeks)2
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Encoding consumer interests into product snippets with a multi-criteria genetic optimization approachInformation & Management10.1016/j.im.2024.10405161:8(104051)Online publication date: Dec-2024
  • (2023)ReqGen: Keywords-Driven Software Requirements GenerationMathematics10.3390/math1102033211:2(332)Online publication date: 9-Jan-2023
  • (2023)Graph-Based Extractive Text Summarization Sentence Scoring Scheme for Big Data ApplicationsInformation10.3390/info1409047214:9(472)Online publication date: 22-Aug-2023
  • (2023)Intelligent online selling point extraction and generation for e‐commerce recommendationAI Magazine10.1002/aaai.1208344:1(16-29)Online publication date: 5-Apr-2023
  • (2022)The Tip of the Buyer: Extracting Product Tips from ReviewsACM Transactions on Internet Technology10.1145/354714023:1(1-30)Online publication date: 14-Jul-2022
  • (2021)Comparison of Deep Learning Models for Automatic Generation of Product Description on E-commerce siteThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487696(223-231)Online publication date: 29-Nov-2021
  • (2012)Social Recommender SystemsRecommender Systems Handbook10.1007/978-1-0716-2197-4_22(835-870)Online publication date: 24-Feb-2012

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media