skip to main content
10.1145/3184558.3186346acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Free access

When E-commerce Meets Social Media: Identifying Business on WeChat Moment Using Bilateral-Attention LSTM

Published: 23 April 2018 Publication History

Abstract

WeChat Business, developed on WeChat, the most extensively used instant messaging platform in China, is a new business model that bursts into people's lives in the e-commerce era. As one of the most typical WeChat Business behaviors, WeChat users can advertise products, advocate companies and share customer feedback to their WeChat friends by posting a WeChat Moment--a public status that contains images and a text. Given its popularity and significance, in this paper, we propose a novel Bilateral-Attention LSTM network (BiATT-LSTM) to identify WeChat Business Moments based on their texts and images. In particular, different from previous schemes that equally consider visual and textual modalities for a joint visual-textual classification task, we start our work with a text classification task based on an LSTM network, then we incorporate a bilateral-attention mechanism that can automatically learn two kinds of explicit attention weights for each word, namely 1) a global weight that is insensitive to the images in the same Moment with the word, and 2) a local weight that is sensitive to the images in the same Moment. In this process, we utilize visual information as a guidance to figure out the local weight of a word in a specific Moment. Two-level experiments demonstrate the effectiveness of our framework. It outperforms other schemes that jointly model visual and textual modalities. We also visualize the bilateral-attention mechanism to illustrate how this mechanism helps joint visual-textual classification.

References

[1]
Aishwarya Agrawal, Jiasen Lu, Stanislaw Antol, Margaret Mitchell, C. Lawrence Zitnick, Devi Parikh, and Dhruv Batra. 2017. VQA: Visual Question Answering. International Journal of Computer Vision Vol. 123, 1 (2017), 4--31.
[2]
Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C Lawrence Zitnick, and Devi Parikh. 2015. Vqa: Visual question answering. In Proceedings of the IEEE International Conference on Computer Vision. 2425--2433.
[3]
David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research Vol. 3, Jan (2003), 993--1022.
[4]
Alex Graves et almbox. 2012. Supervised sequence labelling with recurrent neural networks. Vol. Vol. 385. Springer.
[5]
Zhao Guo, Lianli Gao, Jingkuan Song, Xing Xu, Jie Shao, and Heng Tao Shen. 2016. Attention-based LSTM with Semantic Consistency for Videos Captioning Proceedings of the 2016 ACM on Multimedia Conference. ACM, 357--361.
[6]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep Residual Learning for Image Recognition. arXiv preprint arXiv:1512.03385 (2015).
[7]
Quoc Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents Proceedings of the 31st International Conference on Machine Learning (ICML-14). 1188--1196.
[8]
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang. 2017. Person Search with Natural Language Description. arXiv:1702.05729 (2017).
[9]
Zhuqi Li, Lin Chen, Yichong Bai, Kaigui Bian, and Pan Zhou. 2016. On Diffusion-restricted Social Network: A Measurement Study of WeChat Moments. IEEE International Conference on Communications (2016).
[10]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality Advances in neural information processing systems. 3111--3119.
[11]
Jiezhong Qiu, Yixuan Li, Jie Tang, Zheng Lu, Hao Ye, Bo Chen, Qiang Yang, and John E. Hopcroft. 2016. The Lifecycle and Cascade of WeChat Social Messaging Groups. Proceedings of. ACM International Conference on World Wide Web (WWW) Pages 311--320 (2016).
[12]
Steffen Rendle. 2010. Factorization Machines. In ICDM 2010, the IEEE International Conference on Data Mining, Sydney, Australia, 14--17 December. 995--1000.
[13]
Kevin J Shih, Saurabh Singh, and Derek Hoiem. 2016. Where to look: Focus regions for visual question answering. (2016), 4613--4621.
[14]
Kunal Swani, Brian P. Brown, and George R. Milne. 2014. Should tweets differ for B2B and B2C An analysis of Fortune 500 companies' Twitter communications. Industrial Marketing Management Vol. 43, 5 (2014), 873--881.
[15]
Yang Wang, Yao Li, Bryan Semaan, and Jian Tang. 2016. Space Collapse: Reinforcing, Reconfiguring and Enhancing Chinese Social Practices through WeChat. In ICWSM.
[16]
Ho Chung Wu, Robert Wing Pong Luk, Kam Fai Wong, and Kui Lam Kwok. 2008. Interpreting tf-idf term weights as making relevance decisions. ACM Transactions on Information Systems (TOIS) Vol. 26, 3 (2008), 13.
[17]
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C Courville, Ruslan Salakhutdinov, Richard S Zemel, and Yoshua Bengio. 2015. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. Vol. 14 (2015), 77--81.
[18]
Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, and Alex Smola. 2016. Stacked attention networks for image question answering. (2016), 21--29.
[19]
Quanzeng You, Liangliang Cao, Hailin Jin, and Jiebo Luo. 2016 a. Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, 1008--1017.
[20]
Quanzeng You, Hailin Jin, Zhaowen Wang, Chen Fang, and Jiebo Luo. 2016 b. Image captioning with semantic attention. (2016), 4651--4659.
[21]
Quanzeng You, Jiebo Luo, Hailin Jin, and Jianchao Yang. 2016 c. Cross-modality Consistent Regression for Joint Visual-Textual Sentiment Analysis of Social Multimedia. In ACM International Conference on Web Search and Data Ming (WSDM). 13--22.
[22]
Chengxi Zang, Peng Cui, and Christos Faloutsos. 2016. Beyond Sigmoids: The NetTide Model for Social Network Growth, and Its Applications The ACM SIGKDD International Conference. 2015--2024.
[23]
Shuangfei Zhai, Keng Hao Chang, Ruofei Zhang, and Zhongfei Mark Zhang. 2016. DeepIntent: Learning Attentions for Online Advertising with Recurrent Neural Networks ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1295--1304.

Cited By

View all
  • (2023)Deep Learning-Based Sentiment Analysis of Customer Reviews on Hotels Through TweetsProceedings of ICACTCE'23 — The International Conference on Advances in Communication Technology and Computer Engineering10.1007/978-3-031-37164-6_39(537-548)Online publication date: 24-Sep-2023
  • (2019)Analysis of Online Marketplace Data on Social Networks Using LSTM2019 5th International Conference on Advances in Electrical Engineering (ICAEE)10.1109/ICAEE48663.2019.8975591(381-385)Online publication date: Sep-2019
  • (2018)Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTMProceedings of the 26th ACM international conference on Multimedia10.1145/3240508.3240533(117-125)Online publication date: 15-Oct-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '18: Companion Proceedings of the The Web Conference 2018
April 2018
2023 pages
ISBN:9781450356404
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 23 April 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. attention model
  2. joint visual-textual learning
  3. multimodality analysis
  4. wechat business

Qualifiers

  • Research-article

Conference

WWW '18
Sponsor:
  • IW3C2
WWW '18: The Web Conference 2018
April 23 - 27, 2018
Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)189
  • Downloads (Last 6 weeks)26
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Deep Learning-Based Sentiment Analysis of Customer Reviews on Hotels Through TweetsProceedings of ICACTCE'23 — The International Conference on Advances in Communication Technology and Computer Engineering10.1007/978-3-031-37164-6_39(537-548)Online publication date: 24-Sep-2023
  • (2019)Analysis of Online Marketplace Data on Social Networks Using LSTM2019 5th International Conference on Advances in Electrical Engineering (ICAEE)10.1109/ICAEE48663.2019.8975591(381-385)Online publication date: Sep-2019
  • (2018)Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTMProceedings of the 26th ACM international conference on Multimedia10.1145/3240508.3240533(117-125)Online publication date: 15-Oct-2018

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media