skip to main content
10.1145/3404555.3404583acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

Hierarchical Label Embedding Networks for Financial Document Sentiment Analysis

Published: 20 August 2020 Publication History

Abstract

With the rapid development of the Internet, document data have become an important source of information in the financial field. The application of documents sentiment analysis in the financial field has attracted increasing attention. It is obviously impractical to extract sentiments manually from a large amount of financial document, but natural language processing (NLP) technology can solve this problem. The research object of this paper focuses on the research reports of listed companies, which is a kind of long financial document published by experts in the field. In this paper, we propose a hierarchical label embedding neural network model for sentiment analysis of financial documents. This model adopts hierarchical network structure to capture the structural information of financial documents. Moreover, the model also includes an expression embedding mechanism for focusing on important content. We believe that most of the words and sentences in a document are consistent with the sentiments of the labels marked by the author. The label embedding mechanism can pay more attention to the content that is consistent with the sentiments of the labels during the document's hierarchical representation. Experiments showed that our method is more effective than other advanced methods on the established dataset.

References

[1]
Araci D. FinBERT: Financial Sentiment Analysis with Pretrained Language Models[J]. arXiv preprint arXiv:1908.10063, 2019.
[2]
Maia M, Handschuh S. FinSSLx: A Sentiment Analysis Model for the Financial Domain Using Text Simplification[C]//2018 IEEE 12th International Conference on Semantic Computing (ICSC). IEEE, 2018: 318--319.
[3]
Zhang L, Wang S, Liu B. Deep learning for sentiment analysis: A survey [J]. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2018, 8(4): e1253.
[4]
Li Guo, Feng Shi, and Jun Tu. 2016. Textual analysis and machine leaning: Crackunstructured data in finance and accounting. The Journal of Finance and Data Science 2, 3 (sep 2016), 153--170. https://doi.org/10.1016/J.JFDS.2017.02.001.
[5]
Dridi A, Atzeni M, Recupero D R. FineNews: fine-grained semantic sentiment analysis on financial microblogs and news[J]. International Journal of Machine Learning and Cybernetics, 2018(10):1--9.
[6]
Akhtar M S, Kumar A, Ghosal D, et al. A multilayer perceptron based ensemble technique for fine-grained financial sentiment analysis[C]//Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2017: 540--546.
[7]
Maia M, Handschuh S. FinSSLx: A Sentiment Analysis Model for the Financial Domain Using Text Simplification[C]//2018 IEEE 12th International Conference on Semantic Computing (ICSC). IEEE, 2018: 318--319.
[8]
Taboada M, Brooke J, Tofiloski M, et al. Lexicon-based methods for sentiment analysis[J]. Computational linguistics, 2011, 37(2): 267--307.
[9]
Im T., San, P., On, C., Rayner, A. and Anthony P. (2013) Analysing market sentiment in financial news using lexical approach, IEEE Open Systems Conference.
[10]
Al-Ayyoub M, Essa S B, Alsmadi I. Lexicon-based sentiment analysis of Arabic tweets[J]. IJSNM, 2015, 2(2): 101--114.
[11]
Baccianella, S., Esuli, A. and Sebastiani F. (2010) SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining, In Proceedings of the Seventh Conference on International Language Resources and Evaluation.
[12]
Hutto C J, Gilbert E. Vader: A parsimonious rule-based model for sentiment analysis of social media text[C]//Eighth international AAAI conference on weblogs and social media. 2014.
[13]
Latif M, Qamar U, Muzaffar A. Content-specific unigrams and syntactic phrases to enhance SentiWordNet based sentiment classification[J]. International Journal of Machine Learning and Computing, 2015, 5(4): 307.
[14]
Sohangir S, Petty N, Wang D. Financial Sentiment Lexicon Analysis[C]//2018.
[15]
G. Wang, T. Wang, B. Wang, D. Sambasivan, Z. Zhang, H. Zheng, and B. Y. Zhao, "Crowds on wall street: Extracting value from collaborative investing platforms," The 18th ACM conference on Computer-Supported Cooperative Work and Social Computing (CSCW), March 2015.
[16]
Yusof N N, Mohamed A, Abdul-Rahman S. A review of contextual information for context-based approach in sentiment analysis[J]. Int. J. Mach. Learn. Comput, 2018, 8: 399--403.
[17]
Ansari G, Ahmad T, Doja M N. Ensemble of Feature Ranking Methods Using Hesitant Fuzzy Sets for Sentiment Classification[J]. International Journal of Machine Learning and Computing, 2019, 9(5).
[18]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their ompositionality. In Advances in neural information processing systems. Lake Tahoe, NV, USA, pages 3111--3119.
[19]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP-2014). ACL, Doha, Qatar, pages 1532--1543.
[20]
Xiao K, Zhang Z, Wu J. Chinese text sentiment analysis based on improved Convolutional Neural Networks[C]//2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS). IEEE, 2016: 922--926.
[21]
Yang Z, Yang D, Dyer C, et al. Hierarchical attention networks for document classification[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016: 1480--1489.
[22]
Wang G, Li C, Wang W, et al. Joint embedding of words and labels for text classification [J]. arXiv preprint arXiv:1805.04174, 2018.
[23]
Bojanowski P, Grave E, Joulin A, et al. Enriching word vectors with subword information[J]. Transactions of the Association for Computational Linguistics, 2017, 5: 135--146.
[24]
Yan Song, Shuming Shi, Jing Li, and Haisong Zhang. Directional Skip-Gram: Explicitly Distinguishing Left and Right Context for Word Embeddings. NAACL 2018.
[25]
Y. Bengio, P. Simard, and P. Frasconi, "Learning long-term dependencies with gradient descent is difficult," IEEE Transactions on Neural Networks, vol. 5, pp. 157--166, 2002.
[26]
S. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," Neural Computation, vol. 9, pp. 1735--1780, 1997.
[27]
K. Cho, B. Van Merrienboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, et al., "Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation," Computer Science, 2014.
[28]
Bahdanau D, Cho K, Bengio Y. Neural machine translation by jointly learning to align and translate[J]. arXiv preprint arXiv:1409.0473, 2014.
[29]
Bird S, Loper E. NLTK: The Natural Language Toolkit[C]//Acl on Interactive Poster & Demonstration Sessions. Association for Computational Linguistics, 2004.
[30]
D. P. Kingma and J. Ba, "Adam: A Method for Stochastic Optimization," international conference on learning representations, 2015.

Index Terms

  1. Hierarchical Label Embedding Networks for Financial Document Sentiment Analysis

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCAI '20: Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence
    April 2020
    563 pages
    ISBN:9781450377089
    DOI:10.1145/3404555
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • University of Tsukuba: University of Tsukuba

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 August 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Document sentiment analysis
    2. hierarchical neural network
    3. label embedding mechanism
    4. research reports

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCAI '20

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 121
      Total Downloads
    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 22 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media