skip to main content
10.1145/3442442.3451388acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

aiai at the FinSim-2 task: Finance Domain Terms Automatic Classification Via Word Ontology and Embedding

Published:03 June 2021Publication History

ABSTRACT

This paper describes the method that we submitted to the FinSim-2 task on learning similarities for the financial domain. This task aims to automatically classify the Financial domain terms into the most relevant hypernym (or top-level) concept in an external ontology. This paper shows the result of experiments using the Catboost, Attention-LSTM, BERT, RoBERTa to develop an automatic finance domain classifier via word ontology and embedding. The experiment result demonstrates that each model could be an effective method to tackle the FinSim-2 task, respectively.

References

  1. Colin Raffel and Daniel P. W. Ellis. 2015. Feed-forward networks with attention can solve some long term memory problems. https://arxiv.org/abs/1512.08756.Google ScholarGoogle Scholar
  2. Jacob Devlin, Ming-Wei Chang, Ken ton Lee, and Kristina Toutanova.2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.Google ScholarGoogle Scholar
  3. Dogu Araci. 2019. FinBERT: Financial Sentiment Analysis with Pre-trained Language Models. https://arxiv.org/abs/1908.10063.Google ScholarGoogle Scholar
  4. Ke Tian, Hua Chen and Jie Yang. 2020. aiai at FinSBD task: Sentence Boundary Detection in Noisy Texts From Financial Documents Using Deep Attention Model. Processing of the First Workshop on Finance Technology and Natural Language Processing IJCAI2020. Japan.Google ScholarGoogle Scholar
  5. Ke Tian and Zi Jun Peng. 2019. aiai at FinNum task: Financial numeral tweets fine-grained classification using deep word and character embed- ding-based attention model. The 14th NTCIR Conference, Tokyo, Japan.Google ScholarGoogle Scholar
  6. FinSim-2 task. 2021. https://sites.google.com/nlg.csie.ntu.edu.tw/finweb2021/shared-task-finsim-2.Google ScholarGoogle Scholar
  7. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado and Jeffrey Dean. 2013. Distributed representations of words and phrases and their composi- tionality. https://arxiv.org/abs/1310.4546.Google ScholarGoogle Scholar
  8. Anna Veronika Dorogush, Vasily Ershov, Andrey Gulin. 2018. CatBoost: gradient boosting with categorical features support. https://arxiv.org/abs/1810.11363.Google ScholarGoogle Scholar
  9. Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. https://arxiv.org/abs/1907.11692.Google ScholarGoogle Scholar
  10. Natural Language Toolkit. https://www.nltk.org.Google ScholarGoogle Scholar
  1. aiai at the FinSim-2 task: Finance Domain Terms Automatic Classification Via Word Ontology and Embedding

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WWW '21: Companion Proceedings of the Web Conference 2021
      April 2021
      726 pages
      ISBN:9781450383134
      DOI:10.1145/3442442

      Copyright © 2021 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 June 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

      Upcoming Conference

      WWW '24
      The ACM Web Conference 2024
      May 13 - 17, 2024
      Singapore , Singapore

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format