skip to main content
10.1145/3132847.3133109acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

A Study of Feature Construction for Text-based Forecasting of Time Series Variables

Published: 06 November 2017 Publication History

Abstract

Time series are ubiquitous in the world since they are used to measure various phenomena (e.g., temperature, spread of a virus, sales, etc.). Forecasting of time series is highly beneficial (and necessary) for optimizing decisions, yet is a very challenging problem; using only the historical values of the time series is often insufficient. In this paper, we study how to construct effective additional features based on related text data for time series forecasting. Besides the commonly used n-gram features, we propose a general strategy for constructing multiple topical features based on the topics discovered by a topic model. We evaluate feature effectiveness using a data set for predicting stock price changes where we constructed additional features from news text articles for stock market prediction. We found that: 1) Text-based features outperform time series-based features, suggesting the great promise of leveraging text data for improving time series forecasting. 2) Topic-based features are not very effective stand-alone, but they can further improve performance when added on top of n-gram features. 3) The best topic-based feature appears to be a long-term aggregation of topics over time with high weights on recent topics.

References

[1]
D. M. Blei et al. Latent dirichlet allocation. Journal of machine Learning research, 3(Jan), 2003.
[2]
X. Ding et al. Using structured events to predict stock price movement: An empirical investigation. In EMNLP, 2014.
[3]
X. Ding et al. Deep learning for event-driven stock prediction. In IJCAI, 2015.
[4]
S. Feuerriegel and R. Fehrer. Improving decision analytics with deep learning: the case of financial disclosures. In ECIS, 2016.
[5]
M. Hagenau et al. Automated news reading: Stock price prediction based on financial news using context-capturing features. Decision Supp. Syst., 55(3), 2013.
[6]
K. S. JONES. A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1), 1972.
[7]
D. A. Kumar and S. Murugan. Performance analysis of indian stock market index using neural network time series model. In PRIME, 2013.
[8]
C.-J. Lu et al. Financial time series forecasting using independent component analysis and support vector regression. Decision Supp. Syst., 47(2), 2009.
[9]
M.-A. Mittermayer and G. Knolmayer. Text mining systems for market response to news: A survey. 2006.
[10]
T. H. Nguyen et al. Sentiment analysis on social media for stock movement prediction. Expert Syst. Appl., 2015.
[11]
T. Preis et al. Quantifying trading behavior in financial markets using google trends. Scientific reports, 3, 2013.
[12]
M. M. Rahman, S. K. K. Santu, M. M. Islam, and K. Murase. Forecasting time series -- a layered ensemble architecture. In IJCNN, pages 210--217. IEEE, 2014.
[13]
Y. Shynkevich et al. Predicting stock price movements based on different categories of news articles. In SSCI, 2015.
[14]
J. Si et al. Exploiting topic based twitter sentiment for stock prediction. In ACL, 2013.
[15]
F. E. Tay and L. Cao. Application of support vector machines in financial time series forecasting. Omega, 29(4), 2001.
[16]
Y. Zhang and L. Wu. Stock market prediction of S&P 500 via combination of improved BCO approach and BP neural network. Expert Syst. Appl., 36(5), 2009.

Cited By

View all
  • (2024)Stock Price Prediction Using Sentiment Analysis on Financial NewsData Science and Applications10.1007/978-981-99-7817-5_40(551-567)Online publication date: 18-Jan-2024
  • (2024)Textual data for electricity load forecastingQuality and Reliability Engineering International10.1002/qre.363740:8(4187-4208)Online publication date: 24-Aug-2024
  • (2023)Ensemble Learning Technique with A Novelty Multi‑Source Information for Stock Price MovementsProceedings of the 12th International Symposium on Information and Communication Technology10.1145/3628797.3629007(707-714)Online publication date: 7-Dec-2023
  • Show More Cited By

Index Terms

  1. A Study of Feature Construction for Text-based Forecasting of Time Series Variables

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
      November 2017
      2604 pages
      ISBN:9781450349185
      DOI:10.1145/3132847
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 06 November 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Short-paper

      Conference

      CIKM '17
      Sponsor:

      Acceptance Rates

      CIKM '17 Paper Acceptance Rate 171 of 855 submissions, 20%;
      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)13
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 02 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Stock Price Prediction Using Sentiment Analysis on Financial NewsData Science and Applications10.1007/978-981-99-7817-5_40(551-567)Online publication date: 18-Jan-2024
      • (2024)Textual data for electricity load forecastingQuality and Reliability Engineering International10.1002/qre.363740:8(4187-4208)Online publication date: 24-Aug-2024
      • (2023)Ensemble Learning Technique with A Novelty Multi‑Source Information for Stock Price MovementsProceedings of the 12th International Symposium on Information and Communication Technology10.1145/3628797.3629007(707-714)Online publication date: 7-Dec-2023
      • (2021)AutoML to Date and Beyond: Challenges and OpportunitiesACM Computing Surveys10.1145/347091854:8(1-36)Online publication date: 4-Oct-2021
      • (2021)Analyzing effect of news polarity on stock market prediction: a machine learning approach2021 12th International Conference on Information and Knowledge Technology (IKT)10.1109/IKT54664.2021.9685403(102-106)Online publication date: 14-Dec-2021
      • (2020)Leveraging Personalized Sentiment Lexicons for Sentiment AnalysisProceedings of the 2020 ACM SIGIR on International Conference on Theory of Information Retrieval10.1145/3409256.3409850(109-112)Online publication date: 14-Sep-2020
      • (2020)Semantic text analysis for detection of compromised accounts on social networksProceedings of the 12th IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.1109/ASONAM49781.2020.9381432(417-424)Online publication date: 7-Dec-2020
      • (2019)Automatic Assessment of Complex Assignments using Topic ModelsProceedings of the Sixth (2019) ACM Conference on Learning @ Scale10.1145/3330430.3333615(1-10)Online publication date: 24-Jun-2019
      • (2019)Stock Price Prediction Using News Sentiment Analysis2019 IEEE Fifth International Conference on Big Data Computing Service and Applications (BigDataService)10.1109/BigDataService.2019.00035(205-208)Online publication date: Apr-2019

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media