skip to main content
10.1145/3406601.3406621acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiaitConference Proceedingsconference-collections
research-article

Visual Features for Multivariate Time Series

Authors Info & Claims
Published:03 July 2020Publication History

ABSTRACT

Visual analytics combines the capabilities of computers and humans to explore the insight of data. It provides coupling interactive visual representations with underlying analytical processes (e.g., visual feature extraction) so that users can utilize their cognitive and reasoning capabilities to perform complex tasks effectively or to make decisions. This paper applies successfulness of visual analytics to multivariate temporal data by proposing an interactive web prototype and an approach that enables users to explore data and detect visual features of interest. A list of nonparametric quantities is proposed to extract visual patterns of time series as well as to compute the similarity between them. The prototype integrates visualization and dimensional reduction techniques to support the exploration processes. Many different temporal datasets are used to justify the effectiveness of this approach, and some remarkable results are presented to show its value.

References

  1. Mohammed Ali, Mark W Jones, Xianghua Xie, and Mark Williams. 2019. TimeCluster: dimension reduction applied to temporal data for visual analytics. The Visual Computer 35, 6--8 (2019), 1013--1026.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Kimberly Amadeo. 2019. History of Recessions in the United States. website. Retrieved January 30, 2020 from https://www.thebalance.com/the-history-of-recessions-in-the-united-states-3306011.Google ScholarGoogle Scholar
  3. G.H. Ball and D.J. Hall. 1965. Isodata, a Novel Method of Data Analysis and Pattern Classification. Stanford Research Institute. https://books.google.com/books?id=Ti3BGwAACAAJGoogle ScholarGoogle Scholar
  4. Etienne Becht, Leland McInnes, John Healy, Charles-Antoine Dutertre, Immanuel WH Kwok, Lai Guan Ng, Florent Ginhoux, and Evan W Newell. 2019. Dimensionality reduction for visualizing single-cell data using UMAP. Nature biotechnology 37, 1 (2019), 38.Google ScholarGoogle Scholar
  5. Joachim A Behar, Aviv A Rosenberg, Ido Weiser-Bitoun, Ori Shemla, Alexan- dra Alexandrovich, Evgene Konyukhov, and Yael Yaniv. 2018. PhysioZoo: a novel open access platform for heart rate variability analysis of mammalian electrocardiographic data. Frontiers in physiology 9 (2018), 1390.Google ScholarGoogle Scholar
  6. J. Bernard, M. Hutter, M. Zeppelzauer, D. Fellner, and M. Sedlmair. 2018. Com- paring Visual-Interactive Labeling with Active Learning: An Experimental Study. IEEE Transactions on Visualization and Computer Graphics 24, 1 (Jan 2018), 298--308. https://doi.org/10.1109/TVCG.2017.2744818Google ScholarGoogle ScholarCross RefCross Ref
  7. Michael Bostock, Vadim Ogievetsky, and Jeffrey Heer. 2011. D3 data-driven documents. IEEE transactions on visualization and computer graphics 17, 12 (2011), 2301--2309.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Peter J Brockwell and Richard A Davis. 2016. Introduction to time series and forecasting. springer.Google ScholarGoogle Scholar
  9. Robert B Cleveland, William S Cleveland, Jean E McRae, and Irma Terpenning. 1990. STL: A seasonal-trend decomposition. Journal of official statistics 6, 1 (1990), 3--73.Google ScholarGoogle Scholar
  10. Tuan Nhon Dang, Anushka Anand, and Leland Wilkinson. 2013. TimeSeer: Scagnostics for high-dimensional time series. IEEE Transactions on Visualization and Computer Graphics 19, 3 (2013), 470--483. https://doi.org/10.1109/TVCG.2012.128Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Tuan Nhon Dang and Leland Wilkinson. 2013. TimeExplorer: Similarity search time series by their signatures. In International Symposium on Visual Computing. Springer, 280--289.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Tuan Nhon Dang and Leland Wilkinson. 2014. ScagExplorer: Exploring scatter- plots by their scagnostics. IEEE Pacific Visualization Symposium (2014), 73--80. https://doi.org/10.1109/PacificVis.2014.42Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Jon Danielsson. 2011. Financial risk forecasting: the theory and practice of forecasting market risk with implementation in R and Matlab. Vol. 588. John Wiley & Sons.Google ScholarGoogle Scholar
  14. Samarjit Das. 1994. Time series analysis.Google ScholarGoogle Scholar
  15. Ben D Fulcher and Nick S Jones. 2014. Highly comparative feature-based time- series classification. IEEE Transactions on Knowledge and Data Engineering 26, 12 (2014), 3026--3037.Google ScholarGoogle ScholarCross RefCross Ref
  16. Ben D Fulcher and Nick S Jones. 2017. hctsa: A computational framework for automated time-series phenotyping using massive feature extraction. Cell systems 5, 5 (2017), 527--531.Google ScholarGoogle Scholar
  17. Ary L Goldberger, Luis AN Amaral, Leon Glass, Jeffrey M Hausdorff, Plamen Ch Ivanov, Roger G Mark, Joseph E Mietus, George B Moody, Chung-Kang Peng, and H Eugene Stanley. 2000. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation 101, 23 (2000), e215-e220.Google ScholarGoogle ScholarCross RefCross Ref
  18. John A. Hartigan. 1975. Clustering Algorithms (99th ed.). John Wiley & Sons, Inc., New York, NY, USA.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. DM Hawkins. 1980. Monographs on Applied Probability and Statistics.Google ScholarGoogle Scholar
  20. Boris Iglewicz and David Hoaglin. 1993. Volume 16: how to detect and handle outliers. The ASQC basic references in quality control: statistical techniques 16 (1993).Google ScholarGoogle Scholar
  21. Boris Iglewicz and David Caster Hoaglin. 1993. How to detect and handle outliers. Vol. 16. Asq Press.Google ScholarGoogle Scholar
  22. A. K. Jain, M. N. Murty, and P. J. Flynn. 1999. Data Clustering: A Review. ACM Comput. Surv. 31, 3 (Sept. 1999), 264--323. https://doi.org/10.1145/331499.331504Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Yanfei Kang, Rob J Hyndman, and Kate Smith-Miles. 2017. Visualising forecasting algorithm performance using time series instance spaces. International Journal of Forecasting 33, 2 (2017), 345--358.Google ScholarGoogle ScholarCross RefCross Ref
  24. Eamonn Keogh, Stefano Lonardi, and Bill'Yuan-chi' Chiu. 2002. Finding surprising patterns in a time series database in linear time and space. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 550--556.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Ragnar H Lesch, Yannick Caillé, and David Lowe. 1999. Component analysis in financial time series. In Proceedings of the IEEE/IAFE 1999 Conference on Computational Intelligence for Financial Engineering (CIFEr)(IEEE Cat. No. 99TH8408). IEEE, 183--190.Google ScholarGoogle ScholarCross RefCross Ref
  26. Yuan Li, Jessica Lin, and Tim Oates. 2012. Visualizing variable-length time series motifs. In Proceedings of the 2012 SIAM international conference on data mining. SIAM, 895--906.Google ScholarGoogle ScholarCross RefCross Ref
  27. Jessica Lin, Eamonn Keogh, and Stefano Lonardi. 2005. Visualizing and discover- ing non-trivial patterns in large time series databases. Information visualization 4, 2 (2005), 61--82.Google ScholarGoogle Scholar
  28. Carl H Lubba, Sarab S Sethi, Philip Knaute, Simon R Schultz, Ben D Fulcher, and Nick S Jones. 2019. catch22: CAnonical Time-series CHaracteristics. Data Mining and Knowledge Discovery 33, 6 (2019), 1821-1852.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research 9, Nov (2008), 2579--2605.Google ScholarGoogle Scholar
  30. Spyros Makridakis and Michele Hibon. 2000. The M3-Competition: results, conclusions and implications. International journal of forecasting 16, 4 (2000), 451--476.Google ScholarGoogle ScholarCross RefCross Ref
  31. Henry B Mann. 1945. Nonparametric tests against trend. Econometrica: Journal of the Econometric Society (1945), 245--259.Google ScholarGoogle Scholar
  32. Robert McGill, John W Tukey, and Wayne A Larsen. 1978. Variations of box plots. The American Statistician 32, 1 (1978), 12--16.Google ScholarGoogle ScholarCross RefCross Ref
  33. Leland McInnes, John Healy, and James Melville. 2018. UMAP: Uni- form Manifold Approximation and Projection for Dimension Reduction. arXiv:stat.ML/1802.03426Google ScholarGoogle Scholar
  34. Alex Nanopoulos, Rob Alcock, and Yannis Manolopoulos. 2001. Feature-based classification of time-series data. International Journal of Computer Research 10, 3 (2001), 49--61.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. NIST/SEMATECH. 2013. e-Handbook of Statistical Methods. e-handbook. Re- trieved January 3, 2020 from https://www.itl.nist.gov/div898/handbook/index. htm.Google ScholarGoogle Scholar
  36. U.S. Bureau of Labor Statistics. 2019. 2020 Census and CES Employment by State. website. Retrieved January 30, 2020 from https://www.bls.gov/sae/additional-resources/ces-state-and-area-census-2020-workers.htm.Google ScholarGoogle Scholar
  37. Bernard Rosner. 1983. Percentage points for a generalized ESD many-outlier procedure. Technometrics 25, 2 (1983), 165--172.Google ScholarGoogle ScholarCross RefCross Ref
  38. Pavel Senin, Jessica Lin, Xing Wang, Tim Oates, Sunil Gandhi, Arnold P Boedihardjo, Crystal Chen, Susan Frankenstein, and Manfred Lerner. 2014. Grammarviz 2.0: a tool for grammar-based pattern discovery in time series. In Joint European conference on machine learning and knowledge discovery in databases. Springer, 468--472.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. National Park Service. 2017. Hurricanes Irma and Maria. website. Retrieved January 30, 2020 from https://www.nps.gov/viis/learn/nature/2017-hurricanes. htm.Google ScholarGoogle Scholar
  40. Ori Shemla and Joachim Behar. 2019. PhysioZoo - mammalian NSR databases. https://doi.org/10.13026/P63Q-HQ95Google ScholarGoogle ScholarCross RefCross Ref
  41. Ruey S Tsay. 1988. Outliers, level shifts, and variance changes in time series. Journal of forecasting 7, 1 (1988), 1--20.Google ScholarGoogle ScholarCross RefCross Ref
  42. Laurens Van Der Maaten. 2014. Accelerating t-SNE using tree-based algorithms. The Journal of Machine Learning Research 15, 1 (2014), 3221--3245.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Laurens Van Der Maaten, Eric Postma, and Jaap Van den Herik. 2009. Dimensionality reduction: a comparative. J Mach Learn Res 10, 66--71 (2009), 13.Google ScholarGoogle Scholar
  44. Michail Vlachos, Philip Yu, and Vittorio Castelli. 2005. On periodicity detection and structural periodic similarity. In Proceedings of the 2005 SIAM international conference on data mining. SIAM, 449--460.Google ScholarGoogle ScholarCross RefCross Ref
  45. Tien Hoang-Thuy Vo, Tran Luu-Nha Dang, Ngan Vuong-Thuy Nguyen, and Tuan Van Huynh. 2019. Classification Electroencephalography Using Machine Learn- ing. In 2019 19th International Symposium on Communications and Information Technologies (ISCIT). IEEE, 14--19.Google ScholarGoogle Scholar
  46. Svante Wold, Kim Esbensen, and Paul Geladi. 1987. Principal component analysis. Chemometrics and intelligent laboratory systems 2, 1--3 (1987), 37--52.Google ScholarGoogle Scholar
  47. Kiyoung Yang and Cyrus Shahabi. 2004. A PCA-based similarity measure for multivariate time series. In Proceedings of the 2nd ACM international workshop on Multimedia databases. 65--74.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Matthew Yglesias. 2013. The U.S. Virgin Islands Are in a Catastrophic Recession. website. Retrieved January 30, 2020 from https://slate.com/business/2013/08/virgin-islands-recession.html.Google ScholarGoogle Scholar

Index Terms

  1. Visual Features for Multivariate Time Series

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      IAIT '20: Proceedings of the 11th International Conference on Advances in Information Technology
      July 2020
      370 pages
      ISBN:9781450377591
      DOI:10.1145/3406601

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 July 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate20of47submissions,43%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader