AutoML Meets Time Series Regression Design and Analysis of the AutoSeries Challenge

Xu, Zhen; Tu, Wei-Wei; Guyon, Isabelle

doi:10.1007/978-3-030-86517-7_3

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 12979)

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

Abstract

Analyzing time series better with limited human effort is of interest to both academia and industry. Driven by business scenarios, we organized the first Automated Time Series Regression challenge (AutoSeries) for the WSDM Cup 2020. We present its design, analysis, and post-hoc experiments. The code submission requirement precluded any manual intervention by participants, testing the automated machine learning capabilities of solutions across many datasets, under hardware and time limitations. We prepared 10 datasets from diverse application domains (sales, power consumption, air quality, traffic, and parking), featuring missing data, mixed continuous and categorical variables, and various sampling rates. Each dataset was split into a training and a test sequence (which was streamed, allowing models to continuously adapt). The setting of “time series regression” differs from classical forecasting in that covariates at the present time are known. Participants made great strides in tackling this AutoSeries problem, as demonstrated by the jump in performance over the sample submission and by post-hoc comparisons with AutoGluon. Simple yet effective methods were used, based on feature engineering, LightGBM, and random search hyper-parameter tuning, addressing all aspects of the challenge. Our post-hoc analyses revealed that providing additional time did not yield significant improvements. The winners’ code was open-sourced (https://www.4paradigm.com/competition/autoseries2020).
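
The streamed test setting described in the abstract can be illustrated with a minimal Python sketch. It is not the challenge's actual API or any participant's solution: the names `lag_features`, `LastValueModel`, and `stream_evaluate` are hypothetical, the placeholder model simply repeats the last observed target, and the use of RMSE as the score is an assumption made for illustration. The sketch only shows the protocol: at each test step the present covariates are known, a prediction is made, and the true target is then revealed so the model can adapt.

```python
import math
from typing import List, Sequence, Tuple


def lag_features(x_t: Sequence[float], past_y: Sequence[float], n_lags: int) -> List[float]:
    """Concatenate the present covariates with the last n_lags known targets."""
    lags = list(past_y[-n_lags:]) if n_lags > 0 else []
    # Pad on the left with the earliest available lag (or 0.0) if history is short.
    pad_value = lags[0] if lags else 0.0
    lags = [pad_value] * (n_lags - len(lags)) + lags
    return list(x_t) + lags


class LastValueModel:
    """Hypothetical placeholder model: predicts the most recent known target."""

    def predict(self, features: Sequence[float]) -> float:
        # The last feature is the most recent lagged target.
        return features[-1]

    def update(self, features: Sequence[float], y_true: float) -> None:
        # A real solution might refit or incrementally update here.
        pass


def stream_evaluate(model, train_y, test_x, test_y, n_lags=2) -> Tuple[List[float], float]:
    """Predict each test target from present covariates and past targets,
    then reveal the true target before moving to the next step."""
    history = list(train_y)
    preds = []
    for x_t, y_t in zip(test_x, test_y):
        feats = lag_features(x_t, history, n_lags)
        preds.append(model.predict(feats))
        model.update(feats, y_t)   # the model may adapt once y_t is revealed
        history.append(y_t)        # y_t becomes a known past value
    # RMSE is assumed here as the evaluation score.
    rmse = math.sqrt(sum((p - y) ** 2 for p, y in zip(preds, test_y)) / len(test_y))
    return preds, rmse


if __name__ == "__main__":
    preds, rmse = stream_evaluate(
        LastValueModel(), [1.0, 2.0], [[0.5], [0.7], [0.9]], [3.0, 4.0, 5.0]
    )
    print(preds, rmse)
```

In a fuller pipeline, the placeholder model could be swapped for a gradient-boosted regressor trained on the same kind of feature rows, with hyper-parameters drawn by random search, in the spirit of the methods the abstract reports.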

Notes

1. https://archive.physionet.org/physiobank/database/santa-fe/.
2. http://www.neural-forecasting-competition.com/.
3. https://www.kaggle.com/c/m5-forecasting-accuracy.
4. https://www.kaggle.com/c/web-traffic-time-series-forecasting.
5. http://automl.chalearn.org, http://autodl.chalearn.org.
6. https://www.automl.ai/competitions/3.
7. https://autodl.lri.fr/competitions/64.
8. In some application domains (not considered in this paper), even future values of the covariates, at times $\{t+1, \cdots, t+t_{max}\}$, may be considered. An example would be “simultaneous translation” with a small lag.
9. https://www.kaggle.com/c/web-traffic-time-series-forecasting.
10. https://doc.dataiku.com/dss/latest/time-series/data-formatting.html.
11. https://autodl.lri.fr/.
12. https://hub.docker.com/r/vergilgxw/autotable.
13. https://autodl.lri.fr/competitions/149#results.
14. https://keras-team.github.io/keras-tuner/.

Author information

Authors and Affiliations

4Paradigm, Beijing, China
Zhen Xu & Wei-Wei Tu
LISN CNRS/INRIA, Gif-sur-Yvette, France
Isabelle Guyon
University Paris-Saclay, Gif-sur-Yvette, France
Isabelle Guyon
ChaLearn, California, USA
Isabelle Guyon

Corresponding author

Correspondence to Zhen Xu.

Editor information

Editors and Affiliations

Facebook AI, Seattle, WA, USA
Yuxiao Dong
Torre Telefonica, Barcelona, Spain
Nicolas Kourtellis
Bielefeld University, CITEC, Bielefeld, Germany
Barbara Hammer
Basque Center for Applied Mathematics, Bilbao, Spain
Jose A. Lozano

About this paper

Cite this paper

Xu, Z., Tu, WW., Guyon, I. (2021). AutoML Meets Time Series Regression Design and Analysis of the AutoSeries Challenge. In: Dong, Y., Kourtellis, N., Hammer, B., Lozano, J.A. (eds) Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. ECML PKDD 2021. Lecture Notes in Computer Science, vol 12979. Springer, Cham. https://doi.org/10.1007/978-3-030-86517-7_3

DOI: https://doi.org/10.1007/978-3-030-86517-7_3
Published: 10 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86516-0
Online ISBN: 978-3-030-86517-7
eBook Packages: Computer Science, Computer Science (R0)
