Abstract
In order to meet the mounting social and economic demands, railway operators and manufacturers are striving for a longer availability and a better reliability of railway transportation systems. Commercial trains are being equipped with state-of-the-art onboard intelligent sensors monitoring various subsystems all over the train. These sensors provide real-time spatio-temporal data consisting of georeferenced timestamped events that tend sometimes to occur in bursts. Once ordered with respect to time, these events can be considered as long temporal sequences that can be mined for possible relationships leading to association rules. In this paper, we propose a methodology for discovering association rules in very bursty and challenging floating train data sequences with multiple constraints. This methodology is based on using null models to discover significant co-occurrences between pairs of events. Once identified and scrutinized by various metrics, these co-occurrences are then used to derive temporal association rules that can predict the imminent arrival of severe failures. Experiments performed on Alstom’s TrainTracerTM data show encouraging results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD 1993, pp. 207–216 (1993)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB 1994), pp. 487–499 (1994)
Bellwood, D., Wainwright, P., Fulton, C., Hoey, A.: Assembly rules and functional groups at global biogeographical scales. Functional Ecology 16, 557–562 (2002)
Cule, B., Goethals, B., Tassenoy, S., Verboven, S.: Mining train delays. In: Gama, J., Bradley, E., Hollmén, J. (eds.) IDA 2011. LNCS, vol. 7014, pp. 113–124. Springer, Heidelberg (2011)
Flier, H., Gelashvili, R., Graffagnino, T., Nunkesser, M.: Mining railway delay dependencies in large-scale real-world delay data. In: Ahuja, R.K., Möhring, R.H., Zaroliagis, C.D. (eds.) Robust and Online Large-Scale Optimization. LNCS, vol. 5868, pp. 354–368. Springer, Heidelberg (2009)
Gotelli, N.J., Graves, G.R.: Null models in ecology. Smisonian Inst. Press (1996)
Gotelli, N.: Null model analysis of species co-occurrence patterns. Ecology 81, 2606–2621 (2000)
Grahne, G., Wang, X., Laksgmanan, L.: Efficient mining of constrained correlated sets. In: International Conference on Data Engineering, p. 512 (2000)
Haiminen, N., Mannila, H., Terzi, E.: Determining significance of pairwise co-occurrences of events in bursty sequences. BMC Bioinformatics 9(1) (2008)
Hannenhalli, S., Levy, S.: Predicting transcription factor synergism. Nucleic Acids Res. 30(19) (2002)
Honda, S., Fukui, K., Moriyama, K., Kurihara, S., Numao, M.: Extracting human behaviors with infrared sensor network. In: Proceedings of the 4th International Conference on Networked Sensing Systems, INSS 2007, pp. 122–125 (2007)
Kerner, B., Demir, C., Herrtwich, R., Klenov, S., Rehborn, H., Aleksi, M., Haug, A.: Traffic state detection with floating car data in road networks. In: Proceedings of the International IEEE Conference on Intelligent Transportation Systems 2005, pp. 700–705 (2005)
Klein, H., Vingron, M.: Using transcription factor binding site co-occurrence to predict regulatory regions. Genome informatics. International Conference on Genome Informatics 18, 109–118 (2007)
Koh, Y.S., Pears, R., Yeap, W.: Valency based weighted asssociation rule mining. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010, Part I. LNCS (LNAI), vol. 6118, pp. 274–285. Springer, Heidelberg (2010)
Levy, S., Hannenhalli, S., Workman, C.: Enrichment of regulatory signals in conserved non-coding genomic sequence. Bioinformatics 17(10), 871–877 (2001)
Liu, Y., Xu, W., Du, H.: The method of test for state of railway tunnel lining based on association rules (May 2011)
Magnusson, S.: Discovering hidden time patterns in behavior: T-patterns and their detection. Behakvior Research Methods, Instruments, & Computers 32s, 93–110 (2000)
Mannila, H., Toivonen, H., Verkamo, A.: Discovery of frequent episodes in event sequences. Data Mining and Knowledge Discovery 1, 259–289 (1997)
Mirabadi, A., Sharifian, S.: Application of association rules in iranian railways (rai) accident data analysis. Safety Science 48(10), 1427–1435 (2010)
Ng, R., Lakshmanan, L., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained association rules. SIGMOD 27(2), 13–24 (1998)
Onboard diagnosis transforms maintenance: Alstom’s onboard diagnosis system traintracer is changing the way maintenance is planned and conducted. International Railway Journal (2009)
Salah, A., Pauwels, E., Tavenard, R., Gevers, T.: T-patterns revisited: Mining for temporal patterns in sensor data. Sensors 10(8), 7496–7513 (2010)
Sammouri, W., Côme, E., Oukhellou, L., Aknin, P., Fonlladosa, C.E., Prendergast, K.: Temporal association rule mining for the preventive diagnosis of onboard subsystems within floating train data framework. In: Proceedings of the IEEE International Conference on Intelligent Transportation Systems, ITSC 2012, pp. 1351–1356 (2012)
Tan, P., Steinbach, M., Kumar, V., Potter, C., Klooster, S., Torregrosa, A.: Finding spatio-temporal patterns in earth science data. In: Proceedings of the KDD Workshop on Temporal Data Mining (2001)
Van Zuylen, H., Chen, Y., Zheng, F.: Using floating car data for traffic state estimation in signalized urban networks. In: IWTDCS Barcelona 2008 (2008)
Weiss, G.: Timeweaver: A genetic algorithm for identifying predictive patterns in sequences of events, pp. 718–725 (1999)
Zaki, M.: SPADE: An efficient algorithm for mining frequent sequences. Machine Learning Journal 42, 31–60 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sammouri, W., Côme, E., Oukhellou, L., Aknin, P. (2013). Mining Floating Train Data Sequences for Temporal Association Rules within a Predictive Maintenance Framework. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2013. Lecture Notes in Computer Science(), vol 7987. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39736-3_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-39736-3_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39735-6
Online ISBN: 978-3-642-39736-3
eBook Packages: Computer ScienceComputer Science (R0)