Abstract
Immense amounts of unstructured data account for up to 90% of all human generated data, yet the attempts to extract significant value from it with Machine Learning (ML) and Big Data (BD) technologies yield limited successes. We propose a generic approach to deep data summarization and subsequent automated ML design optimization to extract maximum predictive value from big data. Knowledge summarization is a central component of the proposed methodology and we argue that coupled with strictly linear modeling complexity, hierarchical decomposition and optimized model design may define a backbone of the new platform for automated and scalable construction of robust ML models. We consider ML build process as data journeys through the layers of modeling that consistently follow the same patterns of data summarization and transformation at the subsequent layers of abstraction. In such framework we argue that the robust construction of the ML model can be achieved through hierarchical greedy optimization of the links between connected ML model components. We demonstrate several case studies of deep data summarization and automated ML model design on text, numerical time series and images data. We point out that application awareness allows to deepen data summarizations while maintaining or improving its predictive value.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yu, P., McLaughlin, J., Levy, M., Data, B.: A big disappointment for scoring consumer credit risk. NCLC report (2014)
Sicular, S.: Big Data is Falling into the Through of Disillusionment. Gartner Blog Network (2013)
Bengio, Y., Goodfellow, I., Courville, A.: Deep Learning. MIT Press, Cambridge (2015)
Schmidhuber, J.: Deep learning in neural networks: an overview. Nural Netw. 61, 85–117 (2015)
Zhang, Z., Huang, Z., Zhang, Z.: Knowledge summarization for scalable semantic data processing. J. Comput. Inf. Syst. 6(12), 3893–3902 (2010)
Changsheng, X., Maddage, M.C., Xi, S.: Automatic music classification and summarization. IEEE Trans. Speech Audio 13(3), 441–450 (2004)
Ekin, A., Tekalp, A.M.: Automatic soccer video analysis and summarization. IEEE Trans. Image Process. 12(7), 796–807 (2003)
Hori, C., Furui, S.: A new approch to automatic speech summarization. IEEE Trans. Multimedia 5(3), 368–378 (2003)
Yang, C., Junsong, Y., Jiebo, L.: Towards scalable summarization of consumer videos via sparse dictionary selection. IEEE Trans. Multimedia 14(1), 66–75 (2012)
Chandola, V., Kumar, V.: Summarization - compressing data into an informative representation. In: Proceedings 5th IEEE International Conference on Data Mining, New Orleans (2005)
Hahn, U., Mani, I.: The challenges of automatic summarization. Computer 33(11), 29–36 (2000)
Bishop, C.M.: Model-based machine learning. Philos. Trans. R. Soc. A (2013)
Langley, P.: Artificial intelligence and cognitive systems. AISB Q. 133, 1–4 (2012)
Goldstein, J., Kantrowitz, M., Mittal, V., Carbonell, J.: Summarizing text documents: sentence selection and evaluation metrices. In: Proceedings 22nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 121–128 (1999)
Steinberger, J., Jezek, K.: Evaluation measures for text summarization. Comput. Inf. 28, 1001–1026 (2009)
d’Acierno, A., Moscato, V., Persia, F., Picariello, A., Penta, A.: Semantic summarization of web documents. In: Proceedings IEEE 4th International Conference on Semantic Computingng, pp: 430–435 (2010)
Verma, R., Chen, P., Lu, W.: A semantic free-text summarization system using ontology knowledge. IEEE Trans. Inf. Technol. Biomed. 5(4), 261–270 (2007)
Li, T., Zhu, S., Ogihara, M.: Hierarchical document classification using automatically generated hierarchy. J. Intell. Inf. Syst. 29(2), 211–230 (2007)
Vohra, S.M., Teraiya, J.B.: A comparative study of sentiment analysis techniques. J. Inf. Knowl. Res. Comput. Eng. 2(2), 313–317 (2013)
Saeys, Y., Inza, I., Larranaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 19(23), 2507–2517 (2007)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, New York (2001)
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)
Pan, F., Wang, W., Tung, K.H., Yang, J.: Finding Representative Set from Massive Data. Springer, New York (2006)
Aggarwal, C.C., Reddy, C.K.: Data Clustering: Algorithms and Applications. CRC Press, Boca Raton (2014)
Roth, V.: Outlier detection with one-class kernel fisher discriminant. Adv. Neural Inf. Process. Syst. 17, 1169–1176 (2004)
Mitra, P.: Density-based multiscale data condensation. IEEE Trans. Pattern Anal. Mach. Intell. 24(6), 734–747 (2002)
Yang, P., Li, J.-S., Huang, Y.-X.: HDD: a hypercube division-based algorithm for discretisation. Int. J. Sys. Sci. 42(4), 557–566 (2010)
Sauvola, J., Pietikainen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2000)
Zhou, X., Wang, X., Dougherty, E.R.: Binarization of microarray data on the basis of a mixture model. Mol. Cancer Ther. 2(7), 679–684 (2003)
Tomas, J., Cascuberta, F.: Binary feature classification for word disambiguation in statistical machine translation. In: Proceedings 2nd International Workshop on Pattern Recognition in Information Systems (2002)
Mitchell, T.: Generative and discriminative classifiers: naive bayes and logistic regression. In: Machine Learning. McGraw Hill (2010)
Ng, A.Y., Jordan, M.I.: On discriminative vs. generative classifiers: a comparison of logistic regression and naive Bayes. In: NIPS 14 (2002)
Friedman, J., Hastie, T., Tibshirani, R.: Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33(1), 1–22 (2010)
Sugiyama, M., Yamamoto, A.: A fast and flexible clustering algorithm using binary discretization. In: 11th IEEE International Conference on Data Mining, pp. 1213–1217 (2011)
Dietterich, T.G., Bakiri, G.: Solving multiclass learning problems via error-correcting output codes. J. Artif. Intell. Res. 2, 263–286 (1995)
Aly, M.: Survey on multiclass classification methods. Caltech Technical report (2005)
Rokach, L.: Pattern Classification Using Ensemble Methods. World Scientific, River Edge (2010)
Maimon, O., Rokach, L.: Improving supervised learning by feature decomposition. In: Eiter, T., Schewe, K.-D. (eds.) FoIKS 2002. LNCS, vol. 2284, pp. 178–196. Springer, Heidelberg (2002). doi:10.1007/3-540-45758-5_12
Drineas, P., Mahoney, M.W., Muthukrishnan, S., Sampling, S.: Approximation, relative-error matrix: column-row-based methods. In: Proceedings 14th Annual Symposium on Algorithms, pp. 304–314 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Ruta, D., Cen, L., Damiani, E. (2017). Summarization-Guided Greedy Optimization of Machine Learning Model. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2017. Lecture Notes in Computer Science(), vol 10358. Springer, Cham. https://doi.org/10.1007/978-3-319-62416-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-62416-7_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62415-0
Online ISBN: 978-3-319-62416-7
eBook Packages: Computer ScienceComputer Science (R0)