Abstract
In recent years the amount of data available at World Wide Web grew drastically. Therefore the need to analyze these large amount of data is a problem faced by researchers in this field. The problem that occurs is analyzing this data in real time before they are stored. Analyzing network log data in order to find system anomalies is another problem that researchers face. There are many tools that are used in order to detect anomalies in real time big data but in this research Fluentd is used. Submission of questionnaires in such large quantities of data is a way to take our decisions in various businesses and organizations. Another challenge is how to monitor the data generated in the network and to detect errors in order to scale up performance. To enable the infrastructure to detect anomalies in streaming data outlier detection plugin for Fluentd is implemented. Another important thing is how to visualize the result in order to have a legible result. In this paper we show the visualization of Syslog by using Kibana, and also how the Fluentd plugin helps us to identify anomalies in real time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Aggarwal, C.C.: Outlier Analysis. Springer, New York (2013)
Hasani, Z., Kon-Popovska, M., Velinov, G.: Survey of technologies for real time big data streams analytic. In: 11th International Conference on Informatics and Information Technologies, 11–13 April 2014, Bitola, Macedonia (2014)
Hasani, Z., Kon-Popovska, M., Velinov, G.: Lambda architecture for real time big data analytic. In: ICT Innovations 2014 Web Proceedings. ISSN 1857-7288 (2014)
Hasani, Z.: Performance comparison throw running job in Hadoop by defining the number of maps and reduces. In: 12th International Conference on Informatics and Information Technologies 2015, 24–26 April 2015, Bitola, Macedonia (2015)
Hasani, Z.: Virtuoso, system for saving semantic data. In: 12th International Conference on Informatics and Information Technologies 2015. 24–26 April 2015, Bitola, Macedonia (2015)
Tamura, K.: Elasticsearch, Fluentd, and Kibana: Open Source Log Search and Visualization. https://www.digitalocean.com/community/tutorials/elasticsearch-fluentd-and-kibana-open-source-log-search-and-visualization. Accessed 8 Oct 2015
GitHub: fluent-plugin-anomalydetect. https://github.com/muddydixon/fluent-plugin-anomalydetect. Accessed 8 Oct 2015
Fluentd. http://www.fluentd.org/architecture. Accessed 8 Oct 2015
Apache Lucena. https://lucene.apache.org/. Accessed 30 April 2015
Hasani, Z., Jakimovski, B., Kon-Popovska, M., Velinov, G.: Real time analytic of SQL queries based on log analytic. In: ICT Innovations 2015 Web Proceedings (2015). ISSN 1857-7288. http://proceedings.ictinnovations.org/attachment/conference/12/ict-innovations-2015-web-proceedings.pdf
Tamura, K.: Elasticsearch, Fluentd, and Kibana: Open Source Log Search and Visualization. https://www.digitalocean.com/community/tutorials/elasticsearch-fluentd-and-kibana-open-source-log-search-and-visualization. Accessed 7 Jan 2016
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Hasani, Z. (2017). Implementation of Infrastructure for Streaming Outlier Detection in Big Data. In: Rocha, Á., Correia, A., Adeli, H., Reis, L., Costanzo, S. (eds) Recent Advances in Information Systems and Technologies. WorldCIST 2017. Advances in Intelligent Systems and Computing, vol 570. Springer, Cham. https://doi.org/10.1007/978-3-319-56538-5_51
Download citation
DOI: https://doi.org/10.1007/978-3-319-56538-5_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-56537-8
Online ISBN: 978-3-319-56538-5
eBook Packages: EngineeringEngineering (R0)