Abstract
Performance logs contain rich information about a system’s state. Large-scale web service infrastructures deployed in the cloud are notoriously difficult to troubleshoot, especially performance bugs. Detecting, isolating and diagnosing fine-grained performance anomalies requires integrating system performance measures across space and time. To achieve scale, we present our megatables approach, which automatically interprets performance log data and outputs millibottleneck predictions along with supporting visualizations. We evaluate our method with three illustrative scenarios, and we assess its predictive ability. We also evaluate its ability to extract meaningful information from many log samples drawn from the wild.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kohavi, R., Longbotham, R.: Online experiments: lessons learned. Computer 40, 103–105 (2007)
Kohavi, R., Henne, R.M., Sommerfield, D.: Practical guide to controlled experiments on the web: listen to your customers not to the hippo. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2007)
Pu, C., et al.: The millibottleneck theory of performance bugs, and its experimental verification. In: 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS) (2017)
Du, M., Li, F., Zheng, G., Srikumar, V.: Deeplog: anomaly detection and diagnosis from system logs through deep learning. In: Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security (2017)
Gan, Y., et al.: Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices. In: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (2019)
Wang, Q., et al.: Lightning in the cloud: a study of transient bottlenecks on n-tier web application performance. In: 2014 Conference on Timely Results in Operating Systems ({TRIOS} 14) (2014)
Wang, Q., Kanemasa, Y., Kawaba, M., Pu, C.: When average is not average: large response time fluctuations in n-tier systems. In: Proceedings of the 9th International Conference on Autonomic Computing (2012)
Wang, Q., Kanemasa, Y., Li, J., Lai, C.A., Matsubara, M., Pu, C.: Impact of DVFS on n-tier application performance. In: Proceedings of the First ACM SIGOPS Conference on Timely Results in Operating Systems (2013)
Wang, Q., et al.: An experimental study of rapidly alternating bottlenecks in n-tier applications. In: 2013 IEEE Sixth International Conference on Cloud Computing (2013)
Park, J., Wang, Q., Li, J., Lai, C.-A., Zhu, T., Pu, C.: Performance interference of memory thrashing in virtualized cloud environments: a study of consolidated n-tier applications. In: 2016 IEEE 9th International Conference on Cloud Computing (CLOUD) (2016)
Gao, Y., Huang, S., Parameswaran, A.G.: Navigating the Data Lake with DATAMARAN - Automatically Extracting Structure from Log Datasets. SIGMOD Conference (2018)
Xu, W., Huang, L., Fox, A., Patterson, D., Jordan, M.I.: Detecting large-scale system problems by mining console logs. In: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (2009)
Nagaraj, K., Killian, C., Neville, J.: Structured comparative analysis of systems logs to diagnose performance problems. In: Presented as part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12) (2012)
Stenzel, O.: and. The Physics of Thin Film Optical Spectra. SSSS, vol. 44, pp. 163–180. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-21602-7_8
Zhang, K., Xu, J., Min, M. R., Jiang, G., Pelechrinis, K., Zhang, H.: Automated IT system failure prediction: a deep learning approach. In: 2016 IEEE International Conference on Big Data (Big Data) (2016)
Zhao, X., Rodrigues, K., Luo, Y., Yuan, D., Stumm, M.: Non-intrusive performance profiling for entire software stacks based on the flow reconstruction principle. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16) (2016)
Liu, L., Pu, C., Han, W.: XWRAP: an XML-enabled wrapper construction system for web information sources. In: Proceedings of 16th International Conference on Data Engineering (Cat. No. 00CB37073) (2000)
Han, W., Buttler, D., Pu, C.: Wrapping web data into XML. ACM SIGMOD Record 30, 33–38 (2001)
Arasu, A., Garcia-Molina, H.: Extracting structured data from web pages. SIGMOD Conference (2003)
Fisher, K., Walker, D., Zhu, K.Q., White, P.: From dirt to shovels - fully automatic tool generation from ad hoc data. POPL (2008)
He, P., Zhu, J., Zheng, Z., Lyu, M.R.: Drain: an online log parsing approach with fixed depth tree. In: 2017 IEEE International Conference on Web Services (ICWS)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Kimball, J., Lima, R.A., Pu, C. (2020). Finding Performance Patterns from Logs with High Confidence. In: Ku, WS., Kanemasa, Y., Serhani, M.A., Zhang, LJ. (eds) Web Services – ICWS 2020. ICWS 2020. Lecture Notes in Computer Science(), vol 12406. Springer, Cham. https://doi.org/10.1007/978-3-030-59618-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-59618-7_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59617-0
Online ISBN: 978-3-030-59618-7
eBook Packages: Computer ScienceComputer Science (R0)