ABSTRACT
Big data analytics platforms on cloud are becoming mainstream technology enabling cost-effective rapid deployment of customer's Big Data applications delivering quicker insights from their data. It is, therefore, even more imperative that we have high performant platform infrastructure and application at a reasonable cost. This is only possible if we make a transition from traditional approach to execute and measure performance by adopting new AI techniques such as Machine Learning (ML) & predictive approach to performance benchmarking for every application domain.
This paper proposes a high-level conceptual model for automated performance benchmarking which includes execution engine that has been designed to support a self-service model covering automated benchmarking in every application domain. The automated engine is supported by performance scaling recommendations via prescriptive analytics from real performance data set.
We furthermore extended the recommendation capabilities of our self-service automated engine by introducing predictive analytics for making it more flexible in addressing 'what-if' scenarios to predict 'Right Scale' with measurement of "Performance Cost Ratio" (PCR). Finally, we also present some real-world industry examples which have seen the performance benefits in their applications with the recommendations given by our proposed model.
- Demystifying Cloud Benchmarking Paradigm: An In-Depth View, Jul 2012, IEEE 36th International Conference on Computer Software and Applications COMPSAC 2012 {Jayanti Vemulapati, Venu Vedam} Google ScholarDigital Library
- BigDataBench, A Big Data and AI Benchmark Suite, ICT, Chinese Academy of Sciences, http://prof.ict.ac.cn/Google Scholar
- YCSB, https://github.com/cloudius-systems/osv/wiki/Benchmarking-Cassandra-and-other-NoSQL-databases-with-YCSBGoogle Scholar
- TPC, http://www.tpc.org/information/about/about.aspGoogle Scholar
- TPC-DS, http://www.tpc.org/tpcds/Google Scholar
- Benchmarking -- Wikipedia. http://en.wikipedia.org/wiki/BenchmarkingGoogle Scholar
- Amazon Web Services. http://aws.amazon.comGoogle Scholar
- Amazon Instance Types. http://aws.amazon.com/ec2/instance-types/Google Scholar
- NetPerf. http://www.netperf.orgGoogle Scholar
- NetSpec. http://www.ittc.ku.edu/netspec/Google Scholar
Index Terms
- AI Based Performance Benchmarking & Analysis of Big Data and Cloud Powered Applications: An in Depth View
Recommendations
Issues in big data testing and benchmarking
DBTest '13: Proceedings of the Sixth International Workshop on Testing Database SystemsThe academic community and industry are currently researching and building next generation data management systems. These systems are designed to analyze data sets of high volume with high data ingest rates and short response times executing complex ...
Tutorial on Benchmarking Big Data Analytics Systems
ICPE '20: Companion of the ACM/SPEC International Conference on Performance EngineeringThe proliferation of big data technology and faster computing systems led to pervasions of AI based solutions in our life. There is need to understand how to benchmark systems used to build AI based solutions that have a complex pipeline of pre-...
Big data analytics in Cloud computing: an overview
AbstractBig Data and Cloud Computing as two mainstream technologies, are at the center of concern in the IT field. Every day a huge amount of data is produced from different sources. This data is so big in size that traditional processing tools are unable ...
Comments