Abstract
Massive collection of data at high rates is critical for many industries. Typically, a massive stream of records is gathered from the business information network at a very high rate. Because of the complexity of the collection process, the classical database solution falls short. The high volume and rate of records involved requires a heterogeneous pipeline comprised of two major parts: a system that carries out massive collection and then uploads the information to a database, and a subsequent data analysis and management system consisting of an Extract Transform and Load component. We developed a massive collection and loading system, based on a highly scalable heterogeneous architecture solution. The solution has been applied successfully for Telco revenue assurance, and can be applied to other industrial areas. The solution was successful in scaling up a Telco client system to handle streams of records ten times larger than was previously possible.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
WebSphere Application Server V6 Scalability and Performance Handbook, IBM Redbook, SG24-6392-00, ISBN: 0738490601
WebSphere Application Server V6 Technical Overview, by Carla Sadtler, IBM Redbook, REDP-3918-00
WebSphere Application Server V6 Planning and Design WebSphere Handbook Series, IBM Redbook, SG24-6446-00, ISBN: 0738492183
JavaTM Message Service Specification Version 1.1, Sun Microsystems® (April 2002), http://java.sun.com/products/jms/docs.html
JavaTM 2 Platform Enterprise Edition Specification, v1.4, Sun Microsystems® (November 2003), http://java.sun.com/j2ee/j2ee-1_4-fr-spec.pdf
IBM DB2 Information Management (Accessed January 29, 2007), http://www.redbooks.ibm.com/portals/Data
The Eclipse Modeling Framework (Accessed Janurary 29, 2007), http://www.eclipse.org/modeling/emf/?project=emf
The Eclipse project organization (Accessed January 29, 2007), http://www.eclipse.org
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shani, U., Sela, A., Akilov, A., Skarbovski, I., Berk, D. (2007). A Scalable Heterogeneous Solution for Massive Data Collection and Database Loading. In: Bussler, C., Castellanos, M., Dayal, U., Navathe, S. (eds) Business Intelligence for the Real-Time Enterprises. BIRTE 2006. Lecture Notes in Computer Science, vol 4365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73950-0_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-73950-0_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73949-4
Online ISBN: 978-3-540-73950-0
eBook Packages: Computer ScienceComputer Science (R0)