ABSTRACT
We present AMADA, a platform for storing Web data (in particular, XML documents and RDF graphs) built on the Amazon Web Services (AWS) cloud infrastructure. AMADA follows a Software as a Service (SaaS) model, allowing users to upload, index, store, and query large volumes of Web data. The demonstration shows (i) the step-by-step procedure for building and exploiting the warehouse (storing, indexing, querying) and (ii) the monitoring tools that let users control the monetary costs charged by AWS for the operations performed while running AMADA.