Topic 5 Parallel and Distributed Databases, Data Mining and Knowledge Discovery

Talia, Domenico; Kargupta, Hillol; Valduriez, Patrick; Camacho, Rui

doi:10.1007/11549468_40

Topic 5 Parallel and Distributed Databases, Data Mining and Knowledge Discovery

Domenico Talia¹⁸,
Hillol Kargupta¹⁸,
Patrick Valduriez¹⁸ &
…
Rui Camacho¹⁸

Conference paper

944 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3648))

Abstract

To manage the very large amount of data available today, computer scientists are working on efficient systems, algorithms and applications that can handle and analyze very large databases. Intensive data consuming applications are running on very large databases (on data warehouses, on multimedia databases) with the task to extract information diamonds. Data mining is one of the key applications here. However, these intensive data consuming applications suffer from performance problems and single database sources. Introducing data distribution and parallel processing help to overcome resource bottlenecks and to achieve guaranteed throughput, quality of service, and system scalability. Distributed architectures, cluster systems and P2P systems, supported by high performance networks and intelligent middleware offer parallel and distributed databases a great opportunity to support cost-effective everyday applications.

Data processing and knowledge discovery on large data sources can benefit from parallel and distributed computing both to improve performance and quality of results. Development of data mining tools on high-performance parallel computers allows for analyzing massive databases in a reasonable time. Faster processing also means that users can experiment with more models to understand complex data. Furthermore, high performance makes it practical for users to analyze greater quantities of data. Distribution of data sources and data mining tasks is another key issue that the increasing decentralization of human activities and large availability of connection facilities are making more and more critical.

This year, 9 papers discussing some the those issues were submitted to this topic. Each paper was reviewed by at least three reviewers and, finally, we were able to select 3 regular papers. The accepted papers discuss very interesting issues such as middleware for database replication, mining global association rules on Grids, and hierarchical aggregation in networked aata management.

We would like to take the opportunity of thanking the authors who submitted a contribution, as well as the Euro-Par Organizing Committee, and the referees with there highly useful comments, whose efforts have made this conference, and Topic 5 possible.

Download to read the full chapter text

Chapter PDF

Author information

Authors and Affiliations

Topic Chairs,
Domenico Talia, Hillol Kargupta, Patrick Valduriez & Rui Camacho

Authors

Domenico Talia
View author publications
You can also search for this author in PubMed Google Scholar
Hillol Kargupta
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Valduriez
View author publications
You can also search for this author in PubMed Google Scholar
Rui Camacho
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Topic Chairs,
José C. Cunha
Faculdade de Ciências e Technologia CITI Centre, Quinta da Torre, Universidade Nova de Lisboa, 2829-516, Caparica, Portugal
Pedro D. Medeiros

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Talia, D., Kargupta, H., Valduriez, P., Camacho, R. (2005). Topic 5 Parallel and Distributed Databases, Data Mining and Knowledge Discovery. In: Cunha, J.C., Medeiros, P.D. (eds) Euro-Par 2005 Parallel Processing. Euro-Par 2005. Lecture Notes in Computer Science, vol 3648. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11549468_40

Download citation

DOI: https://doi.org/10.1007/11549468_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28700-1
Online ISBN: 978-3-540-31925-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics