Article

Knowledge discovery in very large databases

Author:
Xindong Wu

University of Vermont

University of Vermont
View Profile

SEKE '02: Proceedings of the 14th international conference on Software engineering and knowledge engineeringJuly 2002Pages 15https://doi.org/10.1145/568760.568764

Published:15 July 2002Publication History

SEKE '02: Proceedings of the 14th international conference on Software engineering and knowledge engineering

Pages 15

ABSTRACT

Dealing with very large databases is one of the defining challenges in data mining research and development. When a data base is not a static repository of data, or if the data come from different data sources and putting all data together might amass a huge database for centralized processing, knowledge discovery in such data environments cannot be a one-time process. Existing techniques include data sampling, windowing, bagging, boosting, batch learning, hierarchical meta-learning, and parallel and distributed data mining. This talk will provide a review on these techniques, and present our own recent research efforts on multi-layer induction and synthesizing association rules from different data sources.

Knowledge discovery in very large databases
1. Information systems
  1. Information systems applications

Recommendations

Mining concept associations for knowledge discovery in large textual databases
SAC '05: Proceedings of the 2005 ACM symposium on Applied computing

In this paper, we describe a new approach for mining concept associations from large text collections. The concepts are short sequences of words that occur frequently together across the text collections. It is these concepts that convey most of the ...
Read More
Discovery of Direct and Indirect Association Patterns in Large Transaction Databases
CIS '07: Proceedings of the 2007 International Conference on Computational Intelligence and Security

Association rules mining is one of the important tasks in data mining research. While most of the existing discovery algorithms are dedicated to efficiently mining of frequent patterns, it has been noted recently that some of the infrequent patterns can ...
Read More
Knowledge Discovery in Multiple Databases
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SEKE '02: Proceedings of the 14th international conference on Software engineering and knowledge engineering
July 2002
859 pages
ISBN:1581135564
DOI:10.1145/568760
Conference Chairs:
Genny Tortora,
Shi-Kuo Chang
Copyright © 2002 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 15 July 2002
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- Article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 191
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Knowledge discovery in very large databases

SEKE '02: Proceedings of the 14th international conference on Software engineering and knowledge engineering

ABSTRACT

Cited By

Recommendations

Mining concept associations for knowledge discovery in large textual databases

Discovery of Direct and Indirect Association Patterns in Large Transaction Databases

Knowledge Discovery in Multiple Databases