Skip to main content

Document explorer: Discovering knowledge in document collections

  • Communications Session 2A Learning and Discovery Systems
  • Conference paper
  • First Online:
Book cover Foundations of Intelligent Systems (ISMIS 1997)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1325))

Included in the following conference series:

Abstract

Document Explorer is a data mining system for document collections. Such a collection represents an application domain, and the primary goal of the system is to derive patterns that provide knowledge about this domain. Additionally, the derived patterns can be used to browse the collection. Document Explorer searches for patterns that capture relations between concepts of the domain. The patterns which have been verified as interesting are structured and presented in a visual user interface allowing the user to operate on the results to refine and redirect mining queries or to access the associated documents. The system offers preprocessing tools to construct or refine a knowledge base of domain concepts and to create an intermediate representation of the document collection that will be used by all subsequent data mining operations. The main pattern types, the system can search for, are frequent sets, associations, concept distributions, and keyword graphs. To enable the user to provide some explicit bias, the system provides a dedicated query language for searching the vast implicit spaces of pattern instances that exist in the collection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

5 References

  1. Amir A., Aumann Y., Feldman R., and Katz O. Efficient Algorithm for Association Generation. Technical Report, Department of Computer Science, Bar-Ilan University, Israel.

    Google Scholar 

  2. Agrawal R., Mannila H., Srikant R., Toivonen H., and Verkamo I. Fast Discovery of Association Rules. In Advances in Knowledge Discovery and Data Mining, Eds. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, pages 307–328, AAAI Press.

    Google Scholar 

  3. Feldman R., Kloesgen W. and Zilberstein A. Visualization Techniques for Exploring Data mining Results in Document Collections. In Proceedings of the Third International Conference on Knowledge Discovery (KDD-97), August 1997.

    Google Scholar 

  4. Feldman R., and Hirsh H. “Exploiting Background Information in Knowledge Discovery from Text,”, Journal of Intelligent Information Systems, 1997.

    Google Scholar 

  5. Feldman R. and Dagan I. KDT-knowledge discovery in texts. In Proceedings of (KDD-95), August 1995.

    Google Scholar 

  6. Klösgen W. Efficient Discovery of Interesting Statements. The Journal of Intelligent Information Systems, Vol. 4, No l.

    Google Scholar 

  7. Klösgen W. Explora: A Multipattern and Multistrategy Discovery Assistant. In Advances in Knowledge Discovery and Data Mining, eds.

    Google Scholar 

  8. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, Cambridge, MA: MIT Press.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Zbigniew W. Raś Andrzej Skowron

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Feldman, R., Klösgen, W., Zilberstein, A. (1997). Document explorer: Discovering knowledge in document collections. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1997. Lecture Notes in Computer Science, vol 1325. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63614-5_13

Download citation

  • DOI: https://doi.org/10.1007/3-540-63614-5_13

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-63614-4

  • Online ISBN: 978-3-540-69612-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics