Conferences >2010 2nd International Worksh...

Finding surprising patterns in textual data streams

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We address the task of detecting surprising patterns in large textual data streams. These can reveal events in the real world when the data streams are generated by onlin...Show More

Metadata

Abstract:

We address the task of detecting surprising patterns in large textual data streams. These can reveal events in the real world when the data streams are generated by online news media, emails, Twitter feeds, movie subtitles, scientific publications, and more. The volume of interest in such text streams often exceeds human capacity for analysis, such that automatic pattern recognition tools are indispensable. In particular, we are interested in surprising changes in the frequency of n-grams of words, or more generally of symbols from an unlimited alphabet size. Despite the exponentially large number of possible n-grams in the size of the alphabet (which is itself unbounded), we show how these can be detected efficiently. To this end, we rely on a data structure known as a generalised suffix tree, which is additionally annotated with a limited amount of statistical information. Crucially, we show how the generalised suffix tree as well as these statistical annotations can efficiently be updated in an on-line fashion.

Published in: 2010 2nd International Workshop on Cognitive Information Processing

Date of Conference: 14-16 June 2010

Date Added to IEEE Xplore: 14 October 2010

ISBN Information:

ISSN Information:

DOI: 10.1109/CIP.2010.5604085

Conference Location: Elba, Italy

Contents

References is not available for this document.

Finding surprising patterns in textual data streams

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Finding surprising patterns in textual data streams

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?