Handling Weighted Sequences Employing Inverted Files and Suffix Trees

Klev Diamanti; Andreas Kanavos; Christos Makris; Thodoris Tokis

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Handling Weighted Sequences Employing Inverted Files and Suffix Trees

Topics: Searching and Browsing; Text Mining; Web Information Filtering and Retrieval

In Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, 231-238, 2014 , Barcelona, Spain

Authors: Klev Diamanti ¹ ; Andreas Kanavos ² ; Christos Makris ² and Thodoris Tokis ²

Affiliations: ¹ Uppsala University, Sweden ; ² University of Patras, Greece

Keyword(s): Searching and Browsing, Web Information Filtering and Retrieval, Text Mining, Indexing Structures, Inverted Files, n-gram Indexing, Sequence Analysis and Assembly, Weighted Sequences, Weighted Suffix Trees.

Abstract: In this paper, we address the problem of handling weighted sequences. This is by taking advantage of the inverted files machinery and targeting text processing applications, where the involved documents cannot be separated into words (such as texts representing biological sequences) or word separation is difficult and involves extra linguistic knowledge (texts in Asian languages). Besides providing a handling of weighted sequences using n-grams, we also provide a study of constructing space efficient n-gram inverted indexes. The proposed techniques combine classic straightforward n-gram indexing, with the recently proposed two-level n-gram inverted file technique. The final outcomes are new data structures for n-gram indexing, which perform better in terms of space consumption than the existing ones. Our experimental results are encouraging and depict that these techniques can surely handle n-gram indexes more space efficiently than already existing methods.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 18.222.112.64

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Diamanti, K., Kanavos, A., Makris, C. and Tokis, T. (2014). Handling Weighted Sequences Employing Inverted Files and Suffix Trees. In Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST; ISBN 978-989-758-024-6; ISSN 2184-3252, SciTePress, pages 231-238. DOI: 10.5220/0004788502310238

@conference{webist14,
author={Klev Diamanti and Andreas Kanavos and Christos Makris and Thodoris Tokis},
title={Handling Weighted Sequences Employing Inverted Files and Suffix Trees},
booktitle={Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2014},
pages={231-238},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004788502310238},
isbn={978-989-758-024-6},
issn={2184-3252},
}

TY - CONF

JO - Proceedings of the 10th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - Handling Weighted Sequences Employing Inverted Files and Suffix Trees
SN - 978-989-758-024-6
IS - 2184-3252
AU - Diamanti, K.
AU - Kanavos, A.
AU - Makris, C.
AU - Tokis, T.
PY - 2014
SP - 231
EP - 238
DO - 10.5220/0004788502310238
PB - SciTePress