Skip to main content

Building a Digital Collection of Web-Pages: Access and Filtering Information with Textual Expansion

  • Conference paper
  • First Online:
Text, Speech and Dialogue (TSD 2001)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2166))

Included in the following conference series:

  • 410 Accesses

Abstract

This paper describes an approach to the design of an information retrieval of providing an search of users. Textual analysis is a part of information treatment systems. Next generation of information systems will rely on collaborative agents for playing a fundamental action in actively searching and finding relevant information in complex systems. The explosive growth of Web sites and Usenet news demands effectives filtering solutions. The access to digital data through WEB servers is facilitated by search engines. A number of Internet search engines provide classified search directories. The aim of the present paper is to suggest a method of filtering based only on the address URL, titles, abstracts. The problem of information searching in texts is mainly a linguistic problem. The objective is to construct a system for access and filtering information with using the model of Noun Phrases (NP). The intensional predicate and NP are used from retrieval, navigations (discrete & continue) and filtering the solutions captured from the WEB.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Belkin, N.J., Croft, W.B.: Information Filtering and Information Retrieval: Two Sides of the Same Coin? In: Communication of the ACM. Vol.35,No. 12., (1992), pp. 29–38.

    Article  Google Scholar 

  2. Blair, D.C.: Language and Representation in Information Retrieval. Elsevier. Amsterdam (1990).

    Google Scholar 

  3. Bush, V.: As We May Think. The Atlantic Monthly. 6 (1945), pp. 101–108.

    Google Scholar 

  4. Copestake, A., Sparck-Jones, K. (1990).: Natural Language Interfaces To Databases. The Knowledge Engineering review, no 5, part 4.

    Google Scholar 

  5. Delisle, C., Larouk, O.: Convergence et Divergence des Moteurs de recherche: Tests sur les moteurs et méta-moteurs. Pré-rapport DEA-Sc. de l’information, ENSSIB, Lyon (1999), 18 p.

    Google Scholar 

  6. Foltz, P.W, Dumais, T.: Personalized Information Delivery: An Analysis of Information Filtering Methods. In: Communication of the ACM. Vol.35,No. 12, (1992), pp. 51–60.

    Article  Google Scholar 

  7. Grishman, R.: Natural Language Interfaces. Journal of the ASIS, Vol 35, (1984). pp. 291–296.

    Google Scholar 

  8. Larouk, O.: Application of Non-Classical Logics to Optimise Textual Knowledge Representation in an Information Retrieval System. in HEURISTICS: THE JOURNAL of Knowledge Engineering. Volume 6, Number 1 Spring (1993), Gaithersburg, MD-USA; pp. 25–37.

    Google Scholar 

  9. Le Guern, M.: FRUn Analyseur Morpho-Syntaxique pour l’Indexation Automatique. Le Français Moderne, tome LIX, no 1, (1992), pp. 22–35.

    Google Scholar 

  10. Nicols, D., Pemberton, D., Dalhoumi, S., Larouk, O., Belisle, C., Twidale, M.: DEBORA: Developing an interface to Support Collaboration in a digital Library. ECDL’2000, Research and Advanced Technology for Digital Libraies. Lecture Notes in Computer Science-no 1923, (J. Borbinha & T. Baker eds.), Springer-Verlag, (2000), Lisbon, pp. 239–248.

    Chapter  Google Scholar 

  11. Rich, E.: Natural Language Interfaces. Computer. (1984).

    Google Scholar 

  12. Salton, G, Mc Gill, M.J.: Introduction to Modern Information Retrieval. Mc Graw-Hill, New York, (1983).

    MATH  Google Scholar 

  13. Smeaton, A.F, Van Rijsbergen, C.J.: Experiments on Incorporating Syntactic Processing of User Queries into a Document Retrieval Strategy. ACM-SIGIR, Grenoble, (1988), pp. 31–51.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Larouk, O. (2001). Building a Digital Collection of Web-Pages: Access and Filtering Information with Textual Expansion. In: Matoušek, V., Mautner, P., Mouček, R., Taušer, K. (eds) Text, Speech and Dialogue. TSD 2001. Lecture Notes in Computer Science(), vol 2166. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44805-5_21

Download citation

  • DOI: https://doi.org/10.1007/3-540-44805-5_21

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42557-1

  • Online ISBN: 978-3-540-44805-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics