Skip to main content

Sentence Boundary Verification in Polish Text

  • Conference paper
Book cover Computer Recognition Systems 2

Part of the book series: Advances in Soft Computing ((AINSC,volume 45))

  • 750 Accesses

Abstract

In this paper the heuristic metod based on phrase analysis is proposed for sentence boundary verification in Polish texts. The decision rules, maximum entropy and neural network as reference methods are compared with the phrase analysis. The results elaborated by the proposed method are more acurate than the reference methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Krzysztof Jassem. Przetwarzanie tekstów polskich w systemie tlumaczenia automatycznego POLENG. Wydawnictwo Naukowe UAM, 2006.

    Google Scholar 

  2. Slawomir Kulików. Implementacja serwera analizy lingwistycznej dla systemu THETOS-translatora tekstu na jcezyk migowy. Studio, Informatica, 24(3 (55)):171–178, 2003.

    Google Scholar 

  3. Kelley Herndon Ford Neha Agarwal and Max Shneider. Sentence boundary detection using a maxent classifier.

    Google Scholar 

  4. David D. Palmer and Marti A. Hearst. Adaptive multilingual sentence boundary disambiguation. Computational Linguistics, 23(2):241–267, 1997.

    Google Scholar 

  5. Ross Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.

    Google Scholar 

  6. Adwait Ratnaparkhi. A simple introduction to maximum entropy models for natural language processing. Technical report, Institute for Research in Cognitive Science, University of Pennsylvania, 1997.

    Google Scholar 

  7. Jeffrey C. Reynar and Adwait Ratnaparkhi. A maximum entropy approach to identifying sentence boundaries. Proceedings of the Fifth Conference on Applied Natural Language Processing, pages 16–19.

    Google Scholar 

  8. E. Stamatatos, N. Fakotakis, and G. Kokkinakis. Automatic extraction of rules for sentence boundary disambiguation. Proceedings of the Workshop in Machine Learning in Human Language Technology, Advance Course on Artificial Intelligence, pages 88–92, 1999.

    Google Scholar 

  9. Nina Suszczanska, Miroslaw Forczek, and Artur Migas. Wieloetapowy analizator morfologiczny. Speech and Language Technology, 4:155–165, 2000.

    Google Scholar 

  10. Stanislaw Urbanczyk, editor. Encyklopedia jezyka polskiego. Zaklad Narodowy im. Ossolinskich — Wydawnictwo, 1991.

    Google Scholar 

  11. Daniel J. Walker, David E. Clements, Maki Darwin, and Jan W. Amtrup. Sentence boundary detection: A comparison of paradigms for improving MT quality. In MT Summit Proceedings VIII, September 2001.

    Google Scholar 

  12. Haoyi Wang and Yang Huang. Bondex — a sentence boundary detector, 2003.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Siminski, K. (2007). Sentence Boundary Verification in Polish Text. In: Kurzynski, M., Puchala, E., Wozniak, M., Zolnierek, A. (eds) Computer Recognition Systems 2. Advances in Soft Computing, vol 45. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75175-5_62

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75175-5_62

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75174-8

  • Online ISBN: 978-3-540-75175-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics