DOI: 10.1145/1499799.1499842

From text to structured information: automatic processing of medical reports

Published: 7 June 1976

ABSTRACT

This paper describes the analysis and processing programs for a set of natural language texts in a medical area (x-ray reports on patients with breast cancer). The programs convert the information in the text into a tabular form suitable for further automatic information processing (e.g., editing of records, question answering on the data collected, or statistical summaries of the data). To set up a tabular form appropriate for the data, we first perform a manual linguistic analysis on a sample of the texts. From this we obtain the word classes and the form of the table (called an information format) for this type of material. We then apply the series of processing programs to the sentences of the texts. Each sentence is parsed with the Linguistic String Parser English grammar in order to obtain its grammatical structure; certain standard English transformations are then applied to regularize the grammatical form of the sentence; and finally a set of "formatting transformations" maps the words of the sentence into the slots of the format or table, in such a way that the sentence is reconstructible (up to paraphrase) from its representation in the table. The results of applying these programs to a corpus are described. This procedure enables us to convert a natural language corpus into a structured data base.
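
The final step described in the abstract, mapping the words of a sentence into the slots of an information format, can be illustrated with a toy sketch. This is not the Linguistic String Parser or the paper's formatting transformations: the word classes, column names, and the function `format_sentence` are all hypothetical, and a plain dictionary lookup stands in for the parsing and transformation stages. Unlike the procedure in the abstract, this sketch drops unclassified words (such as "in"), so the sentence is not fully reconstructible from the row.

```python
# Hypothetical word classes for a toy radiology sublanguage (illustrative only;
# the paper derives its classes from a manual linguistic analysis).
WORD_CLASSES = {
    "mammogram": "TEST",
    "x-ray": "TEST",
    "shows": "VERB",
    "no": "NEGATION",
    "left": "BODY_PART",
    "right": "BODY_PART",
    "breast": "BODY_PART",
    "mass": "FINDING",
    "calcification": "FINDING",
}

# Hypothetical columns of the information format (the table).
FORMAT_COLUMNS = ("TEST", "VERB", "NEGATION", "BODY_PART", "FINDING")


def format_sentence(sentence):
    """Place each classified word of the sentence into its format column."""
    slots = {column: [] for column in FORMAT_COLUMNS}
    for token in sentence.lower().rstrip(".").split():
        word_class = WORD_CLASSES.get(token)
        if word_class is not None:  # unclassified words are dropped in this toy
            slots[word_class].append(token)
    # Join the words in each slot, preserving their order in the sentence.
    return {column: " ".join(words) for column, words in slots.items()}


row = format_sentence("Mammogram shows no mass in left breast.")
for column in FORMAT_COLUMNS:
    print(f"{column:10} {row[column]}")
```

Each sentence becomes one row with the same columns, which is what makes the later uses named in the abstract (record editing, question answering, statistical summaries) straightforward to run over the table.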

Published in: AFIPS '76: Proceedings of the June 7-10, 1976, national computer conference and exposition
June 1976, 1125 pages
ISBN: 9781450379175
DOI: 10.1145/1499799

Copyright © 1976 ACM


Publisher: Association for Computing Machinery, New York, NY, United States
