Radiological reporting based on voice recognition

Antoniol, G.; Fiutem, R.; Flor, R.; Lazzari, G.

doi:10.1007/3-540-57433-6_53

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 753)

Included in the conference series: International Conference on Human-Computer Interaction

Abstract

Speech recognition has proved to be a natural interaction modality and an effective technology for medical reporting, particularly in radiology. The high volume of text to be produced and the complex structure of these texts make voice technologies useful. By using speech, professionals in the field can generate reports at a speed that approaches traditional dictation methods.

However, integrating speech recognition into a user interface creates new problems: speech recognizers may introduce errors, and they must be adaptable to variations in spoken language.

This paper describes a radiological reporting system and the motivations for using the speech modality. A preliminary evaluation of the system has shown that, on average, more than two thirds of a radiological report is generated by dictation, even though text-recall functions and keyword shortcuts are available.

Author information

Authors and Affiliations

IRST, Pantè di Povo, I-38050, Trento, Italy
G. Antoniol, R. Fiutem, R. Flor & G. Lazzari

Editor information

Leonard J. Bass, Juri Gornostaev, Claus Unger

Cite this paper

Antoniol, G., Fiutem, R., Flor, R., Lazzari, G. (1993). Radiological reporting based on voice recognition. In: Bass, L.J., Gornostaev, J., Unger, C. (eds) Human-Computer Interaction. EWHCI 1993. Lecture Notes in Computer Science, vol 753. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-57433-6_53

DOI: https://doi.org/10.1007/3-540-57433-6_53
Published: 28 May 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-57433-0
Online ISBN: 978-3-540-48152-2
eBook Packages: Springer Book Archive