Frame rate as a QoS parameter and its influence on speech perception

Multimedia Systems

Abstract.

Preserving quality of service (QoS) for multimedia traffic across a data network is a difficult problem. We focus on video frame rate and study its influence on speech perception.

When sound and picture are discrepant (e.g., acoustic ‘ba’ combined with visual ‘ga’), subjects perceive a different sound (such as ‘da’). This phenomenon is known as the McGurk effect. This paper studies how a degraded video frame rate influences speech perception.

It is shown that, as frame rate decreases, correct hearing improves for discrepant stimuli and degrades for congruent stimuli (those in which voice and picture are the same). Furthermore, we studied the case where lip closure was always captured by synchronizing the sampling time with the lip position. In this case, frame rate has little effect on mishearing for congruent stimuli. For discrepant stimuli, mishearing decreases as frame rate is degraded. These results indicate that the stiff lip motion resulting from a low frame rate cannot give enough labial information for speech perception.

In addition, the effect of delaying the picture to correct for low frame rate was studied. The results, however, were not as definitive as expected, because of compound effects related to the synchronization of sound and picture. Finally, we inspected still pictures of normal Japanese speech and determined a lower limit of frame rate from the viewpoint of assisting hearing.
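The role of synchronizing sampling time with lip position can be illustrated with a small sampling sketch. It is not the paper's method: it assumes each video frame is an instantaneous sample, treats lip closure as a single interval, and uses a hypothetical closure length of 50 ms that does not come from the article. The function names are likewise illustrative.

```python
from fractions import Fraction
import math


def closure_captured(fps, start_ms, length_ms, phase_ms=0):
    """Return True if at least one frame falls inside the lip-closure interval.

    Frames are modelled as instantaneous samples taken at
    phase_ms + k * (1000 / fps) for k = 0, 1, 2, ... (an assumption).
    """
    period = Fraction(1000, fps)  # frame period in milliseconds, exact
    # Index of the first frame at or after the start of the closure.
    k = max(0, math.ceil((start_ms - phase_ms) / period))
    first_frame = phase_ms + k * period
    return first_frame <= start_ms + length_ms


def capture_probability(fps, length_ms):
    """Chance of capturing the closure when the sampling phase is uniformly random."""
    return min(Fraction(1), Fraction(length_ms * fps, 1000))


def min_guaranteed_fps(length_ms):
    """Smallest integer frame rate whose period does not exceed the closure length."""
    return math.ceil(Fraction(1000, length_ms))


# Hypothetical closure: starts at 120 ms and lasts 50 ms.
unsynchronized = closure_captured(10, 120, 50, phase_ms=0)
synchronized = closure_captured(10, 120, 50, phase_ms=120)
print(unsynchronized, synchronized, capture_probability(10, 50), min_guaranteed_fps(50))
```

Under these assumptions, a 10 frames/s stream misses the closure unless the sampling phase is aligned with it, while any rate whose frame period is no longer than the closure always captures it. The lower limit reported in the article was derived from still pictures of real speech, not from this model.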

Cite this article

Nakazono, K. Frame rate as a QoS parameter and its influence on speech perception. Multimedia Systems 6, 359–366 (1998). https://doi.org/10.1007/s005300050099
