Published by De Gruyter Mouton, February 23, 2017

Perspectives on Speech Timing: Coupled Oscillator Modeling of Polish and Finnish

  • Zofia Malisz, Michael O’Dell, Tommi Nieminen and Petra Wagner
From the journal Phonetica

Abstract

This study was aimed at analyzing empirical duration data for Polish spoken at different tempos using an updated version of the Coupled Oscillator Model of speech timing and rhythm variability (O'Dell and Nieminen, 1999, 2009). We use Bayesian inference on parameters relating to speech rate to investigate how tempo affects timing in Polish. The model parameters found are then compared with parameters obtained for equivalent material in Finnish to shed light on which of the effects represent general speech rate mechanisms and which are specific to Polish. We discuss the model and its predictions in the context of current perspectives on speech timing.


*Zofia Malisz, Department of Speech, Music and Hearing, KTH, SE-10044 Stockholm (Sweden), E-Mail malisz@kth.se

References

1 Abercrombie D (1973): A phonetician's view of verse structure; in Phonetics in Linguistics. London, Longman's Publishing Group.

2 Abraham RH, Shaw CD (2000): Dynamics: The Geometry of Behavior, ed 4. Aerial Press.

3 Aylett M, Turk A (2004): The smooth signal redundancy hypothesis: a functional explanation for relationships between redundancy, prosodic prominence, and duration in spontaneous speech. Lang Speech 47(pt 1):31-56. doi:10.1177/00238309040470010201

4 Aylett M, Turk A (2006): Language redundancy predicts syllabic duration and the spectral characteristics of vocalic syllable nuclei. J Acoust Soc Am 119(5 pt 1):3048-3058. doi:10.1121/1.2188331

5 Barbosa PA (2006): Incursões em torno do ritmo da fala [Investigations of speech rhythm]. Campinas, Pontes.

6 Barbosa PA (2007): From syntax to acoustic duration: a dynamical model of speech rhythm production. Speech Commun 49:725-742. doi:10.1016/j.specom.2007.04.013

7 Beckman ME (1992): Evidence for speech rhythms across languages; in Tohkura Y, Vatikiotis-Bateson E, Sagisaka Y (eds): Speech Perception, Production and Linguistic Structure. Tokyo, OHM Publishing Co., pp 457-463.

8 Bertinetto PM, Bertini C (2010): Towards a unified predictive model of natural language rhythm; in Russo M (ed): Prosodic Universals: Comparative Studies in Rhythmic Modeling and Rhythm Typology. Naples, Aracne, pp 43-78.

9 Bolinger DLM (1965): Forms of English: Accent, Morpheme, Order. Cambridge, MA, Harvard University Press.

10 Bouzon C, Hirst D (2004): Isochrony and prosodic structure in British English. Proceedings of the 2nd International Conference on Speech Prosody, Nara, pp 223-226.

11 Brady MC, Port RF (2007): Quantifying vowel onset periodicity in Japanese. Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrücken, pp 337-342.

12 Browman CP, Goldstein L (1992): Articulatory phonology: an overview. Phonetica 49:155-180. doi:10.1159/000261913

13 Cetnarowska B (2000): On the (non-)recursivity of the prosodic word in Polish. ZAS Papers Linguist 19:1-21. doi:10.21248/zaspil.19.2000.66

14 Cummins F (2011): Periodic and aperiodic synchronization in skilled action. Front Hum Neurosci 170. doi:10.3389/fnhum.2011.00170

15 Cummins F, Port R (1998): Rhythmic constraints on English stress timing. J Phon 26:145-171. doi:10.1006/jpho.1998.0070

16 Cutler A (1994): The perception of rhythm in language. Cognition 50:79-81. doi:10.1016/0010-0277(94)90021-3

17 De Jong K (1995): The supraglottal articulation of prominence in English: linguistic stress as localized hyperarticulation. J Acoust Soc Am 97:491-504. doi:10.1121/1.412275

18 Dellwo V, Steiner I, Aschenberner B, Dankovicova J, Wagner P (2004): BonnTempo-corpus and BonnTempo-tools: a database for the study of speech rhythm and rate. Proceedings of INTERSPEECH, Jeju, pp 777-780. doi:10.21437/Interspeech.2004-294

19 Dellwo V, Wagner P (2003): Relations between language rhythm and speech rate. 15th International Congress of Phonetic Sciences, Barcelona, pp 471-474.

20 Dłuska M (1950): Fonetyka polska [The phonetics of Polish]. PWN Polskie Wydawnictwo Naukowe, Warszawa.

21 Dogil G (1979): Autosegmental Account of Phonological Emphasis. Carbondale, Illinois and Edmonton, Canada, Linguistic Research, Inc.

22 Eriksson A (1991): Aspects of Swedish Speech Rhythm; PhD thesis, University of Göteborg.

23 Fowler CA (1980): Coarticulation and theories of extrinsic timing. J Phon 8:113-133. doi:10.1016/S0095-4470(19)31446-9

24 Gahl S, Yao Y, Johnson K (2012): Why reduce? Phonological neighborhood density and phonetic reduction in spontaneous speech. J Mem Lang 66:789-806. doi:10.1016/j.jml.2011.11.006

25 Gelman A, Carlin JB, Stern HS, Rubin DB (2004): Bayesian Data Analysis, ed 2. Boca Raton, FL, Chapman & Hall/CRC. doi:10.1201/9780429258480

26 Gelman A, Hill J (2007): Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge, Cambridge University Press. doi:10.1017/CBO9780511790942

27 Gibson JJ (1975): Events are perceivable but time is not; in Fraser JT, Lawrence N (eds): The Study of Time II: Proceedings of the Second Conference of the International Society for the Study of Time, Lake Yamanaka-Japan. Berlin, Heidelberg, Springer Berlin Heidelberg, pp 295-301.

28 Gordon M (2011): Stress systems; in Goldsmith J, Riggle J, Yu AC (eds): The Handbook of Phonological Theory, ed 2. Chichester, Wiley Blackwell, pp 141-163.

29 Hanson K, Kiparsky P (1996): A parametric theory of poetic meter. Language 72:287-335. doi:10.2307/416652

30 Hayes B (1980): A Metrical Theory of Stress Rules; PhD thesis, Cambridge, MIT.

31 Hayes B, Puppel S (1985): On the rhythm rule in Polish; in van der Hulst H, Smith N (eds): Advances in Nonlinear Phonology. Dordrecht, Foris Publications, pp 59-81.

32 Hualde JI, Nadeu M (2014): Rhetorical stress in Spanish; in van der Hulst H (ed): Word Stress: Theoretical and Typological Issues. Cambridge, Cambridge University Press, p 228.

33 Jaeger TF, Buz E (2016): Signal reduction and linguistic encoding; in Fernández EM, Cairns HS (eds): Handbook of Psycholinguistics. Hoboken, NJ, Wiley-Blackwell. doi:10.1002/9781118829516.ch3

34 Jassem W, Hill DR, Witten IH (1984): Isochrony in English speech: its statistical validity and linguistic relevance; in Gibbon D, Richter H (eds): Intonation, Accent and Rhythm: Studies in Discourse Phonology. Berlin, Walter de Gruyter, pp 203-225.

35 Jones MR, Boltz M (1989): Dynamic attending and responses to time. Psychol Rev 96:459-491. doi:10.1037/0033-295X.96.3.459

36 Kelso JAS (1995): Dynamic patterns: the self organization of brain and behavior. Cambridge, MA, MIT Press.

37 Kim H, Cole J (2005): The stress foot as a unit of planned timing: evidence from shortening in the prosodic phrase. Proceedings of INTERSPEECH 2005, Lisbon, pp 2365-2368. doi:10.21437/Interspeech.2005-37

38 Kochanski G, Loukina A, Keane E, Shih C, Rosner B (2010): Long-range prosody prediction and rhythm. Proceedings of the 5th International Conference on Speech Prosody, Chicago, IL, pp 1-4.

39 Kopell N (1988): Toward a theory of modelling central pattern generators; in Cohen AH, Rossignol S, Grillner S (eds): Neural Control of Rhythmic Movements in Vertebrates. New York, John Wiley & Sons, pp 369-413.

40 Kuperman V, Ernestus M, Baayen H (2008): Frequency distributions of uniphones, diphones, and triphones in spontaneous speech. J Acoust Soc Am 124:3897-3908. doi:10.1121/1.3006378

41 Lee MW, Gibbons J (2007): Rhythmic alternation and the optional complementiser in English: new evidence of phonological influence on grammatical encoding. Cognition 105:446-456. doi:10.1016/j.cognition.2006.09.013

42 Lehiste I (1977): Isochrony reconsidered. J Phon 5:253-263. doi:10.1016/S0095-4470(19)31139-8

43 Lehtonen J (1970): Aspects of Quantity in Standard Finnish. No. VI in Studia Philologica Jyväskyläensia. Jyväskylä, University of Jyväskylä.

44 Liberman M, Prince A (1977): On stress and linguistic rhythm. Linguist Inq 8:249-336.

45 Lindblom B (1990): Explaining phonetic variation: a sketch of the H&H theory; in Marchal A (ed): Speech Production and Speech Modeling. Dordrecht, Kluwer Academic Publishers. doi:10.1007/978-94-009-2037-8_16

46 Louwerse MM, Dale R, Bard EG, Jeuniaux P (2012): Behavior matching in multimodal communication is synchronized. Cogn Sci 36:1404-1426. doi:10.1111/j.1551-6709.2012.01269.x

47 Malisz Z (2011): Tempo differentiated analyses of timing in Polish. Proceedings of the 17th International Congress of Phonetic Sciences, Hong Kong, pp 1322-1325.

48 Malisz Z (2013): Speech Rhythm Variability in Polish and English: A Study of Interaction between Rhythmic Levels; PhD thesis, Adam Mickiewicz University, Poznań.

49 Malisz Z, Zygis M, Pompino-Marschall B (2013): Rhythmic structure effects on glottalisation: a study of different speech styles in Polish and German. Lab Phonol 4:119-158. doi:10.1515/lp-2013-0006

50 McAuley JD, Fromboluti EK (2014): Attentional entrainment and perceived event duration. Philos Trans R Soc Lond B Biol Sci 369:20130401. doi:10.1098/rstb.2013.0401

51 Newlin-Łukowicz L (2012): Polish stress: looking for phonetic evidence of a bidirectional system. Phonology 29:271-329. doi:10.1017/S0952675712000139

52 Nolan F, Jeon HS (2014): Speech rhythm: a metaphor? Philos Trans R Soc Lond B Biol Sci 369:20130396. doi:10.1098/rstb.2013.0396

53 Ntzoufras I (2002): Gibbs variable selection using BUGS. J Stat Softw 7:1-19. doi:10.18637/jss.v007.i07

54 O'Dell M, Lennes M, Nieminen T (2008): Hierarchical levels of rhythm in conversational speech. Proceedings of the 4th International Conference on Speech Prosody, Campinas, pp 355-358.

55 O'Dell M, Lennes M, Werner S, Nieminen T (2007): Looking for rhythms in conversational speech. Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrücken, pp 1201-1204.

56 O'Dell M, Nieminen T (1998): Reasons for an underlying unity in rhythm dichotomy. Linguistica Uralica 3:178-185.

57 O'Dell M, Nieminen T (2002): How long is a stress group? Cadernos de Estudos Lingüísticos 43:93-108.

58 O'Dell M, Nieminen T (2006): Tahdin ajoitus suomessa oskillaattorimallin näkökulmasta [Timing of feet in Finnish from an oscillator model perspective]; in Aulanko R, Wahlberg L, Vainio M (eds): Fonetiikan päivät 2006 [The Phonetics Symposium 2006]. No. 53 in Helsingin Yliopiston Puhetieteiden Laitoksen Julkaisuja, pp 134-143.

59 O'Dell ML (2003): Intrinsic Timing and Quantity in Finnish; PhD thesis, University of Tampere.

60 O'Dell ML, Nieminen T (1999): Coupled oscillator model of speech rhythm. Proceedings of the 14th International Congress of Phonetic Sciences, San Francisco, pp 1075-1078.

61 O'Dell ML, Nieminen T (2009): Coupled oscillator model for speech timing: overview and examples. Nordic Prosody: Proceedings of the 10th Conference, Helsinki, pp 179-190.

62 O'Dell ML, Nieminen T, Mustanoja L (2011): The effect of synchronous reading on speech rhythm. Presented at the 13th International Rhythm Perception and Production Workshop, Leipzig.

63 Pate JK, Goldwater S (2015): Talkers account for listener and channel characteristics to communicate efficiently. J Mem Lang 78:1-17. doi:10.1016/j.jml.2014.10.003

64 Pellegrino F, Coupé C, Marsico E (2011): A cross-language perspective on speech information rate. Language 87:539-558. doi:10.1353/lan.2011.0057

65 Perkell JS, Klatt DH (1986): Invariance and Variability in Speech Processes. Lawrence Erlbaum.

66 Port R (2013): Coordinative structures for the control of speech production. www.cs.indiana.edu/~port/teach/641/coord.strctr.html (last checked April 1, 2013).

67 Port R, Tajima K, Cummins F (1999): Speech and rhythmic behavior; in Savelsburgh GJP, van der Maas H, van Geert PCL (eds): The Non-Linear Analysis of Developmental Processes. Amsterdam, Elsevier, pp 5-45.

68 Port RF, Leary AP (2005): Against formal phonology. Language 81:927-964. doi:10.1353/lan.2005.0195

69 Port RF, Van Gelder T (1995): Mind as Motion: Explorations in the Dynamics of Cognition. Cambridge, MA, MIT Press.

70 Quené H, Port RF (2005): Effects of timing regularity and metrical expectancy on spoken-word perception. Phonetica 62:1-13. doi:10.1159/000087222

71 Rubach J, Booij G (1985): A grid theory of stress in Polish. Lingua 66:281-319. doi:10.1016/0024-3841(85)90032-4

72 Sadeniemi M (1949): Metriikkamme perusteet [Foundations of our metrics]. No. 236 in SKS:n toimituksia. Helsinki, Suomalaisen Kirjallisuuden Seura.

73 Schlink B (1994): Selbs Betrug. Zürich, Diogenes Verlag AG.

74 Schlüter J (2005): Rhythmic Grammar. The Influence of Rhythm on Grammatical Variation and Change in English, Volume 46 of Topics in English Linguistics. Berlin, Mouton de Gruyter. doi:10.1515/9783110219265

75 Seyfarth S (2014): Word informativity influences acoustic duration: effects of contextual predictability on lexical representation. Cognition 133:140-155. doi:10.1016/j.cognition.2014.06.013

76 Shannon CE (1948): A mathematical theory of communication. Bell System Technical J 27:379-423, 623-656. doi:10.1002/j.1538-7305.1948.tb00917.x

77 Shih SS (2014): Towards Optimal Rhythm; PhD thesis, Stanford University.

78 Shockley K, Richardson DC, Dale R (2009): Conversation and coordinative structures. Top Cogn Sci 1:305-319. doi:10.1111/j.1756-8765.2009.01021.x

79 Sievers E (1893): Grundzüge der Phonetik zur Einführung in das Studium der Lautlehre der Indogermanischen Sprachen, ed 4. Leipzig, Breitkopf & Härtel.

80 Sovijärvi A (1946): Huomioita puherytmiikasta [Notes on speech rhythm]. Virittäjä 50:117-129.

81 Temperley D (2009): Distributional stress regularity: a corpus study. J Psycholinguist Res 38:75-92. doi:10.1007/s10936-008-9084-0

82 Tilsen S (2011): Metrical regularity facilitates speech planning and production. Lab Phonol 2:185-218. doi:10.1515/labphon.2011.006

83 Turk A (2010): Does prosodic constituency signal relative predictability? A smooth signal redundancy hypothesis. Lab Phonol 1:227-262. doi:10.1515/labphon.2010.012

84 Turk A, Shattuck-Hufnagel S (2013): What is speech rhythm? A commentary on Arvaniti and Rodriquez, Krivokapic and Goswami and Leong. Lab Phonol 4:93-118. doi:10.1515/lp-2013-0005

85 Turk A, Shattuck-Hufnagel S (2014): Timing in talking: what is it used for, and how is it controlled? Philos Trans R Soc B Biol Sci 369:20130395. doi:10.1098/rstb.2013.0395

86 Turvey MT (1990): Coordination. Am Psychol 45:938-953. doi:10.1037/0003-066X.45.8.938

87 van der Hulst H (2014): Representing rhythm; in van der Hulst H (ed): Word Stress: Theoretical and Typological Issues. Cambridge, Cambridge University Press, p 325.

88 Vogel R, van de Vijver R, Kotz S, Kutscher A, Wagner P (2015): Function words in rhythmic optimisation; in van de Vijver R, Vogel R (eds): Rhythm in Cognition and Grammar: A Germanic Perspective. Berlin, De Gruyter, pp 253-274.

89 Wagner P (2012): Meter specific timing and prominence in German poetry and prose; in Niebuhr (ed): Understanding Prosody, Language, Context and Cognition. Berlin, Walter de Gruyter, pp 219-236.

90 Wagner P, Malisz Z, Inden B, Wachsmuth I (2013): Interaction phonology - a temporal co-ordination component enabling representational alignment within a model of communication. Alignment in Communication. Towards a New Theory of Communication, pp 109-132. doi:10.1075/ais.6.06wag

91 Wheeldon LR, Lahiri A (2002): The minimal unit of phonological encoding: prosodic or lexical word. Cognition 85:B31-B41.

92 White L (2014): Communicative function and prosodic form in speech timing. Speech Commun 63:38-54. doi:10.1016/j.specom.2014.04.003

93 Windmann A, Šimko J, Wagner P (2014a): Probing theories of speech timing using optimization modeling. Proceedings of the 7th International Conference on Speech Prosody, Dublin, pp 346-350. doi:10.21437/SpeechProsody.2014-57

94 Windmann A, Šimko J, Wagner P (2014b): A unified account of prominence effects in an optimization-based model of speech timing. Proceedings of INTERSPEECH 2014, Singapore. doi:10.21437/Interspeech.2014-44

95 Windmann A, Šimko J, Wagner P (2015): Optimization-based modeling of speech timing. Speech Commun 74:76-92. doi:10.1016/j.specom.2015.09.007

96 Zipf GK (1935): The Psycho-Biology of Language. Houghton, Mifflin.

Received: 2015-03-20
Accepted: 2016-09-15
Published Online: 2017-02-23
Published in Print: 2017-02-01

© 2017 S. Karger AG, Basel

Downloaded on 26.4.2024 from https://www.degruyter.com/document/doi/10.1159/000450829/html