What Weighs for Word Stress? Big Data Mining and Analyses of Phonotactic Distributions in Brazilian Portuguese

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2018)

Abstract

For about four decades, phonological theories have claimed that word stress assignment depends on the phonotactic complexity of a word's syllables in relation to their position in the word. This study analyzes the implications of phonotactics for word stress in Brazilian Portuguese. A phonotactic corpus was created and analyzed with Random Forest modeling; the phonotactic distributions for word stress were found to be bound to stress pattern and to word length in number of syllables. To account for these observations, models of word naming must be extended with aspects of word stress.

Notes

  1. Antepenultimate and final stress patterns are considered to be exceptional in Portuguese and have diacritics on the stressed vowel to mark stress in orthography.

  2. The sequence /tS/ in X-SAMPA corresponds to one sound, the voiceless palato-alveolar affricate, which in IPA is represented by the symbol /ʧ/.
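
The point of note 2, that a two-character X-SAMPA sequence can stand for a single sound, matters whenever segments are counted in a transcription. The following is a minimal sketch of a longest-match tokenizer, not the procedure used in the paper. The function name `xsampa_to_ipa` and the table `XSAMPA_TO_IPA` are hypothetical, and only the `tS` entry is taken from the note; the other entries are standard X-SAMPA correspondences added for illustration.

```python
# Partial X-SAMPA -> IPA table (hypothetical helper, not from the paper).
# Only "tS" is taken from note 2; the other entries are standard X-SAMPA
# correspondences included for illustration.
XSAMPA_TO_IPA = {
    "tS": "ʧ",  # voiceless palato-alveolar affricate (one sound)
    "dZ": "ʤ",  # voiced palato-alveolar affricate (one sound)
    "S": "ʃ",
    "Z": "ʒ",
}


def xsampa_to_ipa(transcription):
    """Split an X-SAMPA string into a list of sounds, longest symbol first."""
    # Try two-character symbols before one-character ones so that
    # "tS" is read as a single affricate rather than "t" followed by "S".
    symbols = sorted(XSAMPA_TO_IPA, key=len, reverse=True)
    sounds = []
    i = 0
    while i < len(transcription):
        for sym in symbols:
            if transcription.startswith(sym, i):
                sounds.append(XSAMPA_TO_IPA[sym])
                i += len(sym)
                break
        else:
            # Symbols outside the partial table are passed through unchanged.
            sounds.append(transcription[i])
            i += 1
    return sounds
```

With this sketch, the illustrative string `"tSi"` is split into two sounds rather than three, because `tS` is matched as a unit before the single-character symbols are tried.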

Author information

Corresponding author

Correspondence to Amanda Post da Silveira.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Post da Silveira, A., Sanders, E., Mendonça, G., Dijkstra, T. (2018). What Weighs for Word Stress? Big Data Mining and Analyses of Phonotactic Distributions in Brazilian Portuguese. In: Villavicencio, A., et al. Computational Processing of the Portuguese Language. PROPOR 2018. Lecture Notes in Computer Science, vol 11122. Springer, Cham. https://doi.org/10.1007/978-3-319-99722-3_40

  • DOI: https://doi.org/10.1007/978-3-319-99722-3_40

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-99721-6

  • Online ISBN: 978-3-319-99722-3

  • eBook Packages: Computer Science (R0)
