A General Parsing Model for Music and Language

Bod, Rens

doi:10.1007/3-540-45722-4_3

Rens Bod^3,4

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2445))

Included in the following conference series:

International Conference on Music and Artificial Intelligence

1163 Accesses
2 Citations

Abstract

Is there a general mechanism that governs the perception of phrase structure in music and language? While it is usually assumed that humans have separate faculties for music and language, this work focuses on the commonalities rather than on the differences between these modalities, aiming at finding a deeper “faculty”. We present a series of data-oriented parsing (DOP) models which aim at balancing the simplest structure with the most likely structure of an input. Experiments with the Essen Folksong Collection and the Penn Treebank show that exactly the same model with the same parameter setting achieves maximum parse accuracy for both music and language. This suggests an interesting parallel between musical and linguistic processing. We show that our results outperform both the melodic component of Temperley (2001) and the musical parser of Bod (2001b).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Black, E. et al. (1991). iA Procedure for Quantitatively Comparing the Syntactic Coverage of English, Proceedings DARPA Speech and Natural Language Workshop, Morgan Kaufmann.
Google Scholar
Bod, R. (1993). Using an Annotated Language Corpus as a Virtual Stochastic Grammar, Proceedings AAAI-93, Menlo Park, Ca.
Google Scholar
Bod, R. (1998). Beyond Grammar: An Experience-Based Theory of Language, Stanford: CSLI Publications (Lecture notes number 88), distributed by Cambridge University Press.
Google Scholar
Bod, R. (2000). Parsing with the Shortest Derivation. Proceedings COLING-2000, Germany.
Google Scholar
Bod, R. (2001a). What is the Minimal Set of Fragments that Achieves Maximal Parse Accuracy? Proceedings ACL’2001, Toulouse, France.
Google Scholar
Bod, R. (2001b). A Memory-Based Model for Music Analysis. Proceedings International Computer Music Conference (ICMC’2001), Havana, Cuba.
Google Scholar
Bod, R. (2001c). Memory-Based Models of Melodic Analysis: Challenging the Gestalt Principles. Journal of New Music Research, 31(1), in press. (available at http://turing.wins.uva.nl/~rens/jnmr01.pdf)
Bod, R. (2002). Combining Simplicity and Likelihood in Language and Music. Proceedings CogSci’2002. Fairfax, Virginia.
Google Scholar
Bod, R., J. Hay and S. Jannedy (eds.) (2002a). Probabilistic Linguistics. Cambridge, The MIT Press. (in press)
Google Scholar
Bod, R., R. Scha and K. Sima’an (eds.) (2002b). Data-Oriented Parsing. Stanford, CSLI Publications. (in press)
Google Scholar
Buffart, H., E. Leeuwenberg and F. Restle (1983). Analysis of Ambiguity in Visual Pattern Completion. Journal of Experimental Psychology 9, 980–1000.
Google Scholar
Charniak, E. (1997). Statistical Techniques for Natural Language Parsing, AI Magazine, Winter 1997, 32–43.
Google Scholar
Charniak, E. (2000). A Maximum-Entropy-Inspired Parser. Proceedings ANLP-NAACL’2000.
Google Scholar
Chater, N. (1999). The Search for Simplicity: A Fundamental Cognitive Principle? The Quarterly Journal of Experimental Psychology, 52A(2), 273–302.
Google Scholar
Chomsky, N. (1965). Aspects of the Theory of Syntax, Cambridge, The MIT Press.
Google Scholar
Collard, R., P. Vos and E. Leeuwenberg, (1981). What Melody Tells about Metre in Music. Zeitschrift fŸr Psychologie. 189, 25–33.
Google Scholar
Collins, M. (2000). Discriminative Reranking for Natural Language Parsing, Proceedings ICML-2000, Stanford, Ca.
Google Scholar
Collins, M. and N. Duffy (2002). New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron. Proceedings ACL’2002, Philadelphia.
Google Scholar
Frazier, L. (1978). On Comprehending Sentences: Syntactic Parsing Strategies. PhD. Thesis, University of Connecticut.
Google Scholar
Goodman, J. (2002). Efficient Parsing of DOP with PCFG-Reductions. In R. Bod et al. 2002b. von Helmholtz, H. (1910). Treatise on Physiological Optics (Vol. 3), Dover, New York.
Google Scholar
Jurafsky, D. (2002). Probabilistic Modeling in Psycholinguistics: Comprehension and Production. In R. Bod et al. 2002a.
Google Scholar
Kersten, D. (1999). High-level vision as statistical inference. In S. Gazzaniga (ed.), The New Cognitive Neurosciences, Cambridge, The MIT Press.
Google Scholar
Leeuwenberg, E. (1971). A Perceptual Coding Language for Perceptual and Auditory Patterns. American Journal of Psychology. 84, 307–349.
Article Google Scholar
Lerdahl, F. and R. Jackendoff (1983). A Generative Theory of Tonal Music. The MIT Press.
Google Scholar
Longuet-Higgins, H. (1976). Perception of Melodies. Nature 263, October 21, 646–653.
Article Google Scholar
Longuet-Higgins, H. and C. Lee, (1987). The Rhythmic Interpretation of Monophonic Music. In: Mental Processes: Studies in Cognitive Science, Cambridge, The MIT Press.
Google Scholar
Manning, C. and H. Schütze (1999). Foundations of Statistical Natural Language Processing. Cambridge, The MIT Press.
MATH Google Scholar
Marcus, M., B. Santorini and M. Marcinkiewicz (1993). Building a Large Annotated Corpus of English: the Penn Treebank, Computational Linguistics 19(2).
Google Scholar
Marr, D. (1982). Vision. San Francisco, Freeman.
Google Scholar
Mumford, D. (1999). The dawning of the age of stochasticity. Based on a lecture at the Accademia Nazionale dei Lincei. (available at http://www.dam.brown.edu/people/mumford/Papers/Dawning.ps)
Osborne, M. (1999). Minimal description length-based induction of definite clause grammars for noun phrase identification. Proceedings EACL Workshop on Computational Natural Language Learning. Bergen, Norway.
Google Scholar
Palmer, S. (1977). Hierarchical Structure in Perceptual Representation. Cognitive Psychology, 9, 441–474.
Article MathSciNet Google Scholar
Raphael, C. (1999). Automatic Segmentation of Acoustic Musical Signals Using Hidden Markov Models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21, 360-370.
Google Scholar
Rissanen, J. (1989). Stochastic Complexity in Statistical Inquiry. Series in Computer Science—Volume 15. World Scientific, 1989.
Google Scholar
Saffran, J., M. Loman and R. Robertson (2000). Infant Memory for Musical Experiences. Cognition 77, B 16–23.
Google Scholar
Schaffrath, H. (1995). The Essen Folksong Collection in the Humdrum Kern Format. D. Huron (ed.). Menlo Park, CA: Center for Computer Assisted Research in the Humanities.
Google Scholar
Shannon, C. (1948). A Mathematical Theory of Communication. Bell System Technical Journal. 27, 379–423, 623-656.
MathSciNet Google Scholar
Simon, H. (1972). Complexity and the Representation of Patterned Sequences of Symbols. Psychological Review. 79, 369–382.
Article Google Scholar
Temperley, D. (2001). The Cognition of Basic Musical Structures. Cambridge, The MIT Press.
Google Scholar
Wertheimer, M. (1923). Untersuchungen zur Lehre von der Gestalt. Psychologische Forschung 4, 301–350.
Article Google Scholar
Wundt, W. (1901). Sprachgeschichte und Sprachpsychologie. Engelmann, Leipzig.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Logic, Language and Computation, University of Amsterdam, Amsterdam
Rens Bod
School of Computing, University of Leeds, Leeds
Rens Bod

Authors

Rens Bod
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Music, University of Edinburgh, 12 Nicolson Square, EH8 9DF, Edinburgh, UK
Christina Anagnostopoulou & Miguel Ferrand &
Division of Informatics, University of Edinburgh, 80 South Bridge, EH1 1HN, Edinburgh, UK
Alan Smaill

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bod, R. (2002). A General Parsing Model for Music and Language. In: Anagnostopoulou, C., Ferrand, M., Smaill, A. (eds) Music and Artificial Intelligence. ICMAI 2002. Lecture Notes in Computer Science(), vol 2445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45722-4_3

Download citation

DOI: https://doi.org/10.1007/3-540-45722-4_3
Published: 27 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44145-8
Online ISBN: 978-3-540-45722-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics