Abstract
Developments in the theory of auditory processing of rhythmic signals have enabled the construction of a robust algorithm for recovery of rhythmic grouping structure. This algorithm appears to be effective for both speech and music signals. The theory upon which the algorithm was based was inspired by the theory of edge detection in vision. The output of the algorithm can be visualised in the form of a “rhythmogram”, examples of which are shown for a variety of speech signals. The relationship between rhythm, time perception and metre is discussed in the light of a recent “auditory-motor” theory of beat induction.
Similar content being viewed by others
References
Brown, G. J. (1992). Computational Auditory Scene Analysis: A Representational Approach. Ph.D. Thesis, University of Sheffield.
Brown, G. J. & Cooke, M. (1994). Perceptual Grouping of Musical Sounds: A Computational Model. J. New Music Research.
Clarke, E. (1988). Generative Principles in Musical Performance. In Sloboda, J. (ed.) Generative Processes in Music: The Psychology of Performance, Improvisation and Composition. Oxford: Claredon Press.
Cutler, A. & Ladd, D. R. (1983). Prosody: Models and Measurement. Springer-Verlag: Berlin.
Desain, P. (1992) A (De) Composable Theory of Rhythm Perception. Music Perception 9: 439–454.
Dishal, M. (1959). Gaussian Response Filter Design. Electrical Communications 36(1): 3–26.
Drake, C. & Botte, M. (1993). Tempo Sensitivity in Auditory Sequences: Evidence for a Multiple-Look Model. Perception and Psychophysics 54(3): 277–286.
Geigerich, H. J. (1985). Metrical Phonology and Phonological Structure. Cambridge University Press: Cambridge.
Glasberg, B. & Moore, B. (1990). Derivation of Auditory Filter Shapes from Notched-Noise Data. Hearing Research 47: 103–138.
Hardy, T. (1923). Collected Poems of Thomas Hardy. Vol. I. London: Macmillan.
Kingston, J. & Beckman, M. E. (1990). Papers in Laboratory Phonology: Between the Grammar and Physics of Speech. CUP: Cambridge.
Large, E. (1994). The Resonant Dynamics of Beat Tracking and Meter Perception. Proceedings of The International Computer Music Conference. Denmark: Aarhaus.
Lee, C. S. (1991). Perception of Metrical Structure: Experimental Evidence and a Model. In Howell, P., West, R. & Cross, I. (eds.) Representing Musical Structure, 59–127. London: Academic Press.
Leman, M. (1994). Introduction to Auditory Models in Music Research. J. New Music Research 23(1): 5–9.
Lerdahl, F. & Jackendoff, R. (1983). A Generative Theory of Tonal Music. MIT Press: Cambridge, MA.
Liberman, M. & Prince, A. (1977). On Stress and Linguistic Rhythm. Linguistic Inquiry 8(2): 249–336.
Longuet-Higgins, H. C. (1976). The Perception of Melodies. Nature 263: 646–653.
Longuet-Higgins, H. C. & Lee, C. S. (1982). Perception of Musical Rhythms. Perception 11: 115–128.
Longuet-Higgins, H. C. & Lee, C. S. (1984). The Rhythmic Interpretation of Monophonic Music. Music Perception 1(4): 424–441.
Longuet-Higgins, H. C. & Steedman, M. J. (1971, 1987). On Interpreting Bach. In Longuet-Higgins, H. C. (ed.) Mental Processes: Studies in Cognitive Science, 82–104. MIT Press: Cambridge, MA.
Marr, D. (1982). Vision. Freeman: New York.
Meddis, R. (1988). Simulation of Auditory-Neural Transduction: Further Studies. J. Acoust. Soc. Am 83(3): 1056–1063.
Parncutt, R. (1994). A Model of Beat Induction Accounting for Perceptual Ambiguity by Continuously Variable Parameters. Proceedings of The International Computer Music Conference. Denmark: Aarhaus.
Patterson, R. D. & Holdsworth, J. (1992). A Functional Model of Neural Activity Patterns and Auditory Images. In Ainsworth, W. A. (ed.) Advances in Speech, Hearing and Language Processing. Vol. 3. JAI Press: London.
Popper, A. N. & Fay, R. R. (1992). The Mammalian Anditory Pathway: Neurophysiology. Springer-Verlag: NY.
Povel, D. J. & Essens, P. (1985). Perception of Temporal Patterns. Music Perception 2(4): 411–440.
Repp, B. (1990). Patterns of Expressive Timing in Performances of a Beethoven Minuet by Nineteen Famous Painists. Journal of the Acoustical Society of America 88(2): 622–641.
Repp, B. (1992). Probing the Cognitive Representation of Musical Time: Structural Constraints on the Perception of Timing Perturbatons. Cognition 44: 241–281.
Rosenthal, D. (1992). Machine Rhythm: Computer Emulation of Human Rhythm Perception. MIT Media Lab. Ph.D Thesis.
Seashore, C. (1938). The Psychology of Music. McGraw-Hill: New York.
Selkirk, E. (1984). Phonology and Syntax: The Relation between Sound and Structure. MIT Press: Cambridge, MA.
Shaffer, H. (1981). Performances of Chopin, Bach and Bartok: Studies in Motor Programming. Cognitive Psychology 13: 326–376.
Sloboda, J. (1983). The Communication of Musical Meter. Quarterly Journal Of Experimental Psychology 35: 377–396.
Todd, N. P. (1985). A Model of Expressive Timing in Tonal Music. Music Perception 3: 33–58.
Todd, N. P. McAngus (1989). Towards a Cognitive Theory of Expression: The Performance and Perception of Rubato. Contemporary Music Review 4: 405–416.
Todd, N. P. McAngus (1992). The Dynamics of Dynamics: A Model of Musical Expression. J. Acoust. Soc. Am 91(6): 3540–3550.
Todd, N. P. McAngus (1994a). The Auditory “primal sketch”: A Multi-Scale Model of Rhythm Grouping. J. New Music Research 23(1): 25–70.
Todd, N. P. McAngus (1994b). A New Theory of Temporal Integration. British Journal of Audiology.
Todd, N. P. McAngus (1995). The Kinematics of Musical Expression. J. Acoust. Soc. Am 97(3), 1940–1950.
Todd, N. P. McAngus & Brown, G. (1994). A Multi-Scale Auditory Model of Prosodic Perception. Proceedings of The International Conference on Spoken Language Processing. Yokoyama, Japan.
Todd, N. P. McAngus & Lee, C. S. (1994). An Auditory-Motor Model of Beat Induction. Proceedings of The International Computer Music Conference. Denmark: Aarhaus.
Yost, W. A. & Sheft, S. (1993). Auditory Perception. In Yost, W., Popper, A. & Fay, R. (eds.) Human Psychophysics, 193–236. Springer-Verlag: NY.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
McAngus Todd, N.P., Brown, G.J. Visualization of rhythm, time and metre. Artif Intell Rev 10, 253–273 (1996). https://doi.org/10.1007/BF00127682
Issue Date:
DOI: https://doi.org/10.1007/BF00127682