The Human Speechome Project

Roy, Deb; Patel, Rupal; DeCamp, Philip; Kubat, Rony; Fleischman, Michael; Roy, Brandon; Mavridis, Nikolaos; Tellex, Stefanie; Salata, Alexia; Guinness, Jethran; Levit, Michael; Gorniak, Peter

doi:10.1007/11880172_15

Deb Roy²²,
Rupal Patel²³,
Philip DeCamp²²,
Rony Kubat²²,
Michael Fleischman²²,
Brandon Roy²²,
Nikolaos Mavridis²²,
Stefanie Tellex²²,
Alexia Salata²²,
Jethran Guinness²²,
Michael Levit²² &
…
Peter Gorniak²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4211))

Included in the following conference series:

International Workshop on Emergence and Evolution of Linguistic Communication

864 Accesses
54 Citations

Abstract

The Human Speechome Project is an effort to observe and computationally model the longitudinal course of language development for a single child at an unprecedented scale. We are collecting audio and video recordings for the first three years of one child’s life, in its near entirety, as it unfolds in the child’s home. A network of ceiling-mounted video cameras and microphones are generating approximately 300 gigabytes of observational data each day from the home. One of the worlds largest single-volume disk arrays is under construction to house approximately 400,000 hours of audio and video recordings that will accumulate over the three year study. To analyze the massive data set, we are developing new data mining technologies to help human analysts rapidly annotate and transcribe recordings using semi-automatic methods, and to detect and visualize salient patterns of behavior and interaction. To make sense of large-scale patterns that span across months or even years of observations, we are developing computational models of language acquisition that are able to learn from the childs experiential record. By creating and evaluating machine learning systems that step into the shoes of the child and sequentially process long stretches of perceptual experience, we will investigate possible language learning strategies used by children with an emphasis on early word learning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A modular, extensible approach to massive ecologically valid behavioral data

Article 16 November 2018

A thorough evaluation of the Language Environment Analysis (LENA) system

Article 29 July 2020

Collecting and Analyzing Spontaneous Speech Data

References

Tomasello, M., Stahl, D.: Sampling children’s spontaneous speech: How much is enough? Journal of Child Language 31, 101–121 (2004)
Article Google Scholar
Roy, D., Pentland, A.: Learning words from sights and sounds: A computational model. Cognitive Science 26(1), 113–146 (2002)
Article Google Scholar
Roy, D.: Semiotic schemas: A framework for grounding language in action and perception. Artificial Intelligence 167(1-2), 170–205 (2005)
Article Google Scholar
Gorniak, P.: The Affordance-Based Concept. PhD thesis, Massachusetts Institute of Technology (2005)
Google Scholar
Fleischman, M., Roy, D.: Why are verbs harder to learn than nouns? Initial insights from a computational model of situated word learning. In: Proceedings of the 27th Annual Meeting of the Cognitive Science Society (2005)
Google Scholar
Roy, D., Patel, R., DeCamp, P., Kubat, R., Fleischman, M., Roy, B., Mavridis, N., Tellex, S., Salata, A., Guinness, J., Levit, M., Gorniak, P.: The Human Speechome Project. In: Proceedings of the 28th Annual Cognitive Science Conference (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Cognitive Machines Group, MIT Media Laboratory,
Deb Roy, Philip DeCamp, Rony Kubat, Michael Fleischman, Brandon Roy, Nikolaos Mavridis, Stefanie Tellex, Alexia Salata, Jethran Guinness, Michael Levit & Peter Gorniak
Communication Analysis and Design Laboratory, Northeastern University,
Rupal Patel

Authors

Deb Roy
View author publications
You can also search for this author in PubMed Google Scholar
Rupal Patel
View author publications
You can also search for this author in PubMed Google Scholar
Philip DeCamp
View author publications
You can also search for this author in PubMed Google Scholar
Rony Kubat
View author publications
You can also search for this author in PubMed Google Scholar
Michael Fleischman
View author publications
You can also search for this author in PubMed Google Scholar
Brandon Roy
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaos Mavridis
View author publications
You can also search for this author in PubMed Google Scholar
Stefanie Tellex
View author publications
You can also search for this author in PubMed Google Scholar
Alexia Salata
View author publications
You can also search for this author in PubMed Google Scholar
Jethran Guinness
View author publications
You can also search for this author in PubMed Google Scholar
Michael Levit
View author publications
You can also search for this author in PubMed Google Scholar
Peter Gorniak
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Vrije Universiteit Amsterdam, The Netherlands
Paul Vogt
RIKEN Brain Science Institute, 2-1 Hirosawa, 3510198, Wako-shi, Saitama, Japan
Yuuya Sugita
IRIDIA, CoDE, Université Libre de Bruxelles, Brussels, Belgium
Elio Tuci
Adaptive Systems Research Group, University of Hertfordshire, College Lane, AL10 9AB, Hatfield, Hertfordshire, U.K.
Chrystopher Nehaniv

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roy, D. et al. (2006). The Human Speechome Project. In: Vogt, P., Sugita, Y., Tuci, E., Nehaniv, C. (eds) Symbol Grounding and Beyond. EELC 2006. Lecture Notes in Computer Science(), vol 4211. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880172_15

Download citation

DOI: https://doi.org/10.1007/11880172_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45769-5
Online ISBN: 978-3-540-45771-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics