Learning words from sights and sounds: a computational model

Deb K. Roy; Alex P. Pentland

Cognitive Science 26 (1):113-146 (2002)

Abstract

This paper presents an implemented computational model of word acquisition which learns directly from raw multimodal sensory input. Set in an information theoretic framework, the model acquires a lexicon by finding and statistically modeling consistent cross‐modal structure. The model has been implemented in a system using novel speech processing, computer vision, and machine learning algorithms. In evaluations the model successfully performed speech segmentation, word discovery and visual categorization from spontaneous infant‐directed speech paired with video images of single objects. These results demonstrate the possibility of using state‐of‐the‐art techniques from sensory pattern recognition and machine learning to implement cognitive models which can process raw sensor data without the need for human transcription or labeling.
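
As an illustration of the "information theoretic framework" mentioned in the abstract, the sketch below computes mutual information between two discrete sequences, a common way to score how consistently a candidate word co-occurs with a visual category. The abstract does not state which measure the model uses, so the choice of mutual information, the function name mutual_information, and the toy indicator sequences are all assumptions made for illustration, not the authors' implementation.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Return the mutual information (in bits) of two equal-length
    sequences of discrete observations.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("sequences must be non-empty and of equal length")

    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of (x, y) pairs
    count_x = Counter(xs)         # marginal counts for x
    count_y = Counter(ys)         # marginal counts for y

    mi = 0.0
    for (x, y), c in joint.items():
        p_xy = c / n
        # p(x, y) / (p(x) * p(y)) simplifies to c * n / (count_x * count_y)
        ratio = c * n / (count_x[x] * count_y[y])
        mi += p_xy * math.log2(ratio)
    return mi


# Toy data (assumed): 1 means the event occurred in that utterance/image pair.
word_heard = [1, 1, 0, 0, 1, 0, 0, 1]
shape_seen = [1, 1, 0, 0, 1, 0, 0, 1]   # always co-occurs with the word
other_shape = [1, 0, 1, 0, 1, 0, 1, 0]  # unrelated to the word

consistent_score = mutual_information(word_heard, shape_seen)
unrelated_score = mutual_information(word_heard, other_shape)
```

In this toy setup a word that always appears with the same visual category receives a higher score than one paired with an unrelated category, which is the kind of consistent cross-modal structure the abstract refers to.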

Keywords

Cross‐modal; Language acquisition; Sensor grounded; Computational model; Learning

DOI

10.1207/s15516709cog2601_4
