|
CAMBRIDGE, MA -- September 12, 2005 -- As MTV images flash on the screen, we instantly recognize a face and a car after just a brief glimpse. We need more time, however, to scan a scene when searching for our car in a parking lot or a friend's face in a crowd. To understand fully how we recognize objects in visually cluttered scenes, neuroscientists must first know how the brain filters the initial input. Yet little is known about how the brain represents briefly glimpsed scenes of visual clutter, even those with just two or three objects.
Now, researchers at The McGovern Institute for Brain Research at MIT have found strong evidence for a new understanding of how the brain processes several objects at once, organizing the visual information in a systematic, predictable manner for higher levels of the brain to interpret. The study appears in the September issue of the Journal of Neuroscience.
"Our brains recognize objects so easily we never think about it; however, it is a very complex problem," explains James DiCarlo, a McGovern Institute neuroscientist. "We must solve this problem before we can create artificial visual machines. We can teach a robot to recognize keys, but it has difficulty recognizing keys next to a wallet."
To dissect the problem, DiCarlo's group studied brain activity during controlled clutter situations. First authors Dr. Davide Zoccolan, a postdoctoral associate, and David Cox, a graduate student in DiCarlo's lab, showed monkeys quick glimpses of scenes with single objects, or with two or three objects. The exposures were too brief for the monkeys to scan the scene or direct attention towards any one object. Using microelectrodes, the team measured the responses of 104 different neurons in the monkeys' inferotemporal (IT) visual cortex, a brain region important for visual recognition.
Scientists have known that each IT neuron responds more intensely to some objects than others, suggesting that these neurons play a role in object discrimination. For example, a neuron may fire intensely at the sight of a key, but barely at all to a wallet. How does that neuron respond to the sight of both a key and a wallet in one scene? The limited existing data suggested that the presentation of the wallet degraded the responses of IT neurons responding to the key, as if noise randomly blurs the brain's representation of the key.
However, the McGovern team's study reveals how the brain may cleverly avoid being confused by simple clutter. They found that, indeed, additional objects in the scene do change an IT neuron's response. But this change obeys a remarkably systematic rule: the neuron's response to two objects together is the average of its responses to each item separately - halfway between the responses elicited by a key and wallet individually. The same rule holds true when three objects are presented.
This work was supported by the National Institutes of Mental Health (NIH), the National Eye Institute (NIH), the Pew Charitable Trusts, an International Human Frontier Science Program Organization Post-doctoral Fellowship, and a National Defense Science and Engineering Graduate Fellowship.
About the McGovern Institute at MIT
The McGovern Institute at MIT is a research and teaching institute committed to advancing human understanding and communications. Led by a team of world-renowned, multi-disciplinary scientists, The McGovern Institute was established in February 2000 by Lore Harp McGovern and Patrick McGovern to meet one of the great challenges of modern science - the development of a deep understanding of thought and emotion in terms of their realization in the human brain. Additional information is available at: http://web.mit.edu/mcgovern
Contacts
skyemedia, inc.
Lyn Chamberlin, 978-443-0400
lyn@skyepr.com
or
skyemedia, inc.
Derek Beckwith, 978-443-0400
derek@skyepr.com
###
|