Michael G. Ross
I am a computer scientist and my primary research interest is the application of statistical machine learning techniques to challenging problems in computer vision, human vision, and other real-world domains.
Currently, I am a postdoctoral associate at the MIT Department of Brain and Cognitive Sciences. I am a member of Aude Oliva's Computational Visual Cognition Laboratory
Previously, I have worked with:
Current Research
- Human perception and visual feature spaces (2007-present)
Researchers in computer vision and cognitive psychology have developed many features for image classification and recognition algorithms. This project explores the relationships between some of these feature spaces and human perception.
Collaborators: Emmanuelle Boloix, Aude Oliva
- Inducing visual classification features (2005-present)
What features do humans use to classify images? Much previous work on this question assumes that people are applying a linear classifier to image pixels. Our new approach assumes a two-stage classification process - probabilistic feature detection followed by classification - and uses machine learning algorithms for graphical model parameter learning to find the features. We are also exploring the use of more sophisticated features and the recognition of natural scenes.
Collaborators: Andrew Cohen, Michelle Greene
Publications:
Presentations:
- Markov chain Monte Carlo analysis of human mass perception (2006-present)
Using a technique developed by Tom Griffiths and Adam Sanborn, we are using Markov chain Monte Carlo sampling to better understand the human perception of mass in collisions.
Collaborators: Andrew Cohen
Presentations:
Previous Research
- Iterative contextual modeling for optical character recognition (2006-2007)
This is a new approach to optical character recognition (OCR) based on building dynamic document-specific models. Previous OCR algorithms have relied on global static document, language, and font models that are extremely expensive to construct and fail to adapt to specialized vocabulary, atypical fonts, or unusual formatting. Many of these problems can be solved by dynamically constructing a model during the recognition process that accounts for vocabulary, font, and formatting consistencies within a single document or corpus of related documents.
Collaborators: Erik Learned-Miller, Michael Wick, Jerod Weinman
Publications:
- Event-related potential signal alignment and modeling (2006-2007)
Cognitive psychologists study the EEG signals produced by different stimuli and tasks to understand the underlying cognitive processes. These event-related potentials (ERPs) are contaminated with noise from unrelated brain or muscle functions and from the recording and measurement process. The most common method of reducing noise is averaging large numbers of trials, but this can cause significant loss of data, particularly in cases where the latency of ERP components is highly variable. We are attempting to use machine learning and information theory techniques to better account for latency variation.
Collaborators: Marwan Mattar, Lisa Sanders, Erik Learned-Miller
- Using eye movements to understand complex visual comparisons (2005-2006)
This project employs the statistical analysis of eye-movement patterns to understand how humans determine the level of similarity between two graphs.
Publications:
- Kyle R. Cave, Andrew Cohen, Caren Rotello, Anthony McCaffrey, Michael G. Ross, Min Zeng, Xingshan Li, Matthew Zivot, Kris Chang. Using Eye Movements to Understand Complex Visual Comparisons. (In press)
- Learning Object Boundary Detection from Object Motion (2000-2005) [MIT Ph.D. Thesis]
Advisor:Leslie Pack Kaelbling
Publications:
Presentations:
- Combining Optical Flow and Image Segmentation (1998-2000) [MIT Master's Thesis]
Advisor: Paul Viola
Publications:
Code:
- Distributed Robot Sensing and Manipulation (1997-1998) [Dartmouth College Senior Thesis]
Advisor: Daniela Rus
Publications:
Last Modified: Sunday, 06-Jul-2008 20:32:59 EDT