Modeling attention

Models of attention and eye movements

In this work we describe a general probabilistic framework that accounts for deployment of attention in complex natural images, scene recognition and object detection. The framework integrates into a unique model three factors for attention guidance: bottom-up saliency (based on low-level image properties), target driven search (model of the appearance of the target object) and global scene priors (top-down priors provided by the gist of the scene). Using a Bayesian framework, we show how to determine (1) which are the locations and scales in the image that are best candidates to contain objects of interest, (2) how the expected appearance of the target modulates the saliency of local image regions and, (3) scene priors: used to modulate the saliency of image regions early during the visual search task.

Most models of attention and object recognition rely on the definition of sets of local features. In the local pathway, each location is represented by a vector of features that describe local image properties. It could be a collection of templates (e.g., object detection) or a vector composed by the output of wavelets at different orientations and scales (e.g., saliency models of attention). In the global pathway, the entire image is represented by a unique set of features that summarizes the appearance of the scene without encoding specific objects or regions.

Copyright Notice
The documents distributed here have been provided as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Contextual influences on saliency

A. Torralba. (to appear)

Neurobiology of Attention. Eds. L. Itti, G. Rees and J. Tsotsos. Academic Press / Elsevier.

Modeling global scene factors in attention

A. Torralba. (2003)
Journal of Optical Society of America A. Special Issue on Bayesian and Statistical Approaches to Vision. Vol. 20(7), pages 1407-1418.

(paper.pdf)

Top-Down Control of Visual Attention in Object Detection

Aude Oliva, Antonio Torralba, Monica S. Castelhano and John M. Henderson. (2003)

International Conference on Image Processing (ICIP). Vol. I, pages 253-256. September 14-17, in Barcelona, Spain

(paper.pdf)

A. Oliva, A Torralba, M. Castelhano, J. Henderson, (2003). Top-down control of visual attention in real world scenes. Visual Science Society Meeting.

J. M. Wolfe, A. Torralba, T. S. Horowitz. (2002) Remodeling Visual Search: How gamma distributions can bring those boring old RTs to life. Abstract from Visual Science Society Meeting.