9.S912: Vision and learning - computers and brains

Fall 2013

Class Times: Friday 11:00-2:00 pm

Units: 3-0-9

Location: 46-5193

Instructors: Shimon Ullman, Tomaso Poggio, Ethan Meyers

Office Hours: By appointment

Email Contact : emeyers@mit.edu

Previous Class: FALL 2012

Course description

We will review and discuss research on the problem of learning to understand the world and interact with it using sensory information. We will use vision as the main domain, and review relevant learning approaches from both computational and biological perspectives. Topics will include learning in computational vision, recent advances and limitations, face processing by computers and brain, learning in synapses, reinforcement leaning and Markov decision process in computers and brains.

Class projects: The final assignment is a short paper, about 5 pages or so (due by December 11th). It needs to be related to the study of vision and learning, but other than that it's open. It can take one of several forms.

First, it can be a topic that interests you, and can be related to your research. If you have a suggested topic, send Shimon Ullman an e-mail with a short description in order to get some feedback.
A second option is to read a number of papers on a selected topic that you found interesting. If you have a preference, again you can e-mail to Shimon who can help with papers selection. The project should then be a summary of the papers (about 3 pages), and then a critique and suggestions for future studies (about 2 pages).
Third, instead of a paper you can write a wikipedia entry related to vision and learning (again send Shimon and email about the topic you will cover and why the wikipedia entry should be updated)

Readings and video lectures

M-theory: Learning representations for learning like humans learn
[1] Slides
[2] Video lecture: Part 1 , Part 2

Sensory gating, integration and prediction
[1] Video lecture: Part 1, Part 2

Learning in Spiking Networks
[1] Perceptron Notes
[2] Gutig R, Sompolinsky H (2005). The tempotron: a neuron that learns spike timing-based decisions. Nature Neuroscience 9, 42-428.
[3] Gutig R, Sompolinsky H (2009). Time-Warp-Invariant Neuronal Processing. PLoS Biol 7(7).
[4] Video lecture: Part 1, Part 2

Computational Reinforcement Learning
[1] Reinfocement Learning: An Introduction Chapters 1, 3, and 6 are probably the most relevant.
[2] Scholarpedia article on temporal difference learning
[3] Recent developments in RL: Kober J, Bagnell JA, and Peters J (2013) Reinforcement Learning in Robotics: A Survey. Int. Journal of Robotics Research.
[4] Lecture slides
[5] Video lecture: Part 1, Part 2

Cortical processes for navigating complex acoustic environments
[1] Video lecture: Part 1, Part 2

Computer Perception with Deep Learning
[1] Video lecture: Part 1, Part 2
[2] Lecture notes

Reinforcement learning in the brain
[1] Niv Reinforcement learning in the brain
[2] Video lecture: Part 1, Part 2

Machine Learning Approaches to Face Detection and Related Problems
[1] P. Viola and M. Jones (2001). Rapid object detection using a boosted cascade of simple features CVPR.
[2] R. Schapire and Y. Singer (1999). Improved boosting algorithms using confidence-rated predictions . Machine Learning.
[3] Video lecture: Part 1, Part 2

The neural computations that underlie face processing in primates
[1] Kanwisher N, McDermott J, & Chun M (1997). The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for the Perception of Faces. Journal of Neuroscience. 17 4302-4311.
[2] Freiwald WA, Tsao DY,( 2010). Functional compartmentalization and viewpoint generalization within the macaque face-processing system. Science Vol. 330 no. 6005 pp. 845-851.
[3] Issa EB, DiCarlo JJ (2012). Precedence of the eye region in neural processing of faces. J. Neurosci. 32(47):16666-82.
[4] Tan, C. and T. Poggio (2013). Faces as a Model Category" for Visual Object Recognition. MIT-CSAIL-TR-2013-004, CBCL-311, Massachusetts Institute of Technology, Cambridge, MA, March 18.
[5] Video lecture: Winrich Freiwald and Elias Issa and Charles Cadieu

Using machine learning to understand biological vision and learning
[1] S Chase, A Schwartz (2011) Inference from populations: going beyond modelsProgress in Brain Research, Vol. 192
[2] F Tong, MS Prattle (2012) Decoding patterns of human brain activity. Annu Rev Psychol 64:483-509.
[3] E Meyers, XL Qi, C Constantinidis (2012). Incorporation of new information into prefrontal cortical activity after learning working memory tasks. PNAS, 108:4651-4656.
[4] Lecture slides

Prerequisites

The course is open to all graduate students; undergraduates can take the course with instructor permission.

Grading

Grading will be based on participation and a final project.

Schedule

	Date	Title	Instructor(s)
Class 01	Fri 06 Sep		Shimon Ullman
Class 02	Fri 13 Sep	M-theory: Learning representations for learning like humans learn	Tomaso Poggio
	Fri 20 Sep	No Class
Class 03	Fri 27 Sep	Sensory gating, integration and prediction	Larry Abbot
Class 04	Fri 04 Oct	Learning in spiking networks	Haim Sompolinsky
Class 05	Fri 11 Oct	Computational Reinforcement Learning: An Introduction	Andrew Barto
Class 06	Fri 18 Oct	Cortical processes for navigating complex acoustic environments	Shihab Shamma
Class 07	Fri 25 Oct	Computer Perception with Deep Learning	Yann LeCun
Class 08	Fri 01 Nov	Reinforcement learning in the brain	Yael Niv
Class 09	Fri 8 Nov	Machine Learning Approaches to Face Detection and Related Problems	Mike Jones
Class 10	Fri 15 Nov	The neural computations that underlie face processing in primates	Winrich Freiwald, Charles Cadieu, Elias Issa
Class 11	Fri 22 Nov	No class - workshop on Learning Data Representation: Hierarchies and Invariance	Contact Lorenzo Rosasco to attend workshop
	Fri 29 Nov	No Class
Class 12	Fri 6 Dec	Using machine learning to understand biological vision and learning	Ethan Meyers

Class Times:	Friday 11:00-2:00 pm
Units:	3-0-9
Location:	46-5193
Instructors:	Shimon Ullman, Tomaso Poggio, Ethan Meyers
Office Hours:	By appointment
Email Contact :	emeyers@mit.edu
Previous Class:	FALL 2012

9.S912: Vision and learning - computers and brains Fall 2013