6.S088 Modern Machine Learning: Simple Methods that Work

Welcome to 6.S088!

Introduction

Over the past decade, interest in machine learning research has grown dramatically, with advances in deep learning being a significant driving force. Indeed, deep learning has transformed many areas of computer science, including computer vision, natural language processing, and reinforcement learning. Unfortunately, given the rapid pace of progress in deep learning, a newcomer looking for a simple set of guiding principles for building machine learning applications can easily be overwhelmed by the nuances of training deep networks. Thus, motivated by recent developments in machine learning, we present a simple class of machine learning methods that are easy to implement and achieve competitive performance in practice. In particular, our methods rely on the recently established equivalence between kernel regression and infinite-width neural networks given by the neural tangent kernel (NTK). We demonstrate that, in addition to being a theoretical tool for understanding neural networks, the NTK is a simple method for achieving competitive results in a variety of machine learning applications, including regression, classification, and matrix/image completion. We provide problem sets containing both theoretical and coding exercises with the aim of (1) giving newcomers to machine learning a simple toolkit for building effective models in practice and (2) preparing interested students for research in the area.
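
Concretely, regression with the NTK is just kernel ridge regression with a particular closed-form kernel. Below is a minimal NumPy sketch (our own illustration, not course code) using the well-known closed form of the NTK for an infinitely wide two-layer ReLU network without biases, up to normalization conventions; the names relu_ntk and ntk_regression are hypothetical.

```python
import numpy as np

def relu_ntk(X1, X2):
    # Pairwise NTK of an infinitely wide two-layer ReLU network (no biases),
    # up to normalization:
    #   K(x, x') = (||x|| ||x'|| / (2*pi)) * (sin(t) + (pi - t) * cos(t))
    #              + <x, x'> * (pi - t) / (2*pi),  t = angle between x and x'.
    n1 = np.linalg.norm(X1, axis=1, keepdims=True)   # (n1, 1)
    n2 = np.linalg.norm(X2, axis=1, keepdims=True)   # (n2, 1)
    dot = X1 @ X2.T                                  # (n1, n2) inner products
    cos_t = np.clip(dot / (n1 * n2.T), -1.0, 1.0)    # guard against rounding
    t = np.arccos(cos_t)
    nngp = n1 * n2.T * (np.sin(t) + (np.pi - t) * cos_t) / (2 * np.pi)
    return nngp + dot * (np.pi - t) / (2 * np.pi)

def ntk_regression(X_train, y_train, X_test, reg=1e-6):
    # Kernel ridge regression: f(X_test) = K_test (K_train + reg*I)^(-1) y_train.
    K_train = relu_ntk(X_train, X_train)
    K_test = relu_ntk(X_test, X_train)
    alpha = np.linalg.solve(K_train + reg * np.eye(len(X_train)), y_train)
    return K_test @ alpha

# Toy usage: regress a smooth function of 2-d inputs from 30 samples.
rng = np.random.default_rng(0)
X_train = rng.uniform(-1.0, 1.0, size=(30, 2))
y_train = np.sin(3.0 * X_train[:, 0]) + X_train[:, 1] ** 2
X_test = rng.uniform(-1.0, 1.0, size=(5, 2))
print(ntk_regression(X_train, y_train, X_test))
```

Lectures 5 and 6 derive this kernel and extend it to deep and convolutional architectures.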

Organization

Units: 1-0-5

Prereqs: Knowledge of linear algebra (level of 18.06 or 18.700), analysis (level of 18.100), and probability (level of 6.041 or 18.600) is generally assumed. Familiarity with Python (in particular, NumPy) is also assumed. Knowledge of Fourier analysis (18.103), functional analysis (18.102), random matrix theory (18.338), and complex analysis (18.112) is suggested for students who want to pursue research in this area.

Schedule: Monday – Friday, January 18 – January 28, 1-2:30pm, room 32-141

Instructor: Adityanarayanan Radhakrishnan, aradha@mit.edu

Faculty Advisor: Prof. Caroline Uhler

TAs: Max Luyten, George Stefanakis, Cathy Cai

Schedule

Lecture 1 (Tue, Jan. 18): Course Overview and Preliminaries
    Problem Set 1: Review of Linear Algebra, Analysis, and Probability
    Notes; Videos: Course Overview, Math Review

Lecture 2 (Wed, Jan. 19): Linear Regression
    Problem Set 2: Linear Regression and Kernel Regression
    Notes; Video: Linear Regression

Lecture 3 (Thurs, Jan. 20): Kernel Regression
    Notes; Video: Kernel Regression

Lecture 4 (Fri, Jan. 21): NNGP, Dual Activations, and Over-parameterization
    Problem Set 3: Random Fourier Features and NNGP Derivations
    Notes; Video: NNGP

Lecture 5 (Mon, Jan. 24): NTK Origin and Derivation
    Problem Set 4: NTK, Neural Tangents Library + Project Proposals (a getting-started sketch with the library appears after this schedule)
    Notes; Video: NTK Introduction

Lecture 6 (Tue, Jan. 25): NTK for Deep Networks and the Convolutional NTK (CNTK)
    Notes; Video: NTK for Deep Networks

Lecture 7 (Wed, Jan. 26): NTK Applications - Matrix Completion and Image Inpainting
    Notes

Lecture 8 (Thurs, Jan. 27): Additional Office Hours

Lecture 9 (Fri, Jan. 28): Project Proposal / Paper Review Presentations
    Notes
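
For students starting Problem Set 4, the sketch below shows one way to compute infinite-width NTK predictions with Google's neural-tangents library. This assumes the library's public stax/predict API; the architecture and random data are placeholders, not the problem-set setup.

```python
import jax.numpy as jnp
from jax import random
import neural_tangents as nt
from neural_tangents import stax

# Infinite-width fully-connected ReLU network; kernel_fn computes its
# NNGP and NTK kernels in closed form.
init_fn, apply_fn, kernel_fn = stax.serial(
    stax.Dense(512), stax.Relu(),
    stax.Dense(512), stax.Relu(),
    stax.Dense(1),
)

# Placeholder data standing in for the problem-set data.
key1, key2 = random.split(random.PRNGKey(0))
x_train = random.normal(key1, (20, 10))
y_train = jnp.sin(x_train[:, :1])
x_test = random.normal(key2, (5, 10))

# Closed-form predictions of the infinitely wide network trained to
# convergence with gradient descent on the mean squared error loss.
predict_fn = nt.predict.gradient_descent_mse_ensemble(
    kernel_fn, x_train, y_train, diag_reg=1e-4
)
y_ntk = predict_fn(x_test=x_test, get="ntk")
print(y_ntk.shape)  # (5, 1)
```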