Lectures
You can download the lectures here. We will try to upload lectures prior to their corresponding classes. Future lectures listed below likely have broken links.
-
MSDS Student Orientation Session
tl;dr: The first class will an orientation for the incoming MSDS students.
-
01 - Course Overview and Git
tl;dr: We'll give an overview of the course and a quick tour of Git.
[C1 Recording]
Suggested Readings and Slides:
-
02 - Intro to Pandas and SciKit-Learn
tl;dr: We'll do a brief introduction to Git, Pandas and Scikit-Learn which we will be using extensively in this course.
[C1 Recording]
Suggested Readings and Slides:
-
03 - A1 Spark Project Pitch Day
tl;dr: The Spark team will introduce the projects for this semester and what the selection process is.
Suggested Reading:
-
03 - C1 Spark Project Pitch Day
tl;dr: The Spark team will introduce the projects for this semester and what the selection process is.
[C1 Recording]
-
04 - Probability and Linear Algebra Refresher
tl;dr: We'll review some of the important concepts in probability and linear algebra which we will then apply in later lectures.
[C1 Recording]
Suggested Readings:
-
05 - Distance, Similarity Functions, and Time Series
tl;dr: We'll review the concepts of distance, (dis)similarity functions, and time series.
[C1 Recording]
Suggested Readings:
-
06 - Clustering I: k-Means
tl;dr: We'll introduce our first unsupervised learning method, clustering with k-means.
[C1 Recording]
Suggested Readings:
-
07 - Clustering II: In Practice
tl;dr: We'll discuss practical aspects of clustering using k-means.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
08 - Clustering III: Hierarchical Clustering
tl;dr: Description here.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
09 - Clustering IV: GMM and Expectation Maximization
tl;dr: Description here.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
10 - Classification: Decision Trees and RandomForests
tl;dr: Description here.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
11 - Classification - k-Nearest-Neighbors
tl;dr: Description here.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
12 - Classification: Naive Bayes and Support Vector Machines
tl;dr: We will discuss the Naive Bayes and SVM classifiers.
[C1 Recording TBD]
Suggested Readings:
-
13 - SVD Low Rank Approximations
tl;dr: We cover how the SVD can be used to compute low rank approximations to your data.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
14 -- Dimensionality Reduction - PCA and tSNE
tl;dr: We cover how to perform dimensionality reduction with PCA and tSNE.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
15 -- Linear Regression
tl;dr: We explore linear regression and the methods for assessing your regression model.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
16 -- Logistic Regression and Regularization
tl;dr: We explore logistic regression for binary classification and regularization for regression.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
18 -- Neural Networks I
tl;dr: The focus of this lecture is on gradient descent.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
19 -- Neural Networks II
tl;dr: This lecture introduces neural networks, the computational graph, and back propagation.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
20 -- Neural Networks III
tl;dr: This lecture introduces convolutional neural networks (CNNs).
[A1 Recording] [C1 Recording]
Suggested Readings:
-
21 -- Recommender Systems
tl;dr: Description here.
[A1 Recording] [C1 Recording]
Suggested Readings:
-
-
-
-