Lecture Schedule

This is a tentative schedule. It is subject to change.

Date Lecture Notes Homework
Week 1
Sep 2 01 – Course Intro and Git Class Intro, Git
Sep 4 02 – Spark Project Pitch Day Pitch Deck
Week 2
Sep 9 03 – Class Survey Results, Intro to Pandas Survey Results (1), Pandas
(1) BU email address required
Sep 11 04 – DS Project Lifecycle, Intro to SciKit-Learn DS PLC, SciKit-Learn
Week 3
Sep 16 05 – Probability Refresher Probability
Sep 18 06 – Linear Algebra Refresher Linear Algebra
Week 4
Sep 23 07 – Distance, Similarity Functions and Time Series Distances, Time Series
Sep 25 08 – Clustering I: k-means k-means
Week 5
Sep 30 09 – Clusterin II: In Practice Clustering in Practice
Oct 2 10 – Clustering III: Hierarchical Hierarchical Clustering
Week 6
Oct 7 11 – Clustering IV: GMM and Expectation Maximization GMM and EM
Oct 9 12 – Learning From Data Learning From Data
Week 7
Oct 14 🍂 No Class – Monday Schedule 🍂
Oct 16 13 – Classification I: Decision Trees and Random Forests Decision Trees
Week 8
Oct 21 14 – Classification II: k-Nearest Neighbors k-NN
Oct 23 15 – Linear Regression Linear Regression
Week 9
Oct 28 16 – Logistic Regression and Regularization Logistic Regression Regularization
Oct 30 17 – SVD Low Rank Approximations SVD
Week 10
Nov 4 18 – Dimensionality Reduction: PCA and t-SNE PCA and t-SNE
Nov 6 19 – Neural Networks: Gradient Descent & Backpropagation NN I, NN II
Week 11
Nov 11 20 – Intro to NLP NLP
Nov 13 21 – Recommender Systems Recommender Systems
Week 12
Nov 18 22 – Graphs I Graphs I
Nov 20 23 – Graphs II Graphs II
Week 13
Nov 25 24 – Time Series Analysis Time Series
Nov 27 🦃 No Class – Thanksgiving Recess 🌽
Week 14
Dec 2 25 – Lecture to be announced
Dec 4 📽️ Project Presentation I 📽️
Week 15
Dec 9 📽️ Project Presentations II 📽️
Dec 10 🎉 Last Day of Classes 🎉 🔬 Projects Due 🔬
Dec 11 ✨ Spark Demo Day 4pm – 7pm ⚡️
Back to top