xkcd Tasks Source: xkcd

Quick Links:

News:

  • Homework 4 has been released (pdf,latex, code). It is due Thursday, November 30.
  • I have released a practice final exam with solutions.
  • The final exam will be Thursday, December 7 from 2-4pm in SLH 200.

Some problems in computer science admit precise algorithmic solutions. Checking if someone is in a national park is, in some sense, straightforward: get the user’s location, get the boundaries of all national parks, and check if the user location lies within any of those boundaries.

Other problems are less straightforward. Suppose you want your computer to determine if an image contains a bird. To your computer, an image is just a matrix of red, green, and blue pixels. How do you even begin to write the function is_bird(image)?

For problems like this, we turn to a powerful family of methods known as machine learning. The zen of machine learning is the following:

  1. I don’t know how to solve my problem.
  2. But I can obtain a dataset that describes what I want my computer to do.
  3. So, I will write a program that learns the desired behavior from the data.

This class will provide a broad introduction to machine learning. We will start with supervised learning, where our goal is to learn an input-to-output mapping given a set of correct input-output pairs. Next, we will study unsupervised learning, which seeks to identify hidden structure in data. Finally, we will cover reinforcement learning, in which an agent (e.g., a robot) learns from observations it makes as it explores the world.

Course Staff

Robin Jia
Robin Jia
Instructor

Soumya Sanyal
Soumya Sanyal
Teaching Assistant

Wang (Bill) Zhu
Wang (Bill) Zhu
Teaching Assistant

Vishesh Agrawal
Vishesh Agrawal
Course Producer

Ryan Wang
Ryan Wang
Course Producer

Lorena Yan
Lorena Yan
Course Producer

Wenyang Zhang
Wenyang Zhang
Course Producer

Logistics

Prerequisites

This class will also use some basic multivariate calculus (taking partial derivatives and gradients). However, knowledge of single-variable calculus is sufficient as we will introduce the required material during class and section.

Schedule

All assignments are due by 11:59pm on the indicated date.

Date Topic Related Readings Assignments
Tue Aug 22 Introduction (slides) PML 1 Homework 0 released (pdf,latex, code)
Thu Aug 24 Linear Regression (lecture, demo) PML 7.8, 8.2  
Fri Aug 25 Section: Probability, Linear Algebra, & Calculus Review [Soumya] (handout, notes)    
Tue Aug 29 Featurization, Convexity, Maximum Likelihood Estimation (lecture) PML 2.6.3, 4.2, 8.1  
Thu Aug 31 Logistic Regression, Softmax Regression (lecture) PML 10.1-10.3 Homework 0 due
Fri Sep 1 Section: Python & numpy tutorial [Bill] (colab)    
Tue Sep 5 Overfitting, Regularization (lecture) PML 4.5, 4.7, 11.3-11.4  
Thu Sep 7 Bias and Variance, Normal Equations (lecture) PML 11.2 Homework 1 released (pdf,latex, code)
Fri Sep 8 Section: Probability review continued, taking gradients [Soumya] (notes)    
Tue Sep 12 Generative Classifiers, Naive Bayes (lecture) PML 9.3-9.4  
Thu Sep 14 Nearest Neighbors, start of Kernels; Project discussion (lecture) PML 16.1, 16.3  
Fri Sep 15 Section: Cross-Validation, Evaluation Metrics [Bill] (slides)    
Tue Sep 19 Kernel methods continued (lecture) PML 4.3, 17.1, 17.3 Homework 1 due
Thu Sep 21 Introduction to Neural Networks, Dropout, Early Stopping (lecture) PML 13.1-13.3 Homework 2 released (pdf,latex, code)
Fri Sep 22 Section: Scikit-learn tutorial [Vishesh] (colab)    
Tue Sep 26 Backpropagation (slides; demo part 1, part 2, part 3) PML 13.4-13.5 Project Proposal due
Thu Sep 28 Convolutional Neural Networks (slides) PML 14.1-14.2  
Fri Sep 29 Section: Pytorch tutorial [Soumya] (colab)    
Tue Oct 3 Recurrent Neural Networks (slides) PML 15.1-15.2  
Thu Oct 5 Sequence-to-sequence, Attention (slides, review) PML 15.4 Homework 2 due
Fri Oct 6 Section: Midterm preparation    
Tue Oct 10 In-class Midterm Exam    
Oct 12-13 No class or section (Fall break)    
Tue Oct 17 Transformers I (slides) PML 15.5-15.6  
Thu Oct 19 Transformers II, Pretraining (slides) PML 15.7  
Fri Oct 20 Section: RNNs & backpropagation in pytorch [Soumya] (tutorial)    
Tue Oct 24 Decision Trees, ensembles (slides) PML 18.1-18.5 Homework 3 released (pdf,latex, code)
Thu Oct 26 k-Means Clustering (lecture) PML 21.3  
Fri Oct 27 Section: Optimization strategies for neural networks [Bill] (tutorial)    
Tue Oct 31 Gaussian Mixture Models, Expectation Maximization (lecture) PML 21.4, PML2 8.1-8.2 Project Midterm Report due
Thu Nov 2 Dimensionality Reduction, Principal Component Analysis (lecture) PML 20.1, 20.4  
Fri Nov 3 Section: Practical guide to pretrained language models    
Tue Nov 7 Embedding models, Word Vectors (pca part 2, wordvec slides) PML 20.5  
Thu Nov 9 Markov Decision Processes, Reinforcement Learning (lecture) PML2 34.5-34.6, 35.1, 35.4 Homework 3 due
Fri Nov 10 No section (Veteran’s Day)    
Tue Nov 14 Q-Learning (lecture) PML2 35.2-35.3 Homework 4 released (pdf,latex, code)
Thu Nov 16 Policy Gradient, Robustness, Adversarial Examples (policy gradient lecture, adversarial slides) PML2 19.1-19.8  
Fri Nov 17 Section: Practical guide to computer vision models [Vishesh] (tutorial) (slides)    
Tue Nov 21 Spurious Correlations, Fairness in Machine Learning (slides) FAML 1-4  
Nov 23-24 No class or section (Thanksgiving)    
Tue Nov 28 How does ChatGPT work? (slides)    
Thu Nov 30 Conclusion (slides)   Homework 4 due
Fri Nov 31 Section: Final Exam preparation    
Thu Dec 7 Final Exam, 2-4pm   Project Final Report due Dec 12

Grading

Grades will be based on homework assignments (40%), a class project (20%), and two exams (40%).

Homework Assignments (40% total):

Final Project (20% total). The final project will proceed in three stages:

Exams (40% total):

Late days

You have 6 late days you may use on any assignment excluding the Project Final Report. Each late day allows you to submit the assignment 24 hours later than the original deadline. You may use a maximum of 3 late days per assignment. If you are working in a group for the project, submitting the project proposal or midterm report one day late means that each member of the group spends a late day. We do not allow use of late days for the final project report because we must grade the projects in time to submit final course grades.

If you have used up all your late days and submit an assignment late, you will lose 10% of your grade on that assignment for each day late. We will not accept any assignments more than 3 days late.

Final project

The final project can be done individually or in groups of up to 3. This is your chance to freely explore machine learning methods and how they can be applied to a task of our choice. You will also learn about best practices for developing machine learning methods—inspecting your data, establishing baselines, and analyzing your errors. More information about the final project is available here.

Resources

This semester, I am writing Lecture Notes that accompany all the iPad lectures. I recommend using these notes as reference material for studying. There is no required textbook for this class. If you do want to learn from a textbook, the following may be useful:

To review mathematical background material, you may also find the following useful:

Other Notes

Collaboration policy and academic integrity: Our goal is to maintain an optimal learning environment. You may discuss the homework problems at a high level with other students, but you should not look at another student’s solutions. Trying to find solutions online or from any other sources for any homework or project is prohibited, will result in zero grade and will be reported. Using AI tools to automatically generate solutions to written or programming problems is also prohibited. To prevent any future plagiarism, uploading any material from the course (your solutions, quizzes etc.) on the internet is prohibited, and any violations will also be reported. Please be considerate, and help us help everyone get the best out of this course.

Please remember the expectations set forth in the USC Student Handbook. General principles of academic honesty include the concept of respect for the intellectual property of others, the expectation that individual work will be submitted unless otherwise allowed by an instructor, and the obligations both to protect one’s own academic work from misuse by others as well as to avoid using another’s work as one’s own. All students are expected to understand and abide by these principles. Suspicion of academic dishonesty may lead to a referral to the Office of Academic Integrity for further review.

Students with disabilities: Any student requesting academic accommodations based on a disability is required to register with Disability Services and Programs (DSP) each semester. A letter of verification for approved accommodations can be obtained from DSP. Please be sure the letter is delivered to the instructor as early in the semester as possible.