COMS 4771, Machine Learning
COMS 4771 is a graduate-level introduction to the statistical principles and algorithmic paradigms of machine learning (ML). Broadly speaking, ML is concerned with the tasks of learning models from data, generalizing to unseen scenarios, and solving problems without explicit instructions. We will focus mostly on supervised learning, including both classical and deep learning methods. We will also see how ML is used in various applications in NLP, vision, robotics, etc. throughout the course.
Course Objectives
- Identify, describe, and formulate a typical machine learning problem.
- Become proficient in the mathematical language of machine learning: Linear algebra, optimization, probability and statistics.
- Perform assessment and selection among different models for a given problem.
- Formulate and analyze the solutions for linear regression and its generalizations.
- Implement classical algorithms for linear classification, e.g. logistic regression and discriminant analysis.
- Implement classical algorithms for nonlinear classification, e.g. decision trees and kernelized methods.
- Implement unsupervised learning algorithms for clustering analysis and dimensionality reduction.
- Understand and implement the general framework for deep learning.
- Understand modern applications of deep learning, e.g. in vision and natural language.
Prerequisites
- Python proficiency
- Linear algebra
- Multivariable calculus
- Probability and/or statistics
General List of Topics
- Optimization and statistical foundations
- Nearest neighbors
- Linear regression
- Shrinkage methods
- Basis expansions
- Kernel smoothing methods
- Model selection
- Decision trees
- Boosting
- Logistic regression
- Discriminant analysis
- Support vector machines
- Clustering
- Principal components analysis
- Neural networks
- Training neural networks
- Convolutional neural networks
- Attention and transformers
