Topic
|
Date
|
Notes
|
Linear modeling
|
|
Linear models
Least squares notes
Least squares gradient descent
algorithm
Regularization
Stochastic gradient descent pseudocode
Stochastic gradient
descent (original paper)
|
Kernels
|
|
Kernels
More on kernels
|
Multiclass classification
|
|
Multiclass classification
One-vs-all method
|
Logistic regression
|
|
Logistic regression
|
Empirical and regularized risk minimization
|
|
Empirical risk minimization
Regularized risk minimization
Regularization and overfitting
|
Support vector machine
|
|
Support vector machines
|
Decision trees and random forests
|
|
Decision trees, bagging, boosting, and stacking
Decision trees (additional notes)
Ensemble methods (additional notes)
|
Feature selection
|
|
Feature selection
Feature selection (additional notes)
|
Dimensionality reduction
|
|
Dimensionality reduction
|
Clustering
|
|
Clustering
|
Maximum likelihood
|
|
Bayesian learning
|
Neural networks
|
|
Multilayer perceptrons
Single hidden layer neural network
Back propagation
|
Autoencoders
|
|
Generative models and networks
Autoencoder
|