Course webpage for Math 251 Statistical and Machine Learning Classification

MATH 251: Statistical and Machine Learning Classification

Fall 2018, San Jose State University

Course description [Syllabus]

image This is an advanced topics course in the machine learning field of classification, with the goals of introducing

  1. Dimensionality Reduction
  2. Instance-based Methods
  3. Discriminant Analysis
  4. Logistic Regression
  5. Support Vector Machine
  6. Kernel Methods
  7. Ensemble Methods
  8. Neural Networks

all based on the benchmark dataset of MNIST Handwritten Digits. Such a teaching strategy was partly inspired by Michael Nielsen's free online book - Neural Networks and Deep Learning, which notes explicitly that this dataset hits a ``sweet spot'' - it is challenging, but ``not so difficult as to require an extremely complicated solution, or tremendous computational power''. In addition, the digit recognition problem is very easy to understand, yet practically important.


Course progress

DateSlidesFurther Reading
8/22 Review

Linear Algebra Review

Method of Lagrangian Multipliers

8/27 Introduction Final project instructions
9/5 Instance-based classification Chapter 2 of textbook 1
9/12 PCA  [Matrix algebra] Section 10.2 of textbook 1
9/24 LDA (for dimensionailty reduction) Prof. Olga Veksler’s lecture
10/8 Bayes classifiers Section 4.4 of textbook 1
10/15 Midterm Midterm solution  
10/22 Logistic regression Section 4.3 of textbook 1
10/29 Support vector machines [Lagrange Dual] Chapter 9 of textbook 1
11/19 Ensemble learning [Trevor Hastie's slides] [Adele Cutler's lecture] [Chapter 8 of textbook]
11/26 Neural networks [Michael Nielsen’s book] [Olga Veksler’s lecture] [Perceptron]
12/5 Course summary and project information  
12/10 Final project presentations  
12/12 Final project presentations (cont'd)  


More learning resources

 Programming languages


Useful course websites


Data sets