Return to my homepage
MATH 203: Applied Mathematics, Computing & Statistics Projects (CAMCOS)
Spring 2017, San Jose State UniversityFinal product
Slides ReportToy data
The 20 newsgroups data set [Processed version] (use X_100)References
Overview of document clustering
- A Survey of Text Clustering Algorithms [Link]
Dimensionality reduction of document data
Spectral clustering
- Normalized cuts [paper] [software]
- NJW algorithm [paper]
- Self-tuning spectral clustering [paper] [webpage]
- A tutorial on spectral clustering [paper]
- Diffusion maps [lecture] [NIPS paper] [ACHA paper] [PNAS paper]
Landmark based spectral clustering (LSC)
- Papers: [AAAI version] [Journal version]
- Matlab code: link (download also the three functions bestMap, hungarian, MutualInfo under clustering evaluation)
- Websites: [Deng Cai] [Xinlei Chen]