CS345A:
Data Mining
Winter 2010

Handouts:

1/5: Introduction

1/7: MapReduce

1/12: Recommendation System

1/14: Near Neighbor Search in High Dimensional Data

1/19: Locality Sensitive Hashing (LSH)

1/21: Structure of the webgraph, PageRank and Project ideas

1/22: Section on Map-Reduce infrastructure

1/26: Link Analysis

1/28: HITS and web spam

2/2: Web spam

2/4: Proximity on Graphs

2/9: Dimensionality reduction

2/11: Clustering

2/16: Mining data streams

2/18: Mining data streams (Cont)

2/23: Large scale supervised machine learning (1)

2/25: Large scale supervised machine learning (2) (guest lecture by Sugato Basu)

3/2: Large scale supervised machine learning (3)

3/4: Association Rules

3/9: Optimizing submodular functions

3/11: Mining the Web for Structured Data