CS341:
Mining Massive Data Sets
Winter 2011
Handouts:
- 3/29/2011 Twitter Class Projects[PDF], Project Ideas[PDF]
- 3/31/2011 Methods for High Degrees of Similarity[PDF]
- 4/05/2011 Cluster-Based Join Algorithms[PDF]
- 4/07/2011 Introduction To Hive on Amazon EC2[PDF], Example files[ZIP]