CS341
Project in Mining Massive Data Sets
Spring 2012
CS341 is an advanced project-based course. Students will work on data
mining and machine learning algorithms for analyzing very large amounts
of data. Course staff will provide both interesting large datasets and
the computational infrastructure (a large MapReduce cluster).
Announcements:
- 06/11: Final project reports
- 05/29: Writeups are due on 6/11 at 5pm. More details.
- 05/29: Project presentations will be held in class on 6/5 and 6/7. The exact schedule will be posted.
- 03/31: First class will be held on Tuesday 4/3 4:15-5:30 in Hewlett 103.
- 03/07: On Monday 3/19 at 5pm we will hold a Q&A information session in Gates 415.
We will discuss available datasets, potential project ideas and answer any questions students might have.
- 03/04: Course applications are now open! Applications are due on Thursday March 29 5pm.
Course information:
Instructors:
Jure Leskovec (jure@cs.stanford.edu)
Anand Rajaraman (datawocky@gmail.com)
Jeff Ullman (ullman@gmail.com)
Andreas Weigend (aweigend@stanford.edu)
Class meetings:
Tuesdays and Thursdays 4:15PM - 5:30PM in Hewlett 103.
This is a project course. There will be only a few weekly lectures,
and only one or two introductory homeworks. We will spend the quarter
working in teams on different large-scale data mining projects.
Each team will meet individually with its assigned mentor.
Teaching assistant:
Keith Siilats (siilats@stanford.edu)
Office Hours: Wednesdays 12:30-2:00pm, Gates B28
Communication:
You can reach us at cs341-spr1112-staff@lists.stanford.edu
Use Piazza to post class-related questions: http://piazza.com/class#spring2012/cs341
(Piazza requires an @stanford.edu email address to register. If you do not have an @stanford.edu address, email us your address and we will add you to Piazza.)
Previous versions of the course:
CS341: Spring 2011
Acknowledgment:
The course is generously supported by Amazon, which has given us access to its EC2 platform.