The analysis of data (DNA, music, images, video, financial records, blogs, medical records,
software, computer game logs, multimedia, social networks) is an important frontier in computer
science. This frontier is expanding vastly thanks to new developments in
mathematical modelling, algorithms, and computing infrastucture. This course will teach the basic
principles and skills required for analysing data in a principled way: finding statistical patterns,
dimensionality reduction, clustering, classification and prediction. Students will also have the
opportunity of learning Python, one of the most widely used new programming languages.
Time: Mon Wed Fri 1:00-2:00pm
Location: Dempster 301
Instructor: Nando de Freitas (nando at cs)
Office hours: W 10:30-12:00 and 2:00-3:00pm (ICICS 104).
TAs:
Bo Chen (bochen at cs) and
Pooyan Fazli (pooyanf at cs).
Bo's TA hour: Monday 10:00am (Learning Centre - X150)
Pooyan's TA hour: Thursday 1pm (Learning Centre - X150)
Books
There's no official textbook. I will provide handouts, but do recommend the following books:
The elements of Statistical Learning.
Pattern Recognition and Machine Learning.
All of Statistics.
Pattern Classification.
Websites
You can befriend the user cpsc340 at Worio to see a set of websites related to the course content.
Grading
Assignments: 20%
Midterm 1: 20%. (Friday Oct 17)
Midterm 2: 20%. (Friday Nov 14)
Final: 40% (3:30pm, December 3)
There will also be a special research project for advanced students.
The instructor has the right to change the marking scheme under
reasonable circumstances.
Assignments
Assignments will involve both written and python programming problems.
All assignments are due on the specified time. 20% off for each day late.
Assignments will not be accepted after 5 days late.
Newsgroup: ubc.courses.cpsc.340