This is the CC5205 course from the Universidad de Chile. I restructured it to better reflect current techniques and to be more machine-learning oriented. It relies heavily on scikit-learn.
Definitions of Data Mining and Data Science, and an overview of the class content.
(Un)structured data, representation, normalization, noise removal, etc.
Basic statistics for data exploration.
Basics of Machine Learning and supervised learning.
How to avoid making bad models.
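
As a small taste of the normalization and basic-statistics topics listed above, here is a minimal sketch of z-score normalization using only the Python standard library. The function name `z_score` and the sample data are made up for illustration and are not taken from the course material. In the course itself you would more likely reach for a library scaler.

```python
import statistics


def z_score(values):
    """Rescale values to zero mean and unit (population) standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        # A constant column has no spread; return zeros instead of dividing by zero.
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


# Hypothetical sample: a handful of heights in centimetres.
heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]
scaled = z_score(heights_cm)
print(scaled)
```

After scaling, features measured in different units become comparable. This matters for models that rely on distances or gradient-based optimization.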