Mineria de Datos
Mineria de Datos
Mar 24, 2024
·
1 min read
All the different classes can be found here!
This is the CC5205 course from the Universidad de Chile. I restructured it so that it is more adapted to nowadays techniques and more machine learning oriented, it is heavily based on scikit!
Here’s a summary:
- General introduction: Definitions of Data Mining, Data Science, and content of the class
- Data I: (Un)structured data, Representation, Normalization, Noise removal, …
- [TODO] Data II: Basic statistics for data exploration.
- Intro to Supervised Learning: Basics of Machine Learning and supervised learning.
- Intro to Fairness and Biases: How to avoid making bad models.
- Linear Models: A very simple model, which is the base of deep neural networks!
- [TODO] Classifiers: KNN, Naive Bayes, Decision Tree, Boosting, Bagging, Random Forests.
- [TODO] Dimensionality Reduction: Principal Component Analysis, Independant Component Analysis, t-SNE, UMAP,…
- [TODO] Clustering methods: Clustering methods and associated metrics
- SVM, SVR: Hinge loss, Lagrangian, KKT conditions, non-linear SVM, Kernel trick, SV Regressor
- Introduction to Neural Nets: Basics of Deep Learning
- Introduction to NLP (Invited Speaker: Juan Jose Alegria): How to deal with natural language.