Machine Learning Intro
Machine learning is not only about writing algorithms, actually if an algorithm tells the machine what to do exactly; then it is explicitly programmed not learned how to interact.
Key Terminology:
- Model: a set of patterns learned from data.
- Algorithm - a specific ML process used to train a model.
- Training data - the dataset from which the algorithm learns the model.
- Test data - a new dataset for reliably evaluating model performance.
- Features - Variables (columns) in the dataset used to train the model.
- Target variable - A specific variable you’re trying to predict.
- Observations - Data points (rows) in the dataset.
3 elements for great machine learning:
- A skilled designer
- Cleaning and analyzing data
- Avoid overfitting
Machine learning basic steps:
- Exploratory Analysis: study your dataset carefully
- Data Cleaning: good data is the key for good models
- Feature Engineering: generate new features from the old features
- Algorithm Selection
- Model Training
*Resource: elitedatascience