| title | What is Machine Learning? | ||||
|---|---|---|---|---|---|
| sidebar_label | Introduction | ||||
| description | Understanding the paradigm shift from traditional programming to data-driven learning. | ||||
| tags |
|
At its simplest, Machine Learning (ML) is the field of study that gives computers the ability to learn without being explicitly programmed. Instead of a human writing a thousand "if-then" statements, we provide an algorithm with data, and the algorithm "finds" the patterns itself.
To understand ML, we must compare it to Traditional Programming.
In traditional software engineering, a human provides the Rules (code) and the Data. The computer follows the rules to produce an Output.
In ML, we provide the Data and the Output (labels). The computer analyzes these to produce the Rules (the Model).
graph TD
subgraph Traditional ["Traditional Programming"]
Data1[Data] --> Logic[Rules/Code]
Logic --> Out1[Output]
end
subgraph ML ["Machine Learning"]
Data2[Data] --> Answer[Expected Output]
Answer --> Learn[Learning Algorithm]
Learn --> Model[Rules/The Model]
end
style Logic fill:#f5f5f5,stroke:#333,color:#333
style Model fill:#e1f5fe,stroke:#01579b,color:#333
Machine Learning is generally divided into three main categories based on how the agent "learns."
The model is trained on labeled data. You give it inputs and the correct answers. It’s like a student learning with a teacher who corrects their homework.
- Regression: Predicting a continuous number (e.g., Home prices).
- Classification: Predicting a category (e.g., Is this email Spam or Not Spam?).
The model is given unlabeled data and must find hidden structures or patterns on its own. There is no "teacher."
- Clustering: Grouping customers by similar buying habits.
- Association: Finding that people who buy bread also tend to buy butter.
The model (agent) learns by interacting with an environment. It receives rewards for good actions and penalties for bad ones. It’s how AI learns to play chess or drive autonomous cars.
Every Machine Learning problem requires three components:
- The Dataset: High-quality, representative data.
- The Features: The specific attributes or variables the model looks at (e.g., mileage, year, and brand for a car).
- The Algorithm: The mathematical process used to find patterns (e.g., Linear Regression, Neural Networks).
Building a model isn't just writing code; it's a circular process:
- Define the Goal: What are we trying to predict?
- Data Collection: Gathering raw information.
- Data Preprocessing: Cleaning and scaling (what you learned in the Data Engineering module).
- Model Training: Feeding data to the algorithm.
- Evaluation: Testing the model on data it hasn't seen before.
- Deployment: Putting the model into a real-world app.
stateDiagram-v2
[*] --> Collection
Collection --> Preprocessing
Preprocessing --> Training
Training --> Evaluation
Evaluation --> Deployment
Deployment --> Collection : Feedback Loop
ML is powerful, but it isn't always the right tool. Avoid ML if:
- You have very little data.
- The problem can be solved with simple, static logic.
- You need 100% mathematical certainty (ML is probabilistic, not deterministic).
-
Elements of AI (Free Course): A non-technical conceptual deep dive.
-
Google Machine Learning Glossary: Quickly looking up confusing terminology.
Now that you understand the "Big Picture," let's look at the most fundamental math behind almost every predictive model.