Introduction to Machine Learning

"The field of study that gives computers the ability to learn without being explicitly programmed."
— Arthur Samuel (1959)

Categories of Machine Learning

Supervised learning involves training an algorithm on a labeled dataset, which means the data includes both input (X) and output (Y) values.

Used when the output (Y) is a continuous number (from infinitely many possibilities).
Example: Predicting house prices.

Used when the output (Y) is categorical (from a small set of categories).
Examples:
- Breast cancer detection (malignant vs. benign)
- Identifying whether a photo contains a cat or a dog

Inputs are called: Features

In unsupervised learning, the data only contains inputs (X) and no corresponding output labels (Y).

The algorithm must discover patterns, structures, or relationships in the data on its own.