When it comes to creating artificial intelligence, several approaches have been attempted to build an "intelligent" algorithm. However, if we look at recent developments, all the models we encounter are derived from one main paradigm: Machine Learning (often abbreviated as ML). Whether we're talking about LLMs, neural networks, or even simpler models, they all follow the principles of Machine Learning.
But what are these principles? How does a Machine Learning model work? That's what this article aims to highlight.
Before diving into the topic, I'd like to clarify a few key concepts.
Before getting into what makes a Machine Learning algorithm special, here's a brief reminder of what we generally expect from one. We want this program, given input data, to correctly predict a value: the target. In other words, we want our model to generalize rules that apply to all the data it's likely to encounter, so it can accurately predict the target. All ML problems are variations of this challenge; what changes most often is what the predicted value represents and its format (number, text, image, etc.).
A Machine Learning algorithm is something quite particular in the world of algorithms. Typically, the most common analogy for explaining an algorithm is a cooking recipe. You have ingredients, which represent the algorithm's inputs, and steps to follow, which represent the algorithm's rules. With these two elements, you produce the final dish, which represents the algorithm's output -- what we're trying to automate in the case of a computer program.
Simplified diagram of a traditional program
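To ground the recipe analogy, here is a minimal sketch of a traditional program in Python. The scenario and thresholds are made up purely for illustration; the point is that the programmer writes every rule by hand, and only the output is computed automatically.

```python
# A "traditional" program: the programmer writes the rules by hand.
# Hypothetical scenario: deciding whether to water a plant.
def should_water(soil_moisture: float, rained_today: bool) -> bool:
    # Rule 1: never water right after rain.
    if rained_today:
        return False
    # Rule 2: water only when the soil is dry.
    return soil_moisture < 0.3

# Inputs (the "ingredients") go in, the hand-written rules produce the output.
print(should_water(soil_moisture=0.2, rained_today=False))  # True
```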
A Machine Learning program works in reverse compared to a traditional program: you give it input data, possibly the expected results, and it figures out on its own which rules transform the inputs into outputs -- or, if no expected results are provided, it tries to find commonalities among the data.
Diagram of a Machine Learning program
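By contrast, here is a minimal sketch of the same idea as a Machine Learning program, using scikit-learn and a tiny toy dataset invented for illustration: we supply the inputs and the expected results, and the rules are inferred during training.

```python
# A Machine Learning program: we provide inputs and expected results,
# and the algorithm infers the transformation rules during training.
from sklearn.tree import DecisionTreeClassifier

# Toy data invented for illustration; a real dataset would be much larger.
X = [[0.2, 0.1], [0.8, 0.9], [0.3, 0.2], [0.9, 0.8]]  # input data
y = [0, 1, 0, 1]                                      # expected results

model = DecisionTreeClassifier(random_state=0)
model.fit(X, y)                       # training: the rules are learned here
print(model.predict([[0.25, 0.15]]))  # prediction on data the model hasn't seen
```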
This mechanism of finding rules based on inputs is what we call learning, or training, and it's the common thread across every Machine Learning algorithm.
From there, one of the prerequisites of this field is to build a dataset that can be fed to the algorithm to generate these rules and ensure that the resulting Machine Learning model is sufficiently performant for the task we want to accomplish.
Now that we've covered the general principle of Machine Learning, the question is: what strategies exist for teaching a machine? There are several, and I'll present three of the most common ones here:

- Supervised learning: the algorithm receives both the input data and the expected results (labels), and learns the rules that map one to the other.
- Unsupervised learning: the algorithm receives only input data, with no expected results, and tries to find commonalities or structure among the data on its own.
- Reinforcement learning: the algorithm learns by trial and error, interacting with an environment and adjusting its behavior according to the rewards or penalties it receives.
There are other types of learning, but these are the ones I find most notable and easiest to remember.
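To make the distinction concrete, here is a minimal sketch contrasting learning from labeled data with learning from unlabeled data, again with scikit-learn and toy numbers invented for the example.

```python
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

X = [[1.0], [1.2], [4.8], [5.1]]  # toy inputs for illustration

# With labels: the expected results guide the learning.
y = [0, 0, 1, 1]
classifier = LogisticRegression().fit(X, y)
print(classifier.predict([[1.1]]))   # -> [0]

# Without labels: the algorithm looks for commonalities on its own.
clustering = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(clustering.labels_)            # groups found without any labels
```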
Now that we've covered the main ML mechanisms in fairly general terms, let's look at the main obstacles you encounter when doing Machine Learning. If I had to sum up the majority of these obstacles in one word, it would be "bias": ML models can be affected by biases stemming from a variety of factors.
One of the first biases you can encounter arises when measuring your model's performance. To make sure the model has truly generalized the rules it was supposed to learn, you measure the model's score on data it hasn't seen before -- test data. The risk of measuring performance on data the model already saw during training is that the model may have memorized the data, meaning it fails to generalize. That's why the dataset is split into training and test data (typically 70-80% for training and 20-30% for testing).
This problem is tied to the method used to measure the model's performance.
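As a sketch of this measurement method, here is how a dataset is typically split with scikit-learn's train_test_split; the synthetic data and the 80/20 split are illustrative choices.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic data standing in for a real dataset.
X, y = make_classification(n_samples=1000, random_state=0)

# Keep 20% of the data aside; the model never sees it during training.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# The score that matters is the one measured on the unseen test data.
print("train accuracy:", accuracy_score(y_train, model.predict(X_train)))
print("test accuracy:", accuracy_score(y_test, model.predict(X_test)))
```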
Overfitting and underfitting are problems related to the compatibility between the chosen ML algorithm and the dataset.
Overfitting is a problem where the model fails to generalize and instead memorizes the data. The main indicator of overfitting is that the model's score measured on training data is significantly higher than the score measured on test data. This means the model is too complex for the problem at hand and needs to be simplified. It can also be a consequence of a dataset that's too small.
Underfitting, on the other hand, is a problem where the model fails to capture and adapt to the complexity of the dataset. An indicator of this is that the model's score measured on both training and test data is very low. The algorithm needs to be adjusted so it better captures the data's complexity. Like overfitting, this can also result from a dataset that's too small.
Example of underfitting and overfitting
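To illustrate how these two problems show up in practice, here is a small sketch that compares training and test scores for models of increasing complexity; tree depth stands in for complexity, and the data is synthetic.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_informative=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# max_depth limits how complex the tree can become.
for depth in (1, 5, None):
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0)
    tree.fit(X_train, y_train)
    print(f"max_depth={depth}: "
          f"train score={tree.score(X_train, y_train):.2f}, "
          f"test score={tree.score(X_test, y_test):.2f}")

# Low scores on both sets point to underfitting; a high training score
# combined with a much lower test score points to overfitting.
```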
Another area requiring attention is the composition of the dataset. There are all kinds of biases related to the representativeness of different types of data in the initial dataset. For example, when building a facial recognition model, the algorithm must recognize all types of faces, regardless of the person's ethnicity, whether they wear glasses, their hair color, etc. And for that, the training dataset must contain images of people with varied characteristics, with no population being over-represented.
Here, I used the example of facial recognition, but this problem can arise with other types of data as well.
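One simple way to start checking representativeness is to look at how each attribute's values are distributed in the dataset. In the sketch below, the column names and values are hypothetical and only serve to illustrate the idea.

```python
import pandas as pd

# Hypothetical metadata about the images in a face dataset;
# the columns and values are made up for illustration.
faces = pd.DataFrame({
    "wears_glasses": [False, False, True, False, False, False],
    "hair_color": ["brown", "brown", "black", "brown", "brown", "blond"],
})

# Proportion of each value per attribute: a strong imbalance
# signals that some groups may be under-represented.
for column in faces.columns:
    print(faces[column].value_counts(normalize=True), end="\n\n")
```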
ML models are particular computer programs that try to infer rules from the data fed to them as input. There are several ways to train a model, depending on whether our data is labeled or not. There are also several obstacles and biases to avoid. In future articles, we'll revisit these approaches and see how, in practice, you create a model and navigate these various biases.
A pillar of Lamalo, Yohann combines technical expertise with a gift for teaching. An architect at heart and a talented developer, he brings his energy and skills to the scale-up Lamalo. Ever the educator, he is always willing to share his knowledge.