Overfitting

It is a modeling error that occurs when the model aligns very closely to the data points which used to train the model. As a result, the model fails to perform well with other data points. It becomes useful for the training g data set only. This happens when the model is trained for too long or if the model is complex.

( In Regression)

(In Classification)

Low bias and High variance are indicators of overfitting. In order to detect the presence of overfitting, a part of the training dataset is set aside as a test set to check for overfitting. The training accuracy will be higher than that of testing accuracy.

How to avoid Overfitting

Below are a number of techniques that you can use to prevent overfitting:

Early Stopping
Train with more data
Data augmentation
Feature selection
Regularization
Ensemble methods

I am doing a challenge - #66DaysofData in which I will be learning something new from the Data Science field for 66 days, and I will be posting daily topics on my LinkedIn, On my GitHub repository, and on my blog as well.

Stay Curious!

By Jerin Lalichan

Search This Blog

The Datamatics

Day 10 - Overfitting in Predictive Models

Overfitting

How to avoid Overfitting

Comments

Post a Comment

Popular posts from this blog

#66DaysOfData ? Here is why you should also accept the challenge.

What exactly is Data Science ? Who is a data scientist ? Explained in a simple way.

Day 17 - Ensemble Techniques in ML - Averaging, Weighted average