L1 and L2 regularization

L1 and L2 penalties are forms of regularization used to prevent overfitting in machine learning models by adding constraints to the model’s complexity.

L1 Penalty (Lasso Regularization)

The L1 penalty adds the absolute values of the coefficients (weights) to the loss function.

L 1 = λ \sum ∣ w_{i} ∣

Where: $w_{i}$ are the model weights (coefficients), and
$λ$ is the regularization parameter that controls the strength of the penalty.

It encourages sparsity by shrinking some coefficients to exactly zero, effectively selecting only the most relevant features. It’s ideal when you suspect many features are irrelevant and want to perform feature selection.

L2 Penalty (Ridge Regularization)

The L2 penalty adds the squared values of the coefficients (weights) to the loss function.

L 2 = λ \sum w_{i}^{2}

Where: $w_{i}$ are the model weights (coefficients), and
$λ$ is the regularization parameter that controls the strength of the penalty.

It penalizes large coefficients, leading to smaller weights but not exactly zero. This helps reduce model sensitivity to individual data points: it is suitable when you want to reduce overfitting while retaining all features in the model.

Combined Use: Elastic Net

In Elastic Net regularization, both L1 and L2 penalties are combined:

Elastic Net Loss = Loss Function + λ_{1} \sum ∣ w_{i} ∣ + λ_{2} \sum w_{i}^{2}

Where:
$λ_{1}$ controls the strength of the L1 penalty, and
$λ_{2}$ controls the strength of the L2 penalty.

Edmondo's Vault

Explorer

L1 and L2 regularization

L1 Penalty (Lasso Regularization)

L2 Penalty (Ridge Regularization)

Combined Use: Elastic Net

Graph View

Table of Contents

Backlinks