How do Lasso and Ridge regularization differ in feature selection?

Question

Accepted Answer

Lasso (L1) and Ridge (L2) both regularize models to prevent overfitting but differ in their penalty mechanisms. Lasso adds the absolute value of weights to the loss function, which can shrink some coefficients to exactly zero, thus performing automatic feature selection. Ridge adds the squared value of weights, which reduces large weights but keeps all features in the model. Therefore, Lasso is ideal when many features are irrelevant, while Ridge is better when all features contribute but need scaling.

How do Lasso and Ridge regularization differ in feature selection?

Why Interviewers Ask This

How to Answer This Question

Key Points to Cover

Sample Answer

Common Mistakes to Avoid

Sound confident on this question in 5 minutes

Related Interview Questions

How do you handle missing or inconsistent data in a dataset?

What are the steps involved in the typical lifecycle of a data science project?

What is Elastic Net and when should it be used?