As asked
Explain the bias-variance tradeoff. How does it manifest when you are choosing between a linear model and a deep neural network for a tabular dataset with 10,000 examples?
Sample answer outline
Bias is the systematic error from model assumptions; variance is sensitivity to training data fluctuations. A linear model has high bias if the true relationship is nonlinear but low variance. A deep network has low bias but high variance, tending to overfit on 10,000 examples unless regularized. With 10k tabular examples, gradient boosted trees or regularized neural nets often outperform deep models because they control variance better on small data. The right choice depends on the feature-target relationship's complexity.
Expect these follow-ups
- How does regularization reduce variance and does it always increase bias?
- When does adding more training data reduce variance versus when does it not help?