Feature Engineering

Math 158 - Spring 2022

Jo Hardin (from Mine Çetinkaya-Rundel)

Comparing Models

Data Modeling: The analysis in this culture starts with assuming a stochastic data model for the inside of the black box…The values of the parameters are estimated from the data and the model then used for information and/or prediction
Algorithmic Modeling: The analysis in this culture considers the inside of the box complex and unknown. [The] approach is to find a function f(x) — an algorithm that operates on x to predict the responses y.

Reference: Leo Breiman (2001)

metric	train	test	comparison
RMSE	0.302	0.411	RMSE lower for training
	0.67	0.468	higher for training

Feature Engineering Math 158 - Spring 2022 Jo Hardin (from Mine Çetinkaya-Rundel)