evaluation

our ultimate goal is to create a generalized model that learns training data and extrapolate to future test data.
We don’t really care about how good we fit the training data.
key idea: fit the model on train set, and test on separate test set.
requirements We split our training set into three parts
training set: to fit the model validation set: quasi-test set test set: actual test (we do it only once) additional information root-mean-square error this is basically least-squares error but with normalization

\begin{equation} \text{RMSE}\left(\theta\right) = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left(h_{\theta} \left(x^{(i)}\right) - y^{(i)}\right)^{2}} \end{equation}

we don’t train with this because its like more faff but monotonic against least-squares error so there’s no point in adding the more faff.