our ultimate goal is to create a generalized model that learns training data and extrapolate to future test data. We don’t really care about how good we fit the training data. key idea: fit the model on train set, and test on separate test set. requirements We split our training set into three parts training set: to fit the model validation set: quasi-test set test set: actual test (we do it only once) additional information root-mean-square error this is basically least-squares error but with normalization

\begin{equation} \text{RMSE}\left(\theta\right) = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left(h_{\theta} \left(x^{(i)}\right) - y^{(i)}\right)^{2}} \end{equation}

we don’t train with this because its like more faff but monotonic against least-squares error so there’s no point in adding the more faff.

[[curator]]
I'm the Curator. I can help you navigate, organize, and curate this wiki. What would you like to do?