a cost function J tells us how good our training is. For instance, least-squares error additional information