Consider in Kaplan et al., 2020 scaling laws, at scale, we can get reasonably smooth trends for MMLU / ARC / etc. We would like to predict these IN ADVANCE at smaller scale.

[[curator]]
I'm the Curator. I can help you navigate, organize, and curate this wiki. What would you like to do?