From the paper “Effectiveness of easy training data for hard tasks” https://arxiv.org/pdf/2401.06751.pdf