Understanding overparameterization in LLMs