@article{liu2023sophia, title={Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training}, author={Liu, Hong and Li, Zhiyuan and Hall, David and Liang, Percy and Ma, Tengyu} ...