@article{liu2023sophia, title={Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training}, author={Liu, Hong and Li, Zhiyuan and Hall, David and Liang, Percy and Ma, Tengyu} ...
Stay up-to-date with the latest and best audio content from CBC Listen delivered to your inbox every two weeks.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results