Dion2 reduces the cost of Muon’s orthonormalization step by orthonormalizing only a small, selected submatrix at each iteration. This lightweight approach preserves Muon’s strong performance while ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results