Backslash: Rate Constrained Optimized Training of Large Language Models (arxiv.org)
3 points by PaulHoule a day ago