DiLoCo: Distributed Low-Communication Training of Language Models

3 points by panabee 11 months ago | 0 comments