This actually sounds like a game changer for training large models. Really hoping someone with more expertise can break it down for me. https://www.reddit.com/user/kertara