Distributed Training on Large DataΒΆ