dist_train
* Add training startup documentation
* fix (×9)