I'm trying to build a distribute tensorflow framwork template, but there are serval problems confused me.
--sync_replas=True
in the script,does it mean I use Synchronous training
as in doc?1499169072.773628: Worker 0: training step 1 done (global step: 339)
what's the relation between training step
and global step
?
As you can see from the create cluster script, I created an independent cluster.Can I run multiple different models on this cluster at the same time?