Distributed Crunching #4
Loading…
Reference in New Issue
There is no content yet.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may exist for a short time before cleaning up, in most cases it CANNOT be undone. Continue?
Set up Tensorflow to use multiple nodes.
ml1, ml2, ml3, ml4, ml5 are set up with Debian Buster. Ready to install Tensorflow, etc.
It will correctly (afaict) distribute for Sequential() and model.compile but fails on model.fit.
Some startup warnings to clean up:
Also looks like a small thing to fix:
Main problem: during fit(), in the first epoch after running for awhile, it dies with this: