Update notes, deps on distributed nodes

master
root 2022-05-29 18:30:10 -06:00
parent 9c26f46464
commit 9fdf801eb7
2 changed files with 13 additions and 39 deletions

View File

@ -33,50 +33,17 @@ they are created.
```
# On main workstation or node where you built tensorflow:
NODES="ml1 ml2 ml3 ml4 ml5"
for i in $NODES
do scp -p tensorflow-2.1.0-cp37-cp37m-linux_x86_64.whl $i:
done
```
```
# On worker nodes:
sudo apt update
sudo apt install python3-pip sshfs
# XXX deps...
pip3 install --upgrade setuptools
pip3 install --user tensorflow-2.1.0-cp37-cp37m-linux_x86_64.whl
pip3 install --user simplejson
pip3 install --user pillow
```
Another way, using upstream tensorflow packages.
You also have to install the latest `pip` from `pip`,
or you'll get `tensorflow 1.x`.
```
pip3 install pip
pip3 install --upgrade pip
sudo apt install python3-pip sshfs jq
pip3 install --upgrade --user pip
# make sure new `pip3` at `~/.local/bin/pip3` is in front in `$PATH`.
# install tensorflow
pip3 install --user tensorflow
# If that fails due to the PATH, run like:
~/.local/bin/pip3 install --user tensorflow
pip3 list | grep tensorflow
# There's a bunch of tests that can be run, such as:
python3 ~/devel/tensorflow/tensorflow/tensorflow/python/distribute/multi_worker_continuous_run_test.py
pip3 install --upgrade --user -r requirements-node.txt
# If you have cloned the tensorflow repo, test with:
#python3 ~/devel/tensorflow/tensorflow/tensorflow/python/distribute/multi_worker_continuous_run_test.py
```
# Usage
`top`
# Meh
```
# for running some tensorflow tests:
pip3 install --user portpicker
# For other examples/tests:
#pip3 install --user opencv-python
apt install python3-opencv
pip3 install --user pandas
apt install jq
```

View File

@ -0,0 +1,7 @@
opencv-python
pandas
pillow
portpicker
setuptools
simplejson
tensorflow