Project Description

Examples for distributed training of machine learning/deep learning models in TensorFlow. Every model training example can be run on a multi-node cluster.

Usage

For model 1,2,3: you can find a script called xxx.py and a corresponding folder in which there are shell scripts to launch the distributed training job.
For model 4: please refer to the corresponding README

Note:

Change some default setting (e.g., python path, HOME path, host name) before running each training job.
Make sure you understand the basics of distributed Tensorflow. See the offical tutorial for more detail.

Version and Environment

Model 1,2,3: Tensorflow version: 0.11.0rc0, Python 3, Ubuntu 16
Model 4: Tensorflow version 1.5.0, Python 3, Ubuntu 16

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
run_nn_distributed_mnist		run_nn_distributed_mnist
run_softmax_distributed_mnist		run_softmax_distributed_mnist
run_twoLayerNN_distributed_mnist		run_twoLayerNN_distributed_mnist
tensorflow-cnn-example		tensorflow-cnn-example
README.md		README.md
mnist_2hiddenLayerNN_distributed_ph.py		mnist_2hiddenLayerNN_distributed_ph.py
mnist_nn_distibuted_placeholder.py		mnist_nn_distibuted_placeholder.py
mnist_softmax_distibuted_placeholder.py		mnist_softmax_distibuted_placeholder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Description

Contents

Usage

Note:

Version and Environment

About

Releases

Packages

Languages

kzhang28/Distributed-TensorFlow-Training-Examples

Folders and files

Latest commit

History

Repository files navigation

Project Description

Contents

Usage

Note:

Version and Environment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages