Skip to content

Commit a31300d

Browse files
committed
Updated TensorFlow readme
1 parent 5f774a5 commit a31300d

2 files changed

Lines changed: 5 additions & 10 deletions

File tree

tensorflow_benchmark/tf_word_language_model/README.md

Lines changed: 4 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -8,11 +8,6 @@ This code was forked from https://github.com/rafaljozefowicz/lm and contains "BI
88
Current code runs on Tensorflow r1.5 and supports multi-GPU data parallelism using synchronized gradient updates.
99

1010
# Perplexity
11-
~~On One Billion Words benchmark using 8 GPUs in one DGX-1, BIG G-LSTM G4 was able to achieve 24.29 after 2 weeks of training and 23.36 after 3 weeks.~~
12-
13-
__On 02/06/2018 We found an issue with our experimental setup which makes perplexity numbers listed in the paper invalid.__
14-
15-
__We are currently re-running the experiments__ See current numbers in the table below.
1611

1712
On DGX Station, after 1 week of training using all 4 GPUs (Tesla V100) and batch size of 256 per GPU:
1813

@@ -26,22 +21,22 @@ On DGX Station, after 1 week of training using all 4 GPUs (Tesla V100) and batch
2621

2722

2823
## Dependencies
29-
* TensorFlow r1.5
24+
* TensorFlow r1.6
3025
* Python 2.7 (should work with Python 3 too)
3126
* 1B Word Benchmark Dataset (https://github.com/ciprian-chelba/1-billion-word-language-modeling-benchmark to get data)
3227

3328
## To run
34-
Assuming the data directory is in: `/raid/okuchaiev/Data/LM1B/1-billion-word-language-modeling-benchmark-r13output/`, execute:
29+
Assuming the data directory is in: `/raid/ubuntu/Data/LM1B/1-billion-word-language-modeling-benchmark-r13output/`, execute:
3530

3631
```
3732
export CUDA_VISIBLE_DEVICES=0,1,2,3
3833
3934
SECONDS=604800
4035
LOGSUFFIX=FLSTM-F512-1week
4136
42-
python /home/okuchaiev/repos/f-lm/single_lm_train.py --logdir=/raid/okuchaiev/Workspace/LM/GLSTM-G4/$LOGSUFFIX --num_gpus=4 --datadir=/raid/okuchaiev/Data/LM/LM1B/1-billion-word-language-modeling-benchmark-r13output/ --hpconfig run_profiler=False,float16_rnn=False,epochs=5,num_steps=20,num_shards=8,num_layers=2,learning_rate=0.2,max_grad_norm=1,keep_prob=0.9,emb_size=1024,projected_size=1024,state_size=8192,num_sampled=8192,batch_size=256,fact_size=512 >> train_$LOGSUFFIX.log 2>&1
37+
python /home/ubuntu/repos/f-lm/single_lm_train.py --logdir=/raid/ubuntu/Workspace/LM/GLSTM-G4/$LOGSUFFIX --num_gpus=4 --epochs=5 --datadir=/raid/ubuntu/Data/LM/LM1B/1-billion-word-language-modeling-benchmark-r13output/ --hpconfig run_profiler=False,float16_rnn=False,epochs=5,num_steps=20,num_shards=8,num_layers=2,learning_rate=0.2,max_grad_norm=1,keep_prob=0.9,emb_size=1024,projected_size=1024,state_size=8192,num_sampled=8192,batch_size=256,fact_size=512 >> train_$LOGSUFFIX.log 2>&1
4338
44-
python /home/okuchaiev/repos/f-lm/single_lm_train.py --logdir=/raid/okuchaiev/Workspace/LM/GLSTM-G4/$LOGSUFFIX --num_gpus=1 --mode=eval_full --datadir=/raid/okuchaiev/Data/LM/LM1B/1-billion-word-language-modeling-benchmark-r13output/ --hpconfig run_profiler=False,float16_rnn=False,epochs=5,num_steps=20,num_shards=8,num_layers=2,learning_rate=0.2,max_grad_norm=1,keep_prob=0.9,emb_size=1024,projected_size=1024,state_size=8192,num_sampled=8192,batch_size=1,fact_size=512
39+
python /home/ubuntu/repos/f-lm/single_lm_train.py --logdir=/raid/ubuntu/Workspace/LM/GLSTM-G4/$LOGSUFFIX --num_gpus=1 --epochs=5 --mode=eval_full --datadir=/raid/ubuntu/Data/LM/LM1B/1-billion-word-language-modeling-benchmark-r13output/ --hpconfig run_profiler=False,float16_rnn=False,epochs=5,num_steps=20,num_shards=8,num_layers=2,learning_rate=0.2,max_grad_norm=1,keep_prob=0.9,emb_size=1024,projected_size=1024,state_size=8192,num_sampled=8192,batch_size=1,fact_size=512
4540
4641
```
4742

utils/cfg_process.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,7 @@ def generate_cfg(cfg_template, cfg_path, **infra_spec):
4949
if "num_gpus" in infra_spec and infra_spec["num_gpus"] is not None and infra_spec["num_gpus"] > 0:
5050
cmd = re.sub("--gpus=\d+", "--gpus=%d" % infra_spec["num_gpus"], cmd)
5151
elif "num_gpus" in infra_spec:
52-
cmd = re.sub("--gpus \d+", "", cmd)
52+
cmd = re.sub("--gpus=\d+", "", cmd)
5353
if "epochs" in infra_spec and infra_spec["epochs"] is not None and infra_spec["epochs"] > 0:
5454
cmd = re.sub("--epochs=\d+", "--epochs=%d" % infra_spec["epochs"], cmd)
5555
else:

0 commit comments

Comments
 (0)