tensorflow_benchmark/tf_word_language_model/README.md
4 additions & 9 deletions
@@ -8,11 +8,6 @@ This code was forked from https://github.com/rafaljozefowicz/lm and contains "BI
 Current code runs on Tensorflow r1.5 and supports multi-GPU data parallelism using synchronized gradient updates.

 # Perplexity
-~~On One Billion Words benchmark using 8 GPUs in one DGX-1, BIG G-LSTM G4 was able to achieve 24.29 after 2 weeks of training and 23.36 after 3 weeks.~~
-
-__On 02/06/2018 We found an issue with our experimental setup which makes perplexity numbers listed in the paper invalid.__
-
-__We are currently re-running the experiments__ See current numbers in the table below.

 On DGX Station, after 1 week of training using all 4 GPUs (Tesla V100) and batch size of 256 per GPU:

@@ -26,22 +21,22 @@ On DGX Station, after 1 week of training using all 4 GPUs (Tesla V100) and batch


 ## Dependencies
-* TensorFlow r1.5
+* TensorFlow r1.6
 * Python 2.7 (should work with Python 3 too)
 * 1B Word Benchmark Dataset (https://github.com/ciprian-chelba/1-billion-word-language-modeling-benchmark to get data)

 ## To run
-Assuming the data directory is in: `/raid/okuchaiev/Data/LM1B/1-billion-word-language-modeling-benchmark-r13output/`, execute:
+Assuming the data directory is in: `/raid/ubuntu/Data/LM1B/1-billion-word-language-modeling-benchmark-r13output/`, execute: