Commit 3e373d4

Clip the learning rate when the number of workers is very large: needed to run on more than ~1800 GPUs on Titan
1 parent 630eff6 commit 3e373d4

1 file changed

+2
-2
lines changed

plasma/models/mpi_runner.py

Lines changed: 2 additions & 2 deletions
@@ -150,9 +150,9 @@ def __init__(self,model,optimizer,comm,batch_iterator,batch_size,num_replicas=No
         self.epoch = 0
         self.model = model
         self.optimizer = optimizer
-        self.lr = lr
-        self.DUMMY_LR = 0.1
         self.max_lr = 0.1
+        self.lr = lr if (lr < self.max_lr) else self.max_lr
+        self.DUMMY_LR = 0.1
         self.comm = comm
         self.batch_size = batch_size
         self.batch_iterator_func = batch_iterator()
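
The clipping added in this change can be illustrated in isolation. The sketch below is a minimal standalone version, not code from the repository: `clip_lr` mirrors the conditional expression in the diff, while `scaled_lr` and its linear scaling rule are hypothetical, included only to show how a learning rate that grows with the worker count could exceed `max_lr`. How `lr` is actually computed before it reaches `__init__` is not shown in this commit.

```python
def clip_lr(lr, max_lr=0.1):
    """Cap lr at max_lr, using the same expression as the diff."""
    return lr if (lr < max_lr) else max_lr


def scaled_lr(base_lr, num_workers):
    """Hypothetical linear scaling rule (an assumption, not from the commit)."""
    return base_lr * num_workers


if __name__ == "__main__":
    base_lr = 1e-4  # illustrative value only
    for num_workers in (1, 100, 2000):
        raw = scaled_lr(base_lr, num_workers)
        # With 2000 workers the raw value exceeds max_lr and is clipped.
        print(num_workers, raw, clip_lr(raw))
```

The expression is equivalent to `min(lr, self.max_lr)`; the conditional form is kept here to match the diff. Note that `self.max_lr` has to be assigned before `self.lr`, which is why the commit reorders the two lines.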

0 commit comments
