MCPcopy
hub / github.com/horovod/horovod / check_exit

Function check_exit

test/integration/data/elastic_tensorflow_keras_main.py:82–91  ·  view source on GitHub ↗
(epoch, batch)

Source from the content-addressed store, hash-verified

80
81
82def check_exit(epoch, batch):
83 key = str((epoch, batch))
84 if key in exit_schedule:
85 ranks_to_exit = exit_schedule[key]
86 if start_rank in ranks_to_exit:
87 if args.exit_mode == 'exception':
88 raise RuntimeError('check_rank and exit epoch={} batch={} start_rank={} rank={}'
89 .format(epoch, batch, start_rank, hvd.rank()))
90 else:
91 psutil.Process(os.getpid()).kill()
92
93
94def log_state(state):

Callers 1

on_batch_beginMethod · 0.70

Calls 1

rankMethod · 0.45

Tested by

no test coverage detected

Used in the wild real call sites across dependent graphs

searching dependent graphs…