hub / github.com/horovod/horovod / check_exit

Function check_exit

test/integration/data/elastic_tensorflow_main.py:80–89
(epoch, batch)

Source from the content-addressed store, hash-verified

def check_exit(epoch, batch):
    key = str((epoch, batch))
    if key in exit_schedule:
        ranks_to_exit = exit_schedule[key]
        if start_rank in ranks_to_exit:
            if args.exit_mode == 'exception':
                raise RuntimeError('check_rank and exit epoch={} batch={} start_rank={} rank={}'
                                   .format(epoch, batch, start_rank, hvd.rank()))
            else:
                psutil.Process(os.getpid()).kill()
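
The lookup key in check_exit is the string form of the (epoch, batch) tuple, so a schedule entry only matches if its key uses Python's tuple repr exactly, including the space after the comma. The following is a minimal sketch of that lookup, assuming the schedule is a JSON object mapping such keys to lists of ranks. The schedule_json value and the ranks_to_exit_at helper are hypothetical and do not appear in the source file.

```python
import json

# Hypothetical schedule: keys are str((epoch, batch)), values are lists of ranks.
schedule_json = '{"(0, 1)": [1], "(2, 0)": [0, 2]}'
exit_schedule = json.loads(schedule_json)


def ranks_to_exit_at(epoch, batch, schedule):
    """Return the ranks scheduled to exit at this (epoch, batch), or an empty list."""
    key = str((epoch, batch))  # same key construction as check_exit
    return schedule.get(key, [])


print(ranks_to_exit_at(0, 1, exit_schedule))  # entry exists for "(0, 1)"
print(ranks_to_exit_at(0, 2, exit_schedule))  # no entry for "(0, 2)"
```

A key written without the space, such as "(0,1)", would never match, because str((0, 1)) produces "(0, 1)".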

Callers 1

train · Function · 0.70

Calls 1

rank · Method · 0.45

Tested by

no test coverage detected
