hub / github.com/horovod/horovod / check_exit

Function check_exit

test/integration/data/elastic_tensorflow2_main.py:73–82
(epoch, batch)

Source from the content-addressed store, hash-verified

def check_exit(epoch, batch):
    key = str((epoch, batch))
    if key in exit_schedule:
        ranks_to_exit = exit_schedule[key]
        if start_rank in ranks_to_exit:
            if args.exit_mode == 'exception':
                raise RuntimeError('check_rank and exit epoch={} batch={} start_rank={} rank={}'
                                   .format(epoch, batch, start_rank, hvd.rank()))
            else:
                psutil.Process(os.getpid()).kill()
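
The lookup that check_exit performs can be sketched in isolation, without Horovod or psutil. This is a minimal illustration, not the script's actual setup: the schedule contents, the JSON encoding, and the helper name `should_exit` are assumptions made for the example. In the real file, `exit_schedule`, `start_rank`, and `args` are module-level names defined outside the excerpt. What the sketch does reflect from the source is that the schedule is keyed by the string form of the `(epoch, batch)` tuple and maps to a collection of ranks.

```python
import json

# Hypothetical schedule for illustration only. Keys are the string form of an
# (epoch, batch) tuple, values are the start ranks that should exit there.
EXAMPLE_SCHEDULE_JSON = '{"(0, 1)": [0], "(1, 2)": [0, 1]}'


def should_exit(schedule, epoch, batch, start_rank):
    """Return True if start_rank is scheduled to exit at (epoch, batch).

    Mirrors the two membership checks in check_exit, but returns a bool
    instead of raising or killing the process.
    """
    key = str((epoch, batch))  # (0, 1) becomes the string "(0, 1)"
    if key in schedule:
        return start_rank in schedule[key]
    return False


if __name__ == "__main__":
    schedule = json.loads(EXAMPLE_SCHEDULE_JSON)
    for epoch, batch, rank in [(0, 1, 0), (0, 1, 1), (1, 2, 1)]:
        print(epoch, batch, rank, should_exit(schedule, epoch, batch, rank))
```

Because the key is built with `str((epoch, batch))`, a schedule entry must be spelled with the comma and space exactly as Python prints a tuple, for example `"(0, 1)"` rather than `"(0,1)"`.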

Callers 1

train (Function) · 0.70

Calls 1

rank (Method) · 0.45

Tested by

no test coverage detected
