github.com/deepspeedai/DeepSpeedExamples
File
split_json.py
Megatron-LM/scripts/split_json.py
Source
from the content-addressed store, hash-verified
"""
Takes a corpora of files (specified by `--input_files`) with json data separated
by newlines (loose json). Splits data into train.json, val.json, test.json files
under `output_dir`.
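The splitting behavior described in the docstring can be sketched roughly as follows. This is a minimal illustration, not the script's actual implementation: the function names `split_loose_json` and `write_splits`, the 80/10/10 ratios, the fixed seed, and the shuffle step are all assumptions made for the example.

```python
import json
import os
import random


def split_loose_json(lines, ratios=(0.8, 0.1, 0.1), seed=1234):
    """Partition loose-JSON lines into train/val/test lists.

    Hypothetical helper: the ratios, the seed, and the shuffle are assumptions.
    """
    # Drop blank lines; every remaining line should hold one JSON document.
    records = [line for line in lines if line.strip()]
    for record in records:
        json.loads(record)  # raises ValueError on a malformed line

    # Shuffle with a seeded generator so the split is reproducible.
    rng = random.Random(seed)
    rng.shuffle(records)

    n_train = int(len(records) * ratios[0])
    n_val = int(len(records) * ratios[1])
    return {
        "train": records[:n_train],
        "val": records[n_train:n_train + n_val],
        "test": records[n_train + n_val:],  # remainder goes to test
    }


def write_splits(splits, output_dir):
    """Write each split to <output_dir>/<name>.json, one JSON document per line."""
    os.makedirs(output_dir, exist_ok=True)
    for name, records in splits.items():
        path = os.path.join(output_dir, f"{name}.json")
        # Opening in "w" mode overwrites any existing file with the same name.
        with open(path, "w", encoding="utf-8") as handle:
            for record in records:
                handle.write(record.rstrip("\n") + "\n")
```

Assigning the remainder to the test split keeps every input line in exactly one output file even when the ratios do not divide the line count evenly.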
Callers
nothing calls this directly
Calls (6)
get_lines · Function · 0.85
get_splits · Function · 0.85
get_filepaths · Function · 0.85
write_files · Function · 0.85
append · Method · 0.80
exists · Method · 0.45
Tested by
no test coverage detected