MCPcopy
hub / github.com/algorithmicsuperintelligence/optillm / load_optillm_bench

Function load_optillm_bench

scripts/eval_optillmbench.py:79–86  ·  view source on GitHub ↗

Load the OptiLLM Bench dataset.

()

Source from the content-addressed store, hash-verified

77]
78
79def load_optillm_bench() -> datasets.Dataset:
80 """Load the OptiLLM Bench dataset."""
81 try:
82 dataset = load_dataset("codelion/optillmbench")
83 return dataset["test"] # We use the test split for evaluation
84 except Exception as e:
85 logger.error(f"Error loading dataset: {e}")
86 raise
87
88def extract_gsm8k_answer(text: str) -> float:
89 """Extract numerical answer after ### from GSM8K responses."""

Callers 1

mainFunction · 0.85

Calls

no outgoing calls

Tested by

no test coverage detected