hub / github.com/wandb/openui / eval

Function eval

backend/openui/eval/evaluate_weave.py:382–397
eval(mod="gpt-3.5-turbo")

Source from the content-addressed store, hash-verified

async def eval(mod="gpt-3.5-turbo"):
    pt("Initializing weave")
    weave.init("openui-dev")
    model = OpenUIModel(prompt_template=SYSTEM_PROMPT, model_name=mod)
    pt("Loading dataset")
    dataset = weave.ref("eval:v0").get()
    # dataset = Dataset(
    #     name="eval",
    #     rows=[{"prompt": "Make a cool SaaS landing page for an AI startup"}],
    # )
    evaluation = Evaluation(
        dataset=dataset,
        scorers=[scores],
    )
    pt("Running evaluation")
    await evaluation.evaluate(model)


def run_prompt_search(mod: str):
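
Because `eval` is a coroutine, a synchronous caller has to drive it through an event loop. The sketch below illustrates that pattern using stand-ins only: `StubModel`, `StubEvaluation`, `length_scorer`, and `run_eval` are hypothetical names that are not part of weave or openui, and the scoring logic is invented for illustration. It mirrors the shape of the listing (build a model, build an evaluation from dataset rows and a list of scorers, then await `evaluate`) and makes no claim about how the real library behaves.

```python
import asyncio


class StubModel:
    """Hypothetical stand-in for OpenUIModel; returns a canned string."""

    def __init__(self, model_name: str):
        self.model_name = model_name

    async def predict(self, prompt: str) -> str:
        # A real model would call an LLM here; this just wraps the prompt.
        return f"<div>{prompt}</div>"


def length_scorer(prompt: str, output: str) -> dict:
    """Hypothetical scorer: reports the length of the model output."""
    return {"length": len(output)}


class StubEvaluation:
    """Hypothetical stand-in for an evaluation over dataset rows."""

    def __init__(self, dataset, scorers):
        self.dataset = dataset
        self.scorers = scorers

    async def evaluate(self, model) -> list:
        results = []
        for row in self.dataset:
            output = await model.predict(row["prompt"])
            # Merge every scorer's dict into one result per row.
            row_scores = {}
            for scorer in self.scorers:
                row_scores.update(scorer(row["prompt"], output))
            results.append(row_scores)
        return results


async def run_eval(mod: str = "gpt-3.5-turbo") -> list:
    model = StubModel(model_name=mod)
    dataset = [{"prompt": "Make a cool SaaS landing page for an AI startup"}]
    evaluation = StubEvaluation(dataset=dataset, scorers=[length_scorer])
    return await evaluation.evaluate(model)


if __name__ == "__main__":
    # asyncio.run creates an event loop and runs the coroutine to completion.
    print(asyncio.run(run_eval()))
```

Calling `run_eval()` without `await` or `asyncio.run` only creates a coroutine object and does no work, which is why a synchronous entry point needs the explicit `asyncio.run` call.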

Callers 1

evaluate_weave.py · File · 0.70

Calls 3

OpenUIModel · Class · 0.85
get · Method · 0.80
pt · Function · 0.70

Tested by

no test coverage detected