hub / github.com/FareedKhan-dev/train-llm-from-scratch / _log1mexp

Function _log1mexp

src/post_training/dpo.py:43–45

Numerically stable log(1 - exp(x)) for x < 0.

(x: torch.Tensor)

Source

import torch


def _log1mexp(x: torch.Tensor) -> torch.Tensor:
    """Numerically stable log(1 - exp(x)) for x < 0."""
    # Branch at -0.6931 (about -ln 2): expm1 is accurate when x is near 0,
    # log1p is accurate when x is very negative and exp(x) is tiny.
    return torch.where(x > -0.6931, torch.log(-torch.expm1(x)), torch.log1p(-torch.exp(x)))
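
To see why the function branches at roughly -ln 2, here is a minimal scalar sketch that uses only the standard library `math` module. It is not the repository's code: `log1mexp_scalar` and `log1mexp_naive` are hypothetical names introduced for illustration, and the sketch assumes that the same two-branch scheme carries over from tensors to Python floats.

```python
import math

LN2 = math.log(2.0)  # about 0.6931, the branch point used in _log1mexp


def log1mexp_scalar(x: float) -> float:
    """Scalar sketch of log(1 - exp(x)) for x < 0 (hypothetical helper)."""
    if x >= 0.0:
        raise ValueError("x must be negative")
    if x > -LN2:
        # exp(x) is close to 1 here, so 1 - exp(x) would cancel badly;
        # expm1 computes exp(x) - 1 without that cancellation.
        return math.log(-math.expm1(x))
    # exp(x) is small here; log1p keeps precision for log(1 - tiny).
    return math.log1p(-math.exp(x))


def log1mexp_naive(x: float) -> float:
    """Direct formula, shown only for comparison (hypothetical helper)."""
    return math.log(1.0 - math.exp(x))


if __name__ == "__main__":
    # Very negative x: the naive form rounds 1 - exp(x) to exactly 1.0.
    print(log1mexp_naive(-50.0), log1mexp_scalar(-50.0))

    # x very close to 0: the naive form computes log(0.0) and fails.
    try:
        log1mexp_naive(-1e-20)
    except ValueError as err:
        print("naive failed:", err)
    print(log1mexp_scalar(-1e-20))
```

Away from both extremes, for example at x = -1.0, the two forms agree closely. The branch only matters where the direct formula loses digits.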

Callers 1

orpo_loss · Function · 0.85

Calls 1

log · Method · 0.80

Tested by

no test coverage detected