| 34 | |
| 35 | @dataclass |
| 36 | class GenerationConfig: |
| 37 | # this config is used for chat to provide more diversity |
| 38 | max_length: int = 32768 |
| 39 | top_p: float = 0.8 |
| 40 | temperature: float = 0.8 |
| 41 | do_sample: bool = True |
| 42 | repetition_penalty: float = 1.005 |
| 43 | |
| 44 | |
| 45 | @torch.inference_mode() |
no outgoing calls
no test coverage detected