MCPcopy
hub / github.com/lm-sys/FastChat / XftConfig

Class XftConfig

fastchat/modules/xfastertransformer.py:6–15  ·  view source on GitHub ↗

Source from the content-addressed store, hash-verified

4
5@dataclass
6class XftConfig:
7 max_seq_len: int = 4096
8 beam_width: int = 1
9 eos_token_id: int = -1
10 pad_token_id: int = -1
11 num_return_sequences: int = 1
12 is_encoder_decoder: bool = False
13 padding: bool = True
14 early_stopping: bool = False
15 data_type: str = "bf16_fp16"
16
17
18class XftModel:

Callers 3

create_model_workerFunction · 0.90
mainFunction · 0.90

Calls

no outgoing calls

Tested by

no test coverage detected

Used in the wild real call sites across dependent graphs

searching dependent graphs…