hub / github.com/lm-sys/FastChat / ExllamaConfig

Class ExllamaConfig

fastchat/modules/exllama.py:6–9


    @dataclass
    class ExllamaConfig:
        max_seq_len: int
        gpu_split: str = None
        cache_8bit: bool = False
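A minimal sketch of how a config like this might be constructed. The dataclass is redefined locally, mirroring the excerpt above, so the snippet runs without FastChat installed. The helper `parse_gpu_split` and the comma-separated format it expects are assumptions made for illustration; they are not taken from this page.

```python
from dataclasses import dataclass, asdict
from typing import List, Optional


@dataclass
class ExllamaConfig:
    # Local copy of the definition shown above (not imported from FastChat).
    max_seq_len: int
    gpu_split: str = None
    cache_8bit: bool = False


def parse_gpu_split(value: Optional[str]) -> Optional[List[float]]:
    # Hypothetical helper: ASSUMES gpu_split is a comma-separated list of
    # per-GPU amounts, e.g. "18,24". Returns None when no split is given.
    if not value:
        return None
    return [float(part) for part in value.split(",")]


# Only max_seq_len is required; the other two fields fall back to defaults.
default_cfg = ExllamaConfig(max_seq_len=4096)

# All three fields set explicitly.
split_cfg = ExllamaConfig(max_seq_len=8192, gpu_split="18,24", cache_8bit=True)

print(asdict(default_cfg))
print(parse_gpu_split(split_cfg.gpu_split))
```

Because `max_seq_len` has no default, omitting it raises a `TypeError` at construction time, while `gpu_split` and `cache_8bit` can be left out.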

Callers 3

create_model_worker · Function · 0.90
main · Function · 0.90

Calls

no outgoing calls

Tested by

no test coverage detected
