Code
Hub
Workspaces
Connect
Indexed graphs
Engine
MCP
copy
hub
/
github.com/lm-sys/FastChat
/ ExllamaConfig
Class
ExllamaConfig
fastchat/modules/exllama.py:6–9 ·
view source on GitHub ↗
Source
from the content-addressed store, hash-verified
4
5
@dataclass
6
class
ExllamaConfig:
7
max_seq_len: int
8
gpu_split: str = None
9
cache_8bit: bool = False
10
11
12
class
ExllamaModel:
Callers
3
create_model_worker
Function · 0.90
main
Function · 0.90
create_multi_model_worker
Function · 0.90
Calls
no outgoing calls
Tested by
no test coverage detected
Used in the wild
real call sites across dependent graphs
searching dependent graphs…