github.com/NVIDIA/TensorRT-LLM
Method: has_per_group_scaling(self)
tensorrt_llm/quantization/mode.py:161–162
Source
from the content-addressed store, hash-verified
        return self._any(self.PER_CHANNEL)

    def has_per_group_scaling(self):
        return self._any(self.PER_GROUP)

    def has_int8_kv_cache(self):
        return self._any(self.INT8_KV_CACHE)
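
The excerpt shows only that has_per_group_scaling delegates to _any with the PER_GROUP flag; the body of _any is not part of this page. The following is a minimal sketch of how such a predicate could work, assuming the flags are bit values on an IntFlag-style enum and that _any returns true when at least one requested bit is set. SketchQuantMode and its three members are hypothetical stand-ins written for illustration, not the class defined in mode.py.

```python
from enum import IntFlag, auto


class SketchQuantMode(IntFlag):
    """Hypothetical stand-in for the flag type in mode.py (not the real class)."""

    PER_CHANNEL = auto()
    PER_GROUP = auto()
    INT8_KV_CACHE = auto()

    def _any(self, bits):
        # Assumed behaviour: true when at least one of the requested bits is set.
        return (self & bits) != 0

    def has_per_group_scaling(self):
        # type(self).PER_GROUP is used instead of self.PER_GROUP so the sketch
        # does not depend on looking up one enum member through another.
        return self._any(type(self).PER_GROUP)

    def has_int8_kv_cache(self):
        return self._any(type(self).INT8_KV_CACHE)


# Flags combine with bitwise OR; each predicate inspects only its own bit.
mode = SketchQuantMode.PER_GROUP | SketchQuantMode.INT8_KV_CACHE
print(mode.has_per_group_scaling())                         # True
print(SketchQuantMode.PER_CHANNEL.has_per_group_scaling())  # False
```

Under these assumptions each has_* method is a one-line bit test, so callers can branch on a single combined mode value rather than carrying separate booleans.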
Callers (15)
to_dict · Method · 0.95
build_gpt · Function · 0.80
build · Function · 0.80
get_quant_method · Function · 0.80
has_int8_woq_per_channel · Method · 0.80
__init__ · Method · 0.80
__init__ · Method · 0.80
forward · Method · 0.80
forward · Method · 0.80
forward · Method · 0.80
forward · Method · 0.80
quantize · Function · 0.80
Calls (1)
_any · Method · 0.95
Tested by
no test coverage detected