github.com/NVIDIA/TensorRT-LLM
File: llm_quantization.py
examples/llm-api/_tensorrt_engine/llm_quantization.py
Source
from the content-addressed store, hash-verified
### Generation with Quantization
import logging

import torch
Callers
nothing calls this directly
Calls (5)
CalibConfig · Class · 0.90
QuantConfig · Class · 0.85
main · Function · 0.70
append · Method · 0.45
error · Method · 0.45
Tested by
no test coverage detected