hub / github.com/NVIDIA/TensorRT-LLM / llm_quantization.py

File llm_quantization.py

examples/llm-api/_tensorrt_engine/llm_quantization.py

Source from the content-addressed store, hash-verified

### Generation with Quantization
import logging

import torch

Callers

nothing calls this directly

Calls 5

CalibConfig (Class) · 0.90
QuantConfig (Class) · 0.85
main (Function) · 0.70
append (Method) · 0.45
error (Method) · 0.45

Tested by

no test coverage detected