Method has_fp4_kv_cache

tensorrt_llm/quantization/mode.py:170–171

(self)

Source from the content-addressed store, hash-verified

168	        return self._any(self.FP8_KV_CACHE)
169
170	    def has_fp4_kv_cache(self):
171	        return self._any(self.NVFP4_KV_CACHE)
172
173	    def has_kv_cache_quant(self):
174	        return (self.has_int8_kv_cache() or self.has_fp8_kv_cache()
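
The excerpt shows has_fp4_kv_cache delegating to _any with the NVFP4_KV_CACHE flag. The body of _any is not shown here, so the following is only a minimal sketch of how such a check could behave, assuming the mode is a bit-flag enum and that _any tests for a non-zero bitwise intersection. ToyQuantMode and its flag values are hypothetical stand-ins for illustration, not the library's class.

```python
from enum import IntFlag, auto


class ToyQuantMode(IntFlag):
    """Hypothetical stand-in for a quantization mode bit-flag enum."""

    INT8_KV_CACHE = auto()
    FP8_KV_CACHE = auto()
    NVFP4_KV_CACHE = auto()

    def _any(self, bits):
        # Assumed behavior: true if at least one of the requested bits is set.
        return (self & bits) != 0

    def has_fp4_kv_cache(self):
        # Mirrors the excerpt: ask whether the NVFP4 KV-cache bit is set.
        return self._any(self.NVFP4_KV_CACHE)


# Combine flags with bitwise OR, then query the predicate.
mode = ToyQuantMode.FP8_KV_CACHE | ToyQuantMode.NVFP4_KV_CACHE
fp4_enabled = mode.has_fp4_kv_cache()
int8_only_has_fp4 = ToyQuantMode.INT8_KV_CACHE.has_fp4_kv_cache()
```

Under these assumptions, the predicate depends only on the single flag bit, so combining it with other KV-cache flags does not change the answer.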

has_kv_cache_quant · Method · 0.95

_attn_impl · Method · 0.80

update_quant_config · Method · 0.80

__init__ · Method · 0.80

_create_kv_cache_manager · Function · 0.80

get_cache_size_per_token · Method · 0.80

get_kv_cache_dtype_byte_size · Method · 0.80

kv_dtype · Method · 0.80

_any · Method · 0.95

no test coverage detected