MCPcopy Index your code
hub / github.com/NVIDIA/TensorRT-LLM / input_buffers

Method input_buffers

tensorrt_llm/lora_manager.py:1239–1269  ·  view source on GitHub ↗
(self, lora_uids, mapping: Mapping, num_layers: int)

Source from the content-addressed store, hash-verified

1237 save_val(all_configs, uid_path, "lora_config", tp_num=None, write_npy=True)
1238
1239 def input_buffers(self, lora_uids, mapping: Mapping, num_layers: int):
1240 inputs = {}
1241 for layer_idx in mapping.pp_layers(num_layers):
1242 for lora_module in self.lora_target_modules + self.missing_qkv_modules:
1243 lora_ranks_ = []
1244 lora_ptrs_ = []
1245 for lora_uid in lora_uids:
1246 lora_rank = 0
1247 lora_ptrs = [0, 0, 0]
1248
1249 if lora_uid != "-1":
1250 low_ranks = self.uid_to_low_ranks(lora_uid)
1251
1252 if (
1253 layer_idx in low_ranks
1254 and lora_module in low_ranks[layer_idx].keys()
1255 and low_ranks[layer_idx][lora_module] != 0
1256 ):
1257 lora_rank = low_ranks[layer_idx][lora_module]
1258 lora_ptrs = self.lora_weights_pointers_list[lora_uid][layer_idx][
1259 lora_module
1260 ]
1261
1262 lora_ranks_.append(lora_rank)
1263 lora_ptrs_.append(lora_ptrs)
1264
1265 inputs[f"{lora_module}_lora_ranks_{layer_idx}"] = torch.IntTensor(lora_ranks_)
1266 inputs[f"{lora_module}_lora_weights_pointers_{layer_idx}"] = torch.LongTensor(
1267 lora_ptrs_
1268 )
1269 return inputs

Callers 2

encoder_runMethod · 0.80
setupMethod · 0.80

Calls 4

uid_to_low_ranksMethod · 0.95
pp_layersMethod · 0.80
keysMethod · 0.45
appendMethod · 0.45

Tested by

no test coverage detected