You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/llama/model.py", line 529, in quantize
convert.quantize(hf_model_dir,
File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/llama/convert.py", line 1146, in quantize
weights = load_weights_from_hf_model(
File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/llama/convert.py", line 1059, in load_weights_from_hf_model
lm_head_weights = get_weight(model_params, param_name_map["lm_head"], dtype)
File "/usr/local/lib/python3.10/dist-packages/tensorrt_llm/models/llama/convert.py", line 211, in get_weight
if named_params[prefix + '.weight'].dtype != dtype:
KeyError: 'lm_head.weight'
additional notes
using pdb , shows model_params ,many value is none, for example
System Info
x86_64, ubuntu, A100
Who can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Expected behavior
no error
actual behavior
additional notes
using pdb , shows model_params ,many value is none, for example
The text was updated successfully, but these errors were encountered: