Update on the development branch #2334
kaiyux announced in Announcements
Hi,
The TensorRT-LLM team is pleased to announce that we pushed an update to the development branch (and the Triton backend) on Oct 15, 2024.
This update includes:
- examples/commandr/README.md.
- examples/falcon/README.md, thanks to the contribution from @puneeshkhanna in "Add support for falcon2" #1926.
- LLM API for Mamba2.
- examples/exaone/README.md.
- examples/medusa/README.md.
- gptManagerBenchmark tool.
- builder_force_num_profiles in trtllm-build command to env var.
- use_fused_mlp when constructing BuildConfig from dict, thanks for the fix from @ethnzhng in "Include use_fused_mlp when constructing BuildConfig from dict" #2081.
- numNewTokensCumSum. ("[Bug] Lookahead decoding is nondeterministic and wrong after the first call to runner.generate" #2263)
Thanks,
The TensorRT-LLM Engineering Team