Skip to content

Issues: NVIDIA/TensorRT-LLM

Beta
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Install from docs not working bug Something isn't working Installation triaged Issue has been triaged by maintainers
#4099 opened May 6, 2025 by darraghdog
4 tasks
LLama 3.1 405B FP8 perf low on B200 bug Something isn't working
#4043 opened May 3, 2025 by andyluo7
4 tasks
Deepseek R1 and V3, FP4 quant, output quality issues at batch size > 2 bug Something isn't working triaged Issue has been triaged by maintainers
#4037 opened May 2, 2025 by pankajroark
2 of 4 tasks
[bug] Lookahead spec-dec verifies w guesses instead of g bug Something isn't working
#4003 opened Apr 30, 2025 by mahmoudhas
3 of 4 tasks
Qserve-w4a8 Shows Lower Computational Efficiency on H20 not a bug Some known limitation, but not a bug.
#3977 opened Apr 30, 2025 by StaryDing
2 of 4 tasks
Reasoning-related enhancements feature request New feature or request roadmap
#3965 opened Apr 29, 2025 by mk-nvidia
Disaggregated Prefill & Decode serving optimizations Investigating Performance Issue about performance number roadmap triaged Issue has been triaged by maintainers
#3963 opened Apr 29, 2025 by mk-nvidia
MoE optimizations Investigating Performance Issue about performance number roadmap triaged Issue has been triaged by maintainers
#3962 opened Apr 29, 2025 by mk-nvidia
Support versioned github.io doc to make it easy to map code with the corresponding doc version Documentation Improvements or additions to documentation Investigating roadmap triaged Issue has been triaged by maintainers
#3961 opened Apr 29, 2025 by mk-nvidia
Re-organize the example directory into Feature level examples and Model level examples Documentation Improvements or additions to documentation Investigating roadmap triaged Issue has been triaged by maintainers
#3960 opened Apr 29, 2025 by mk-nvidia
Plenty of regressions in trt-llm v0.20.0 bug Something isn't working
#3955 opened Apr 29, 2025 by michaelfeil
4 tasks
Context node crash when using PD Disaggregation bug Something isn't working
#3937 opened Apr 29, 2025 by nsealati
2 of 4 tasks
ProTip! Updated in the last three days: updated:>2025-05-04.