Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2025
#15735 opened Mar 29, 2025 by simon-mo
Open 9
[Usage] Qwen3 Usage Guide
#17327 opened Apr 28, 2025 by simon-mo
Open 43
Beta
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Bug]: fp8 w8a8 quantized Qwen2.5-VL hits AssertionError bug Something isn't working
#17595 opened May 2, 2025 by cebtenzzre
1 task done
[Bug]: vLLM pre-commit hook doesn't work with git worktree bug Something isn't working
#17592 opened May 2, 2025 by zou3519
[Bug]: Cannot load Gemma3 27b QAT GGUF on RTX 5090 bug Something isn't working
#17587 opened May 2, 2025 by FremyCompany
1 task done
[Bug]: Qwen3 FP8 on 0.8.5: type fp8e4nv not supported in this architecture. bug Something isn't working
#17581 opened May 2, 2025 by AlexBefest
1 task done
[Feature]: support for fp8 marlin with MoE feature request New feature or request
#17579 opened May 2, 2025 by ehartford
1 task done
[Bug]: Function calling does not work with Mistral Small bug Something isn't working
#17557 opened May 1, 2025 by menardorama
1 task done
[Bug]: top_k: 0 in generation_config.json can't disable top-k sampling bug Something isn't working
#17553 opened May 1, 2025 by toslunar
1 task done
[Bug]: failed to run LMCache example for v0 bug Something isn't working
#17545 opened May 1, 2025 by gaowayne
1 task done
[Bug]: cached_get_processor is not actually cached bug Something isn't working
#17543 opened May 1, 2025 by Zazzle516
1 task done
[Performance]: Performance comparison for v1 engine and v0 engine performance Performance-related issues
#17540 opened May 1, 2025 by hustxiayang
1 task done
[Bug]: Bad requests are not captured as traces bug Something isn't working
#17528 opened May 1, 2025 by frzifus
1 task done
[Bug]: Training with vllm not supports Qwen3 bug Something isn't working
#17527 opened May 1, 2025 by fly-dragon211
1 task done
ProTip! Find all open issues with in progress development work with linked:pr.