Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobodyLoading
Sort

Pull requests list

tuned fused configs for B300
#30629 opened Dec 14, 2025 by navmarri14Loading…
[MoE][Refactor 1/N] Separate Online Quantization
#30627 opened Dec 13, 2025 by robertgshaw2-redhatLoading…
5 tasks
[docker] Restructure Dockerfile for more efficient and cache-friendly builds ci/build documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#30626 opened Dec 13, 2025 by amrmahdiLoading…
[CI/Build] Ignore max transformers version skipping for initialization tests ready ONLY add when PR is ready to merge/full CI is needed
#30619 opened Dec 13, 2025 by Isotr0pyLoading…
1 of 5 tasks
[Docs] Add FlashInfer environment variables to env_vars documentation documentation Improvements or additions to documentation
#30616 opened Dec 13, 2025 by majiayu000Loading…
2 tasks done
[Feature] Default EPLB num_redundant_experts to minimum valid value
#30614 opened Dec 13, 2025 by majiayu000Loading…
2 tasks done
[Chore] Remove redundant RequestPrompt frontend ready ONLY add when PR is ready to merge/full CI is needed
#30612 opened Dec 13, 2025 by DarkLight1337Loading…
5 tasks
[ROCm][Perf] Replace cat to bmm's inplace write when aiter enabled rocm Related to AMD ROCm v1
#30611 opened Dec 13, 2025 by ganyi1996ppoLoading…
5 tasks
Fix incorrect dimension in reduce_scatter nvidia
#30610 opened Dec 13, 2025 by RKai025Loading…
[FixBug]fix gpt-oss v1/completions response bug frontend gpt-oss Related to GPT-OSS models tool-calling
#30608 opened Dec 13, 2025 by princeprideLoading…
3 of 5 tasks
[Bugfix] Improve DCP error hint in cp_utils v1
#30607 opened Dec 13, 2025 by jliu9515Loading…
3 of 5 tasks
[Bugfix] Fix ScalarType NanRepr enum comparisons
#30605 opened Dec 13, 2025 by NoonePausefergLoading…
3 of 5 tasks
[LoRA] Set default MXFP4 LoRA backend to Marlin
#30598 opened Dec 13, 2025 by xyang16Loading…
5 tasks
[docs][fix] Update Arm CPU vLLM wheel installation docs documentation Improvements or additions to documentation
#30594 opened Dec 13, 2025 by fadara01Loading…
5 tasks
[Misc] Improve error messages for unsupported types and parameters kv-connector nvidia performance Performance-related issues
#30593 opened Dec 13, 2025 by BlankRHLoading…
3 of 5 tasks
[ROCm][CI] Add "Qwen3-Next-80B-A3B-Instruct MTP Async EPLB Accuracy Test" Back Into AMD CI ci/build qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#30590 opened Dec 13, 2025 by micah-wilLoading…
ProTip!no:milestone will show everything without a milestone.