-
-
Notifications
You must be signed in to change notification settings - Fork 12.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model] Support DeepSeek-OCR-2
deepseek
Related to DeepSeek models
new-model
Requests to new models
#33165
opened Jan 27, 2026 by
LiuLi1998
Loading…
5 tasks
[Bugfix] Disable CG for Whisper+FA2
bug
Something isn't working
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#33164
opened Jan 27, 2026 by
NickLucche
Loading…
Fix weight mapping test for Transfomers v5
multi-modality
Related to multi-modality (#4194)
ready
ONLY add when PR is ready to merge/full CI is needed
#33162
opened Jan 27, 2026 by
hmellor
Loading…
fix:paddle ocr infinite inference bug
bug
Something isn't working
v1
#33160
opened Jan 27, 2026 by
bellkjtt
Loading…
5 tasks
[Feature]: Container image WORKDIR consistency
ci/build
cpu
Related to CPU backends
#33159
opened Jan 27, 2026 by
SouthWest7
Loading…
5 tasks
[Release] [CI] Optim release pipeline
ci/build
rocm
Related to AMD ROCm
#33156
opened Jan 27, 2026 by
tjtanaa
Loading…
5 tasks
[CI] Split responses API MCP tests into separate CI step
ci/build
#33154
opened Jan 27, 2026 by
pacoxu
Loading…
5 tasks
[PluggableLayer][2/N] Apply PluggableLayer to linear layers
#33152
opened Jan 27, 2026 by
whx-sjtu
Loading…
5 tasks
[CI] minor fixes to pipeline generator and tests
ci/build
ready-run-all-tests
Trigger CI with all tests for wide-ranging PRs
#33151
opened Jan 27, 2026 by
khluu
Loading…
[CI] fix(tests): avoid Open-Meteo API timeout in test_function_calling_with_stream
#33150
opened Jan 27, 2026 by
pacoxu
Loading…
5 tasks
[BugFix] Fix minimax_m2 tool call parser for stream_interval > 1
bug
Something isn't working
tool-calling
#33149
opened Jan 27, 2026 by
MrIceCreamMan
Loading…
3 of 5 tasks
[Bugfix] Fix xgrammar cleanup leakage
bug
Something isn't working
structured-output
v1
#33148
opened Jan 27, 2026 by
DamonJiang777
Loading…
5 tasks
[DOC] [ROCm] Update docker deployment doc
documentation
Improvements or additions to documentation
nvidia
rocm
Related to AMD ROCm
#33146
opened Jan 27, 2026 by
vllmellm
Loading…
5 tasks
[Frontend] Frontend will only attach supported tasks corresponding entrypoints.
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#33139
opened Jan 27, 2026 by
noooop
Loading…
5 tasks
[code clean] remove duplicated code
frontend
#33138
opened Jan 27, 2026 by
andyxning
Loading…
5 tasks
[Quantization][Refactor] use platform dict to choose kernel
#33130
opened Jan 27, 2026 by
zufangzhu
Loading…
[release] Minor fixes to release annotation and wheel upload
ci/build
#33129
opened Jan 27, 2026 by
khluu
Loading…
[Kernel] adding native nccl4py support
ci/build
kv-connector
nvidia
performance
Performance-related issues
#33127
opened Jan 27, 2026 by
pkousha
Loading…
[Core] Optimize SWA KV cache management for prefix caching
v1
#33125
opened Jan 27, 2026 by
jaewonlee-fb
Loading…
[Model] GPT-OSS: Use layer_types config for sliding window selection
gpt-oss
Related to GPT-OSS models
#33124
opened Jan 26, 2026 by
jaewonlee-fb
Loading…
[CPU][Feat] Enable KleidiAI accelerated int4 dynamic quant with BF16 activations on Arm CPUs
#33122
opened Jan 26, 2026 by
fadara01
Loading…
2 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2026-01-24.