
Pull requests: vllm-project/vllm

Pull requests list

[Misc] add code to get git hash info for vllm
#5482 opened Jun 13, 2024 by dhuangnm
[CI/Build] Enable LLaVA test in CPU
#5481 opened Jun 13, 2024 by DarkLight1337
Separate dev requirements into lint and test
#5474 opened Jun 12, 2024 by Yard1
Add cuda_device_count_stateless
#5473 opened Jun 12, 2024 by Yard1
[MISC] Remove FP8 warning
#5472 opened Jun 12, 2024 by comaniac
[Doc] Update documentation on Tensorizer
#5471 opened Jun 12, 2024 by sangstar
[ci] Try building wheels and upload as artifact
#5469 opened Jun 12, 2024 by khluu
[Model] Bert Embedding Model
#5447 opened Jun 12, 2024 by laishzh Draft
[Bugfix] Avoid to warmup when world size is 1
#5442 opened Jun 12, 2024 by kerthcet
[Kernel] Add punica dimension for Qwen2 LoRA
#5441 opened Jun 12, 2024 by jinzhen-lin
[Doc] Update LLaVA docs
#5437 opened Jun 12, 2024 by DarkLight1337
compressed-tensors marlin 24 support
#5435 opened Jun 12, 2024 by dsikka Draft
[ Misc ] Rs/compressed tensors cleanup
#5432 opened Jun 12, 2024 by robertgshaw2-neuralmagic
[Bugfix]Fix evict v2 with long context length
#5411 opened Jun 11, 2024 by puf147