| aa1e77a19c | [Hardware][CPU] Support MOE models on x86 CPU (#11831); Signed-off-by: jiang1.li <jiang1.li@intel.com> | 2025-01-10 11:07:58 -05:00 |
| 482cdc494e | [Doc] Rename offline inference examples (#11927); Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> | 2025-01-10 23:50:29 +08:00 |
| 12664ddda5 | [Doc] [1/N] Initial guide for merged multi-modal processor (#11925); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-10 14:30:25 +00:00 |
| d85c47d6ad | Replace "online inference" with "online serving" (#11923); Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> | 2025-01-10 12:05:56 +00:00 |
| 3de2b1eafb | [Doc] Show default pooling method in a table (#11904); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-10 11:25:20 +08:00 |
| c3cf54dda4 | [Doc][5/N] Move Community and API Reference to the bottom (#11896); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>; Co-authored-by: Simon Mo <simon.mo@hey.com> | 2025-01-10 03:10:12 +00:00 |
| 36f5303578 | [Docs] Add Modal to deployment frameworks (#11907) | 2025-01-09 23:26:37 +00:00 |
| 9a228348d2 | [Misc] Provide correct Pixtral-HF chat template (#11891); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-09 10:19:37 -07:00 |
| 65097ca0af | [Doc] Add model development API Reference (#11884); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-09 09:43:40 +00:00 |
| a732900efc | [Doc] Intended links Python multiprocessing library (#11878) | 2025-01-09 05:39:39 +00:00 |
| 730e9592e9 | [Doc] Recommend uv and python 3.12 for quickstart guide (#11849); Signed-off-by: mgoin <michael@neuralmagic.com> | 2025-01-09 11:37:48 +08:00 |
| 5984499e47 | [Doc] Expand Multimodal API Reference (#11852); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-08 17:14:14 +00:00 |
| 6cd40a5bfe | [Doc][4/N] Reorganize API Reference (#11843); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-08 21:34:44 +08:00 |
| aba8d6ee00 | [Doc] Move examples into categories (#11840); Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> | 2025-01-08 13:09:53 +00:00 |
| cfd3219f58 | [Hardware][Apple] Native support for macOS Apple Silicon (#11696); Signed-off-by: Wallas Santos <wallashss@ibm.com>; Co-authored-by: Michael Goin <michael@neuralmagic.com> | 2025-01-08 16:35:49 +08:00 |
| a1b2b8606e | [Docs] Update sponsor name: 'Novita' to 'Novita AI' (#11833) | 2025-01-07 23:05:46 -08:00 |
| ad9f1aa679 | [doc] update wheels url (#11830); Signed-off-by: youkaichao <youkaichao@gmail.com> | 2025-01-08 14:36:49 +08:00 |
| 259abd8953 | [Docs] reorganize sponsorship page (#11639); Signed-off-by: simon-mo <simon.mo@hey.com> | 2025-01-07 21:16:08 -08:00 |
| 5950f555a1 | [Doc] Group examples into categories (#11782); Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> | 2025-01-08 09:20:12 +08:00 |
| 973f5dc581 | [Doc]Add documentation for using EAGLE in vLLM (#11417); Signed-off-by: Sourashis Roy <sroy@roblox.com> | 2025-01-07 19:19:12 +00:00 |
| c0efe92d8b | [Doc] Add note to gte-Qwen2 models (#11808); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-07 21:50:58 +08:00 |
| d9fa1c05ad | [doc] update how pip can install nightly wheels (#11806); Signed-off-by: youkaichao <youkaichao@gmail.com> | 2025-01-07 21:42:58 +08:00 |
| 2de197bdd4 | [V1] Support audio language models on V1 (#11733); Signed-off-by: Roger Wang <ywang@roblox.com> | 2025-01-07 19:47:36 +08:00 |
| 869e829b85 | [doc] add doc to explain how to use uv (#11773); Signed-off-by: youkaichao <youkaichao@gmail.com>; Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> | 2025-01-07 18:41:17 +08:00 |
| 8082ad7950 | [V1][Doc] Update V1 support for LLaVa-NeXT-Video (#11798); Signed-off-by: Roger Wang <ywang@roblox.com> | 2025-01-07 09:55:39 +00:00 |
| ce1917fcf2 | [Doc] Create a vulnerability management team (#9925); Signed-off-by: Russell Bryant <rbryant@redhat.com> | 2025-01-06 22:57:32 -08:00 |
| 8ceffbf315 | [Doc][3/N] Reorganize Serving section (#11766); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-07 11:20:01 +08:00 |
| 91b361ae89 | [V1] Extend beyond image modality and support mixed-modality inference with Llava-OneVision (#11685); Signed-off-by: Roger Wang <ywang@roblox.com>; Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>; Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-06 19:58:16 +00:00 |
| 4ca5d40adc | [doc] explain how to add interleaving sliding window support (#11771); Signed-off-by: youkaichao <youkaichao@gmail.com> | 2025-01-06 21:57:44 +08:00 |
| ee77fdb5de | [Doc][2/N] Reorganize Models and Usage sections (#11755); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-06 21:40:31 +08:00 |
| 2a622d704a | k8s-config: Update the secret to use stringData (#11679); Signed-off-by: Suraj Deshmukh <surajd.service@gmail.com> | 2025-01-06 08:01:22 +00:00 |
| 402d378360 | [Doc] [1/N] Reorganize Getting Started section (#11645); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2025-01-06 02:18:33 +00:00 |
| d1d49397e7 | Update bnb.md with example for OpenAI (#11718) | 2025-01-04 06:29:02 +00:00 |
| 9c93636d84 | Update tool_calling.md (#11701) | 2025-01-04 06:16:30 +00:00 |
| 2f1e8e8f54 | Update default max_num_batch_tokens for chunked prefill (#11694) | 2025-01-03 00:25:53 +00:00 |
| 84c35c374a | According to vllm.EngineArgs, the name should be distributed_executor_backend (#11689) | 2025-01-02 18:14:16 +00:00 |
| 365801fedd | [VLM] Add max-count checking in data parser for single image models (#11661); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>; Signed-off-by: Roger Wang <ywang@roblox.com>; Co-authored-by: Roger Wang <ywang@roblox.com> | 2024-12-31 22:15:21 -08:00 |
| e7c7c5e822 | [V1][VLM] V1 support for selected single-image models. (#11632); Signed-off-by: Roger Wang <ywang@roblox.com>; Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>; Signed-off-by: Isotr0py <2037008807@qq.com>; Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>; Co-authored-by: Isotr0py <2037008807@qq.com> | 2024-12-31 21:17:22 +00:00 |
| a2a40bcd0d | [Model][LoRA]LoRA support added for MolmoForCausalLM (#11439); Signed-off-by: Matthias Vogler <matthias.vogler@joesecurity.org>; Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>; Co-authored-by: Matthias Vogler <matthias.vogler@joesecurity.org>; Co-authored-by: Jee Jee Li <pandaleefree@gmail.com> | 2024-12-30 17:33:06 -08:00 |
| b12e87f942 | [platforms] enable platform plugins (#11602); Signed-off-by: youkaichao <youkaichao@gmail.com> | 2024-12-30 20:24:45 +08:00 |
| 32b4c63f02 | [Doc] Convert list tables to MyST (#11594); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2024-12-29 15:56:22 +08:00 |
| 328841d002 | [bugfix] interleaving sliding window for cohere2 model (#11583); Signed-off-by: youkaichao <youkaichao@gmail.com> | 2024-12-28 16:55:42 +00:00 |
| d427e5cfda | [Doc] Minor documentation fixes (#11580); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2024-12-28 21:53:59 +08:00 |
| d34be24bb1 | [Model] Support InternLM2 Reward models (#11571); Signed-off-by: Isotr0py <2037008807@qq.com>; Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> | 2024-12-28 06:14:10 +00:00 |
| 101418096f | [VLM] Support caching in merged multi-modal processor (#11396); Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> | 2024-12-27 17:22:48 +00:00 |
| 5ce4627a7e | [Doc] Add xgrammar in doc (#11549); Signed-off-by: ccjincong <chenjincong11@gmail.com> | 2024-12-27 13:05:10 +00:00 |
| d003f3ea39 | Update deploying_with_k8s.md with AMD ROCm GPU example (#11465); Signed-off-by: Alex He <alehe@amd.com>; Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> | 2024-12-27 10:00:04 +00:00 |
| 0c0c2015c5 | Update openai_compatible_server.md (#11536); Co-authored-by: Simon Mo <simon.mo@hey.com> | 2024-12-26 16:26:18 -08:00 |
| 82d24f7aac | [Docs] Document Deepseek V3 support (#11535); Signed-off-by: simon-mo <simon.mo@hey.com> | 2024-12-26 16:21:56 -08:00 |
| b85a977822 | [Doc] Add video example to openai client for multimodal (#11521); Signed-off-by: Isotr0py <2037008807@qq.com>; Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> | 2024-12-26 17:31:29 +00:00 |