youngkingdom/vllm - vllm

Author	SHA1	Message	Date
Andrew Sansom	d4006bd84d	[docs] Prompt Embedding feature support (#25288 ) Signed-off-by: Andrew Sansom <andrew@protopia.ai> Signed-off-by: yewentao256 <zhyanwentao@126.com>	2025-10-03 13:35:53 -07:00
Harry Mellor	12aed7e453	Encoder model support for the Transformers backend (#25174 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-19 19:15:22 +01:00
Harry Mellor	058525b997	Move `PoolerConfig` from `config/__init__.py` to `config/pooler.py` (#25181 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-19 11:02:55 +00:00
wang.yuqi	5f696c33b1	[New Model] Support BertForTokenClassification / Named Entity Recognition (NER) task (#24872 ) Signed-off-by: wang.yuqi <noooop@126.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-18 23:22:01 +08:00
Roger Wang	21da73343a	[Misc] Clean up flags in `vllm bench serve` (#25138 ) Signed-off-by: Roger Wang <hey@rogerw.io>	2025-09-18 12:43:33 +00:00
Kay Yan	eaffe4486c	[Docs] Fix pooling-params doc references in openai_compatible_server.md (#24939 )	2025-09-18 04:36:47 -07:00
Aaron Pham	29283e8976	[Chore] Cleanup guided namespace, move to structured outputs config (#22772 ) Signed-off-by: Aaron Pham <contact@aarnphm.xyz> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-18 09:20:27 +00:00
YiwenC	52bc9d5b3e	[Model] enable data parallel for InternVL vision encoder (#23909 ) Signed-off-by: Yiwen Chen <yiwen66@berkeley.edu> Signed-off-by: YiwenC <54658925+666even666@users.noreply.github.com> Co-authored-by: Roger Wang <hey@rogerw.io>	2025-09-17 21:11:46 -07:00
Harry Mellor	32baf1d036	[Docs] Clean up the contributing README (#25099 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-17 21:05:18 -07:00
afeldman-nm	7ae9887542	[V1] Logits processor docs (#22919 ) Signed-off-by: Andrew Feldman <afeldman@redhat.com> Signed-off-by: afeldman-nm <156691304+afeldman-nm@users.noreply.github.com> Co-authored-by: Joseph Marinier <Joseph.Marinier@gmail.com>	2025-09-17 11:53:12 -07:00
Roger Wang	0f7acdd73c	[Model] Support Qwen3-VL Model Series (#24727 ) Signed-off-by: Roger Wang <hey@rogerw.io> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Huang Jie <92386084+JJJYmmm@users.noreply.github.com> Co-authored-by: 松灵 <26085463+wulipc@users.noreply.github.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-17 05:01:04 +00:00
yyzxw	5672ba90bd	[Docs] fix invalid doc link (#25017 ) Signed-off-by: zxw <1020938856@qq.com>	2025-09-16 20:53:23 -07:00
Isotr0py	5a411ef6c4	[Benchmarks] Add MMVU video dataset support and clean up deprecated datasets (#24719 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-17 03:29:43 +00:00
elvischenv	3059b9cc6b	[Doc] Add --force-overwrite option to generate_cmake_presets.py (#24375 ) Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>	2025-09-16 18:45:29 -07:00
Benjamin Bartels	64ad551878	Removes source compilation of nixl dependency (#24874 ) Signed-off-by: bbartels <benjamin@bartels.dev> Signed-off-by: Benjamin Bartels <benjamin@bartels.dev> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Daniele <36171005+dtrifiro@users.noreply.github.com>	2025-09-17 01:33:18 +00:00
TeeKen Lau	e4f0b4cd96	(doc): set cmake c++ compatible standard when building on MacOS CPU. (#23483 ) Signed-off-by: teekenl <teekenlau@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-16 06:08:46 -07:00
Ye (Charlotte) Qi	85e0df1392	[Docs] move benchmarks README to contributing guides (#24820 )	2025-09-16 05:52:57 -07:00
Woosuk Kwon	759ef49b15	Remove V0 Encoder-Decoder Support (#24907 ) Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>	2025-09-15 21:17:14 -07:00
Richard Zou	e1279ef00f	[Docs] Update instructions for how to using existing torch binary (#24892 ) Signed-off-by: Richard Zou <zou3519@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-16 02:25:50 +00:00
Sergio Paniego Blanco	7f6f2c1182	`HuggingFace` -> `Hugging Face` in `Integration with Hugging Face` docs (#24889 )	2025-09-15 17:28:35 -07:00
ant-yy	72c99f2a75	[Model]: support Ling2.0 (#24627 ) Signed-off-by: vito.yy <vito.yy@antgroup.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-15 05:09:30 -07:00
wang.yuqi	bf214ca226	[Misc] Fix examples openai_pooling_client.py (#24853 ) Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-15 11:57:30 +00:00
Michael Yao	78818dd1b0	[Docs] Have a try to improve frameworks/streamlit.md (#24841 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-09-14 21:50:36 -07:00
Rakesh Asapanna	30498f2a65	[Doc]: Remove 404 hyperlinks (#24785 ) Signed-off-by: Rakesh Asapanna <45640029+rozeappletree@users.noreply.github.com>	2025-09-13 00:15:41 -07:00
Harry Mellor	abc7989adc	[Docs] Remove Neuron install doc as backend no longer exists (#24396 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-13 00:15:03 -07:00
Shane A	89e08d6d18	[Model] Add Olmo3 model implementation (#24534 ) Signed-off-by: Shane A <shanea@allenai.org> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-13 03:26:21 +00:00
Chenheli Hua	7f2ea7074e	[Frontend][Multimodal] Allow skipping media data when UUIDs are provided. (#23950 ) Signed-off-by: Roger Wang <hey@rogerw.io> Signed-off-by: Chenheli Hua <huachenheli@outlook.com> Signed-off-by: Roger Wang <hey@rogerw.me> Co-authored-by: Roger Wang <hey@rogerw.io> Co-authored-by: Roger Wang <hey@rogerw.me>	2025-09-13 02:16:06 +00:00
dongluw	a5b84f1cbf	[Core] Shared memory based object store for Multimodal data caching and IPC (#20452 ) Signed-off-by: donglu <donglu@cohere.com>	2025-09-12 07:54:17 -07:00
Didier Durand	bcb06d7baf	[Doc]: fix typos in various files (#24726 ) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-09-12 06:43:12 -07:00
Gregory Shtrasberg	6a50eaa0d3	[DOCs] Update ROCm installation docs section (#24691 ) Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>	2025-09-11 20:02:53 -07:00
Harry Mellor	361ae27f8a	[Docs] Fix formatting of transcription doc (#24676 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-11 11:18:06 -07:00
Harry Mellor	51d41265ad	[Docs] Fix typos in EP deployment doc (#24669 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-11 09:07:23 -07:00
Wentao Ye	4984a291d5	[Doc] Fix Markdown Pre-commit Error (#24670 ) Signed-off-by: yewentao256 <zhyanwentao@126.com>	2025-09-11 09:05:59 -07:00
Nicolò Lucchesi	404c85ca72	[Docs] Add transcription support to model (#24664 ) Signed-off-by: NickLucche <nlucches@redhat.com>	2025-09-11 07:39:01 -07:00
youkaichao	f510715882	[build] add torch to tool.uv no-build-isolation-package (#24303 ) Signed-off-by: youkaichao <youkaichao@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-11 13:19:44 +00:00
Tao He	f946197473	[Docs] Fixes a typo in the qwen3next model name. (#24654 ) Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>	2025-09-11 19:35:14 +08:00
Michael Yao	d14c4ebf08	[Docs] Use 1-2-3 list for deploy steps in deployment/frameworks/ (#24633 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-09-11 01:50:12 -07:00
Russell Bryant	ba6011027d	[Docs] Update V1 doc to reflect whisper support (#24606 ) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-09-11 01:50:08 -07:00
Michael Yao	85df8afdae	[Docs] Revise frameworks/anything-llm.md (#24489 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-09-11 01:50:05 -07:00
Tao He	e93f4cc9e3	Add the support for the qwen3 next model (a hybrid attention model). (#24526 ) Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-09-11 15:32:09 +08:00
TaehyunKim	9bd831f501	[Model] New model support for Motif-1-Tiny (#23414 ) Signed-off-by: ca1207 <ca1207zzz@gmail.com> Signed-off-by: TaehyunKim <73943231+ca1207@users.noreply.github.com> Co-authored-by: WyldeCat <skan1543@gmail.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-09-10 23:29:40 -07:00
youkaichao	8c5a747246	[distributed] update known issues (#24624 ) Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-09-11 11:09:38 +08:00
Robin	36cacd0958	[Doc] Add documentation for GLM-4.5 series models: tool-calling and reasoning parser (#24589 ) Signed-off-by: WangErXiao <863579016@qq.com>	2025-09-10 07:50:55 -07:00
Yash Pratap Singh	9e3c3a7df2	[LoRA]: Add LoRA support to Mistral's Voxtral models (#24517 ) Signed-off-by: Yash Pratap Singh <yashsingh20001@gmail.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-09-10 06:12:03 -07:00
Tyler Michael Smith	8b83b93739	[Docs] Document the extra memory footprint overhead when using EPLB (#24537 ) Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>	2025-09-10 06:09:49 -07:00
Harry Mellor	9dbefd88e9	[Docs] Improve organisation of API Reference nav (#24569 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-10 06:08:21 -07:00
Harry Mellor	e40827280b	[Docs] Enable relative links in examples to function when rendered in the docs (#24041 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-09 21:40:45 -07:00
Nicolò Lucchesi	3707cb2505	[Docs] Gemma3n `transcriptions` endpoint support (#24512 ) Signed-off-by: NickLucche <nlucches@redhat.com>	2025-09-09 11:03:32 -07:00
Didier Durand	46876dff32	[Doc]: fixing typos to improve docs (#24480 ) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-09-08 23:06:04 -07:00
Mickaël Seznec	ed16d0f26f	[Doc] mention fpdb for multiprocess breakpoints (#24452 ) Signed-off-by: Mickael Seznec <mickael@mistral.ai>	2025-09-08 21:46:45 -07:00

1462 Commits