youngkingdom/vllm - vllm

Author	SHA1	Message	Date
RED	c9461e05a4	Support Anthropic API /v1/messages Endpoint (#22627 ) Signed-off-by: liuli <ll407707@alibaba-inc.com> Co-authored-by: liuli <ll407707@alibaba-inc.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk> Co-authored-by: Michael Goin <mgoin64@gmail.com>	2025-10-22 09:13:18 -07:00
Russell Bryant	58fab50d82	[Frontend] Require flag for loading text and image embeds (#27204 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-22 15:52:02 +00:00
wang.yuqi	1f633b8632	[Frontend][3/N] Improve all pooling task | Support binary embedding response (#27066 ) Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-10-22 18:38:57 +08:00
ExtReMLapin	a4c29e6e82	fixed reasoning streaming with tool_choice="required" (#24108 ) Signed-off-by: CNE Pierre FICHEPOIL <pierre-1.fichepoil@gendarmerie.interieur.gouv.fr> Signed-off-by: ExtReMLapin <3909752+ExtReMLapin@users.noreply.github.com> Co-authored-by: CNE Pierre FICHEPOIL <pierre-1.fichepoil@gendarmerie.interieur.gouv.fr> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-10-22 09:42:55 +00:00
Russell Bryant	3ada34f9cb	[Frontend] Enforce tokenize=False when applying chat template (#27205 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-10-21 02:57:34 +00:00
iAmir97	7a6c8c3fa1	[Chore] Separate out `vllm.utils.network_utils` (#27164 ) Signed-off-by: iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by: iAmir97 <Amir.balwel@embeddedllm.com>	2025-10-19 03:06:32 -07:00
dongbo910220	83004020fd	[Test] Add test for /health endpoint on engine failure (#26074 ) Signed-off-by: dongbo910220 <1275604947@qq.com>	2025-10-18 09:59:05 +00:00
Hanchenli	7c572544e4	[GPT-OSS] Structure_Tag support for gpt-oss tool-call in cot (#25515 ) Signed-off-by: Hanchenli <lihanc2002@gmail.com> Signed-off-by: Hanchenli <61769611+Hanchenli@users.noreply.github.com> Signed-off-by: Wei Wei <wwei6@meta.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Wei Wei <wwei6@meta.com> Co-authored-by: Wei Wei <weiweinpu@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-10-17 21:55:54 -07:00
Tahsin Tunan	43721bc67f	[CI] Replace large models with tiny alternatives in tests (#24057 ) Signed-off-by: Tahsin Tunan <tahsintunan@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Nick Hill <nhill@redhat.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-16 15:51:27 +01:00
Cyrus Leung	76f0d05bc6	[CI/Build] Update expected beam search output for Phi3V (#26978 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-16 05:12:44 +00:00
InChang Jeong	0ecc553ee6	[Bugfix] reasoning_parser parameter handling in run_batch.py (#26225 ) Signed-off-by: inc-jeong <inc.jeong@navercorp.com> Signed-off-by: InChang Jeong <inc.jeong@navercorp.com> Co-authored-by: USER <user@AL02367916.local>	2025-10-16 10:24:05 +08:00
Pradeep Dasigi	4794c2bd92	Olmo 3 tool parser and tests (#26143 ) Signed-off-by: Pradeep Dasigi <pradeepd@allenai.org>	2025-10-15 16:36:12 +00:00
wang.yuqi	f54f85129e	[Model][2/N] Improve all pooling task | Support multi-vector retrieval (#25370 ) Signed-off-by: wang.yuqi <noooop@126.com>	2025-10-15 11:14:41 +00:00
Cyrus Leung	b8a4572157	[Misc] Use helper function to generate dummy messages in OpenAI MM tests (#26875 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-15 07:17:37 +00:00
Ye Hu	0512c04aee	[frontend][gptoss] Add per turn stats into Harmony Context (#25061 ) Signed-off-by: lacora <hyelacora@gmail.com> Co-authored-by: Ye Hu <yehu@fb.com>	2025-10-14 16:48:13 -07:00
Chauncey	df850c4912	[Feature][Responses API] Stream Function Call - harmony (#24317 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-14 08:31:43 -07:00
Chauncey	780eb03d9b	[CI] Fix test_tool_id_kimi_k2 (#26787 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-14 10:27:07 +00:00
Max Wittig	fd85c9f426	[Bugfix][FE]: Always include usage with `--enable-force-include-usage` (#20983 ) Signed-off-by: Max Wittig <max.wittig@siemens.com> Signed-off-by: Antoine Auger <antoineauger@users.noreply.github.com> Co-authored-by: Antoine Auger <antoineauger@users.noreply.github.com>	2025-10-14 09:17:39 +02:00
Jialin Ouyang	35bc22f23c	[ResponseAPI] Further polish message serialization and unit tests (#26728 ) Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>	2025-10-13 23:31:35 +00:00
wang.yuqi	d2a7938582	[Frontend][1/N] Improve all pooling task | Support FP16 Embedding Base64 (Still uses fp32 by default). (#26414 ) Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Maximilien de Bayser <maxdebayser@gmail.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-10-13 19:06:43 +00:00
Jialin Ouyang	4073c82c4e	[ResponseAPI] Simplify input/output message serialization (#26620 ) Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>	2025-10-13 09:59:15 +00:00
wang.yuqi	767c3ab869	[Model][0/N] Improve all pooling task | clean up (#25817 ) Signed-off-by: wang.yuqi <noooop@126.com>	2025-10-13 16:44:50 +08:00
Harry Mellor	8fcaaf6a16	Update `Optional[x]` -> `x | None` and `Union[x, y]` to `x | y` (#26633 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-12 09:51:31 -07:00
Chauncey	910abdbd08	[Bugfix] fixed top_logprobs: -1 does not appear to work as intended (#26470 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-11 00:41:17 +08:00
Mark McLoughlin	e519281920	[Metrics] Add test for multi-modal cache stats logging (#26588 ) Signed-off-by: Mark McLoughlin <markmc@redhat.com>	2025-10-10 16:00:50 +00:00
Chauncey	1e6848a65d	[CI] fix test_run_batch.py::test_completions - AssertionError (#26578 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-10 22:16:28 +08:00
Chauncey	720d3cd0f0	[CI] fix ruff format (#26579 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-10 03:02:12 -07:00
Ashwin Phadke	ab196edefb	Remove LoRA bias support (#25807 ) Signed-off-by: Ashwin Phadke <ashwinphadke12@rediffmail.com> Signed-off-by: Ashwin Phadke <23502062+ashwin-phadke@users.noreply.github.com> Signed-off-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-10-10 09:50:33 +00:00
Luis Tomas Bolivar	3ee202ea1e	[GPT-OSS] Add support for arrays at tool message content (#25593 ) Signed-off-by: Luis Tomas Bolivar <ltomasbo@redhat.com>	2025-10-10 09:00:45 +00:00
Cyrus Leung	ad430a67ca	[Metrics] Log multi-modal cache stats and fix reset (#26285 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-10 01:45:55 -07:00
Ben Browning	da4455609d	[Chore]: One pythonic tool parser test uses the wrong parser (#26515 ) Signed-off-by: Ben Browning <bbrownin@redhat.com>	2025-10-10 04:03:55 +00:00
Julien Denize	c6187f55f7	Refactor MistralTokenizer (#26358 ) Signed-off-by: Julien Denize <julien.denize@mistral.ai>	2025-10-09 22:48:58 +00:00
Cyrus Leung	4bdf7ac593	[Bugfix] Fix SHM cache initialization (#26427 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-09 02:48:04 -07:00
Cyrus Leung	dc7976dd9f	[Misc] Upgrade more code to Python 3.10 (#26463 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-09 10:43:53 +01:00
Thomas Parnell	31a4b3e6c4	Revert #24446 and #26168 (#26332 ) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>	2025-10-07 16:38:19 -06:00
Cyrus Leung	1e4ecca1d0	[V0 Deprecation] Remove `VLLM_USE_V1` from tests (#26341 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-07 15:42:31 +00:00
Andrew Xia	185d8ed44f	[responsesAPI][bugfix] serialize harmony messages (#26185 ) Signed-off-by: Andrew Xia <axia@meta.com> Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-10-07 07:07:53 +00:00
Harry Mellor	6c04638214	Fix per file ruff ignores related to line length (#26262 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-06 05:12:40 +00:00
wuhang	91ac7f764d	[CI][gpt-oss] Enable python tool tests in CI (#24315 ) Signed-off-by: wuhang <wuhang6@huawei.com>	2025-10-06 04:20:06 +00:00
Harry Mellor	1c0c68202c	Fix per file ruff ignores related to typing (#26254 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 16:37:55 +00:00
Harry Mellor	4e256cadc2	Remove all references to `yapf` as it's no longer used (#26251 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 09:18:11 -07:00
Harry Mellor	d6953beb91	Convert formatting to use `ruff` instead of `yapf` + `isort` (#26247 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 07:06:22 -07:00
Cyrus Leung	a964e5e6c3	[Bugfix] Allow `--skip-tokenizer-init` with `echo and return_token_ids` (#26238 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-05 05:38:53 +00:00
Cyrus Leung	119f00630b	[Renderer] Clean up renderer code (#26216 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-04 17:05:29 +00:00
Yannick Schnider	f05fea1f5e	[Core] Enable decode of context length equal to max model length (#26168 ) Signed-off-by: Yannick Schnider <yannick.schnider1@ibm.com>	2025-10-04 09:59:26 +00:00
Ben Browning	ea25a76c05	[BugFix] Use async Mistral Tokenizer in Chat Completions (#26134 ) Signed-off-by: Ben Browning <bbrownin@redhat.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-10-04 09:42:08 +08:00
Andrew Xia	831b124151	[responsesAPI] add better error messaging for long prompts (#25724 ) Signed-off-by: Andrew Xia <axia@meta.com> Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-10-03 14:33:13 -07:00
Yang Liu	812b7f54a8	[Renderer] Move Processor out of AsyncLLM (#24138 ) Signed-off-by: Yang <lymailforjob@gmail.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-03 11:29:45 +00:00
kyt	2ed3f20dba	[openai] Fix missing tool usage check (system message) (#24768 ) Signed-off-by: kyt <eluban4532@gmail.com>	2025-10-03 18:55:44 +08:00
HUIJONG JEONG	3e70e3d4d5	add(v1): RequestStatesStats to RequestOutput (#24947 ) Signed-off-by: huijjj <huijong.jeong@squeezebits.com>	2025-10-03 08:56:25 +00:00

525 Commits