559ab46ee1
fix: Removes redundant token calculations and updates dependencies
...
Eliminates unnecessary pre-calculation of token limits and recalculation of max tokens
across multiple app runners, simplifying the logic for prompt handling.
Updates tiktoken library from version 0.8.0 to 0.9.0 for improved tokenization performance.
Increases default token limit in TokenBufferMemory to accommodate larger prompt messages.
These changes streamline the token management process and leverage the latest
improvements in the tiktoken library.
Fixes potential token overflow issues and prepares the system for handling larger
inputs more efficiently.
Relates to internal optimization tasks.
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-04-28 15:39:12 +08:00
144f9507f8
feat : add GPT4.1 in the model providers ( #18912 )
2025-04-27 19:31:20 +08:00
2e097a1ac0
add bedrock deepseek-r1 ( #18908 )
2025-04-27 19:30:42 +08:00
024f242251
add bedrock claude-sonnet-3.7 ( #18788 )
2025-04-25 17:35:12 +08:00
b26e20fe34
fix: fix vertex gemini 2.0 flash 001 schema ( #18405 )
...
Co-authored-by: achmad-kautsar <achmad.kautsar@insignia.co.id >
2025-04-19 22:04:13 +08:00
fe1846c437
fix: change gemini-2.0-flash to validate google api #17082 ( #17115 )
2025-03-30 13:04:12 +08:00
413dfd5628
feat: add completion mode and context size options for LLM configuration ( #13325 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-02-07 15:08:53 +08:00
f9515901cc
fix: Azure AI Foundry model cannot be used in the workflow ( #13323 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-02-07 14:52:57 +08:00
3f42fabff8
chore:improve thinking display for llm from xinference and ollama pro… ( #13318 )
2025-02-07 14:29:29 +08:00
1caa578771
chore(*): Update style of thinking ( #13319 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-02-07 14:06:35 +08:00
3eb3db0663
chore: refactor the OpenAICompatible and improve thinking display ( #13299 )
2025-02-07 13:28:46 +08:00
6e5c915f96
feat(model): add deepseek-r1 for openrouter ( #13312 )
2025-02-07 12:39:13 +08:00
2348abe4bf
feat: added a couple of models not defined in vertex ai, that were already … ( #13296 )
2025-02-07 09:11:25 +08:00
f7e7a399d9
feat:add think tag display for xinference deepseek r1 ( #13291 )
2025-02-06 22:04:58 +08:00
16865d43a8
feat: add deepseek models for volcengine provider ( #13283 )
...
Co-authored-by: zhaoqingyu.1075 <zhaoqingyu.1075@bytedance.com >
2025-02-06 18:20:03 +08:00
0d13aee15c
feat:add deepseek r1 think display for ollama provider ( #13272 )
2025-02-06 15:32:10 +08:00
40dd63ecef
Upgrade oracle models ( #13174 )
...
Co-authored-by: engchina <atjapan2015@gmail.com >
2025-02-06 13:24:27 +08:00
6d66d6da15
feat(model_providers): Support deepseek-r1 for Nvidia Catalog ( #13269 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-02-06 13:03:19 +08:00
87763fc234
feat(model_providers): Support deepseek for Azure AI Foundry ( #13267 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-02-06 12:45:48 +08:00
f6c44cae2e
feat(model): add gemini-2.0 model ( #13266 )
2025-02-06 12:28:59 +08:00
da2ee04fce
fix: correct linewrap think display in generic openai api ( #13260 )
...
Signed-off-by: xhe <xw897002528@gmail.com >
2025-02-06 10:53:08 +08:00
7673c36af3
feat(model): add gemini-2.0-flash-thinking-exp-01-21 ( #13230 )
2025-02-06 10:01:00 +08:00
9457b2af2f
feat: added models :gemini 2.0 flash 001 and gemini 2.0 pro exp 02-05 ( #13247 )
2025-02-06 09:58:39 +08:00
7203991032
feat: add parameter "reasoning_effort" and Openai o3-mini ( #13243 )
2025-02-06 09:29:48 +08:00
5a685f7156
feat: add think display for volcengine and generic openapi ( #13234 )
...
Signed-off-by: xhe <xw897002528@gmail.com >
2025-02-06 09:24:40 +08:00
a6a25030ad
fix: updated _position.yaml to include the latest model already integ… ( #13245 )
2025-02-06 09:21:51 +08:00
00458a31d5
feat: added deepseek r1 and v3 to siliconflow ( #13238 )
2025-02-05 21:59:18 +08:00
c6ddf6d6cc
feat(model_providers): Add Groq DeepSeek-R1-Distill-Llama-70b ( #13229 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-02-05 19:15:29 +08:00
34b21b3065
feat: Add o3-mini and o3-mini-2025-01-31 model variants ( #13129 )
...
Co-authored-by: crazywoola <427733928@qq.com >
2025-02-05 17:04:45 +08:00
59ca44f493
chore(model_runtime): Move deepseek ahead in the providers list. ( #13197 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com >
2025-02-05 16:08:28 +08:00
1a2523fd15
feat: bedrock_endpoint_url ( #12838 )
2025-02-05 12:24:24 +08:00
7452032d81
add azure openai api version 2024-12-01-preview ( #13135 )
2025-02-03 11:04:20 +08:00
840729afa5
feat: the think tag display of siliconflow's deepseek r1 ( #13153 )
2025-02-02 21:55:13 +08:00
b09c39c8dc
refactor: avoid to use extra space when finding model by name ( #13043 )
2025-01-30 15:08:29 +08:00
b4b09ddc3c
add tongyi qwen2.5-14b/7b-instruct-1m model ( #13089 )
2025-01-29 11:58:01 +08:00
d44882c1b5
refactor: reduce duplciate code by inheritance ( #13073 )
2025-01-28 10:52:01 +08:00
560c5de1b7
Fixed Novita AI color and added DeepSeek R1 model ( #13074 )
2025-01-28 10:38:54 +08:00
6c31ee36cd
fix qwen-vl blocking mode ( #13052 )
2025-01-27 11:35:23 +08:00
d4be5ef9de
Update Novita AI predefined models ( #13045 )
2025-01-26 09:25:29 +08:00
59b3e672aa
feat: add agent thinking content display of deepseek R1 ( #12949 )
2025-01-24 20:13:42 +08:00
a2f8bce8f5
chore: add Japanese translation: model_providers/bedrock ( #13016 )
2025-01-24 18:43:33 +08:00
28067640b5
fix: wrong zh_Hans translation: Ohio ( #13006 )
2025-01-24 13:41:20 +08:00
da67916843
feat: add glm-4-air-0111 ( #12997 )
...
Co-authored-by: lowell <lowell.hu@zkteco.in >
2025-01-24 10:04:46 +08:00
d167d5b1be
feat(ark): support doubao 1.5 series of models ( #12935 )
2025-01-22 15:25:57 +08:00
e23f4b0265
feat: add gemini-2.0-flash-thinking-exp-01-21 ( #12924 )
2025-01-22 10:14:37 +08:00
3d1ce4c53f
bug: fixed bedrock rerank bug ( #12774 )
...
Co-authored-by: hobo.l <hobo.l@binance.com >
2025-01-21 19:09:36 +08:00
46e95e8309
fix: OpenAI o1 Bad Request Error ( #12839 )
2025-01-21 15:29:13 +08:00
a7b9375877
Update deepseek model configuration ( #12899 )
2025-01-21 15:28:11 +08:00
9903f1e703
add deepseek-reasoner ( #12898 )
2025-01-21 12:40:58 +08:00
166221d784
chore(lint): fix quotes for f-string formatting by bumping ruff to 0.9.x ( #12702 )
2025-01-21 10:12:29 +08:00