vllm/worker at 00d91c8a2cf3ebaf0f3ea69312f6e3882ed9f372 - vllm - Gitea: Git with a cup of tea

youngkingdom/vllm

Files

History

wangshuai09 3ddbe25502 [Hardware][CPU] using current_platform.is_cpu (#9536 )

2024-10-22 00:50:43 -07:00

..

__init__.py

[Speculative decoding 2/9] Multi-step worker for draft model (#2424 )

2024-01-21 16:31:47 -08:00

test_encoder_decoder_model_runner.py

[Hardware][CPU] using current_platform.is_cpu (#9536 )

2024-10-22 00:50:43 -07:00

test_model_input.py

[Core] Add AttentionState abstraction (#7663 )

2024-08-20 18:50:45 +00:00

test_model_runner.py

[Core] Factor out common code in SequenceData and Sequence (#8675 )

2024-09-21 02:30:39 +00:00

test_profile.py

🐛 fix torch memory profiling (#9516 )

2024-10-18 21:25:19 -04:00

test_swap.py

[Core] Pipeline Parallel Support (#4412 )

2024-07-02 10:58:08 -07:00