vllm/vllm at ac5cf86aa6aebbf9e42df51f7e377fbee85bc703 - vllm - Gitea: Git with a cup of tea

youngkingdom/vllm

Files

History

Wang Ran (汪然) ac5cf86aa6 Fix __repr__ of SequenceOutputs (#1311 )

2023-10-10 09:58:28 -07:00

..

Use monotonic time where appropriate (#1249 )

2023-10-02 19:22:05 -07:00

Use monotonic time where appropriate (#1249 )

2023-10-02 19:22:05 -07:00

API server support ipv4 / ipv6 dualstack (#1288 )

2023-10-07 15:15:54 -07:00

[Minor] Fix comment in mistral.py (#1303 )

2023-10-09 19:44:37 -07:00

transformers_utils

add support for tokenizer revision (#1163 )

2023-10-02 19:19:46 -07:00

Move bfloat16 check to worker (#1259 )

2023-10-07 22:10:44 -07:00

__init__.py

Bump up the version to v0.2.0 (#1212 )

2023-09-28 15:30:38 -07:00

block.py

[Quality] Add code formatter and linter (#326 )

2023-07-03 11:31:55 -07:00

config.py

Move bfloat16 check to worker (#1259 )

2023-10-07 22:10:44 -07:00

logger.py

[Quality] Add code formatter and linter (#326 )

2023-07-03 11:31:55 -07:00

outputs.py

Align vLLM's beam search implementation with HF generate (#857 )

2023-09-04 17:29:42 -07:00

sampling_params.py

[Minor] Fix type annotations (#1238 )

2023-10-02 15:28:31 -07:00

sequence.py

Fix __repr__ of SequenceOutputs (#1311 )

2023-10-10 09:58:28 -07:00

utils.py

Allocate more shared memory to attention kernel (#1154 )

2023-09-26 22:27:13 -07:00