vllm/models at 09e9245478a44faec3c9bc888edea4089085e222

youngkingdom/vllm

Woosuk Kwon 09e9245478 Add custom kernel for RMS normalization (#16)

2023-04-01 00:51:22 +08:00

| File | Last commit | Date |
| --- | --- | --- |
| `__init__.py` | Support tensor parallel (#2) | 2023-03-21 13:45:42 -07:00 |
| `attention.py` | Implement custom kernel for LLaMA rotary embedding (#14) | 2023-03-30 11:04:21 -07:00 |
| `input_metadata.py` | Support tensor parallel (#2) | 2023-03-21 13:45:42 -07:00 |
| `layernorm.py` | Add custom kernel for RMS normalization (#16) | 2023-04-01 00:51:22 +08:00 |
| `llama.py` | Add custom kernel for RMS normalization (#16) | 2023-04-01 00:51:22 +08:00 |
| `memory_analyzer.py` | Implement custom kernel for LLaMA rotary embedding (#14) | 2023-03-30 11:04:21 -07:00 |
| `model_utils.py` | Implement LLaMA (#9) | 2023-03-30 12:25:32 +08:00 |
| `opt.py` | Implement custom kernel for LLaMA rotary embedding (#14) | 2023-03-30 11:04:21 -07:00 |
| `sample.py` | Implement custom kernel for LLaMA rotary embedding (#14) | 2023-03-30 11:04:21 -07:00 |
| `utils.py` | FastAPI-based working frontend (#10) | 2023-03-29 14:48:56 +08:00 |