Files
cutlass/examples/68_hopper_flash_mla/README.md
2025-02-26 01:29:07 -05:00

317 B

Hopper FlashMLA - Examples

The codes in this example are migrated from FlashMLA, it implements an efficient MLA decoding kernel for Hopper GPU.

Run the example

Install

python setup.py install

Run the test

python tests/test_flash_mla.py