317 B
317 B
Hopper FlashMLA - Examples
The codes in this example are migrated from FlashMLA, it implements an efficient MLA decoding kernel for Hopper GPU.
Run the example
Install
python setup.py install
Run the test
python tests/test_flash_mla.py