Files
cutlass/examples/41_fused_multi_head_attention
dan_the_3rd 146d314057 Update fMHA kernels (#992)
* Update fMHA kernels

Upstream recent changes to fMHA that we did in xFormers.
Previous version in CUTLASS: facebookresearch/xformers@b6be33a
Updating to: facebookresearch/xformers@55a4798

* minor changes

* make var work

---------

Co-authored-by: danthe3rd <danthe3rd>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2023-07-12 22:30:46 -04:00
..
2023-07-12 22:30:46 -04:00
2023-07-12 22:30:46 -04:00
2023-07-12 22:30:46 -04:00
2023-04-06 20:44:58 -04:00
2023-07-12 22:30:46 -04:00
2023-07-12 22:30:46 -04:00
2023-07-12 22:30:46 -04:00
2023-07-12 22:30:46 -04:00