cutlass/examples
Bing Xu d0d941efc7 [hardswish] correct implementation (#403)
* [hardswish] correct implementation

* seems working

* hardswish fp32/fp16x2 optimization

* [relu] half2 support

* add relu0; add multiply_add_relu0;

* cleanup

Co-authored-by: Bing Xu <bingxu@fb.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2022-02-09 14:28:53 -05:00