Files
cutlass/include/cutlass
masahi dceabd4c5a Support half precision sigmoid activation (#378)
* Support half precision sigmoid activation

* introduce a vectorized variant using fast_tanh

* move the math to fast_math.h

* fixed compile

* .raw() -> .to_half()

Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
2021-12-22 14:45:06 -05:00
..
2021-12-17 16:04:43 -05:00
2021-11-19 13:26:35 -08:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-11-19 13:26:35 -08:00
2021-07-27 17:58:30 -07:00
2021-11-19 13:26:35 -08:00
2021-11-19 13:26:35 -08:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-11-19 13:26:35 -08:00
2021-12-17 16:04:43 -05:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-11-19 13:26:35 -08:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-12-17 16:04:43 -05:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00
2021-07-27 17:58:30 -07:00