cutlass

Files

masahi dceabd4c5a Support half precision sigmoid activation (#378)

* Support half precision sigmoid activation

* introduce a vectorized variant using fast_tanh

* move the math to fast_math.h

* fixed compile

* .raw() -> .to_half()

Co-authored-by: Haicheng Wu <haichengw@nvidia.com>

2021-12-22 14:45:06 -05:00
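The commit message mentions a sigmoid variant built on `fast_tanh` but does not show the code. A minimal sketch of the identity such a variant can rely on, sigmoid(x) = 0.5 * tanh(0.5 * x) + 0.5, is given below. This is not the CUTLASS implementation: the function names are hypothetical, `std::tanh` stands in for `fast_tanh`, and `float` stands in for the half precision type.

```cpp
#include <array>
#include <cmath>
#include <cstddef>

// Hypothetical helper (not the CUTLASS API): sigmoid expressed through tanh,
// using the identity sigmoid(x) = 0.5 * tanh(0.5 * x) + 0.5.
// std::tanh is an assumption standing in for a fast tanh approximation.
inline float sigmoid_via_tanh(float x) {
    return 0.5f * std::tanh(0.5f * x) + 0.5f;
}

// Textbook form, kept only for comparison against the tanh-based form.
inline float sigmoid_reference(float x) {
    return 1.0f / (1.0f + std::exp(-x));
}

// Element-wise application over a fixed-size array. This plain loop is an
// assumption standing in for a vectorized variant; it processes N values per call.
template <std::size_t N>
std::array<float, N> sigmoid_via_tanh_array(const std::array<float, N>& v) {
    std::array<float, N> out{};
    for (std::size_t i = 0; i < N; ++i) {
        out[i] = sigmoid_via_tanh(v[i]);
    }
    return out;
}
```

The tanh form needs a single transcendental call and no division, which is one plausible reason to route sigmoid through a fast tanh routine.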

cutlass

Support half precision sigmoid activation (#378)

2021-12-22 14:45:06 -05:00