gelu_tile
-
template<bool fast_and_approx = true>
void ckernel::gelu_tile_init()
-
Please refer to documentation for any_init.
-
template<bool fast_and_approx = true>
void ckernel::gelu_tile(uint32_t idst)
-
Performs element-wise computation of gelu on each element of a tile in DST register at index tile_index. The DST register buffer must be in acquired state via acquire_dst call. This call is blocking and is only available on the compute engine.
Return value: None
| Argument | Description | Type | Valid Range | Required | |——————|————————————————————————-—|——-—|—————————————————-—|——-—| | tile_index | The index of the tile in DST register buffer to perform the computation on | uint32_t | Must be less than the size of the DST register buffer | True | | fast_and_approx | Computation to be done faster and approximate | bool | | False |