ttnn.kv_cache.update_cache_for_token_

ttnn.kv_cache.update_cache_for_token_(cache: ttnn.Tensor, token: ttnn.Tensor, update_index: int, batch_offset: int) → ttnn.Tensor

Updates the cache tensor in-place with values from input at update_index and batch_offset.

Parameters:

Returns:

ttnn.Tensor – the output tensor.