Why _index_put_impl_ is deterministic when accumulate=true on GPU

_index_put_impl_

This will degrade the API’s performance