Commit 4f7b77e
committed
Remove unnecessary instructions from Advanced SIMD dot
The existing kernel was issuing extra instructions to organise the arguments into the same registers they would usually be in and similarly to put the result into the appropriate register.
This has an impact on smaller sized dots and seemed like a quick fix1 parent e9a911f commit 4f7b77e
1 file changed
Lines changed: 118 additions & 129 deletions
0 commit comments