This uses a indexed linalg.generic, which is rather awkward standalone but allows fusing into the output of the concatenate and avoid to ever materialize it in memory. I think this is the only way to get that with the current linalg stack, fusion across a concatenate would require more infrastructure. PiperOrigin-RevId: 369677652 |
||
|---|---|---|
| .. | ||
| Dialect | ||
| utils | ||
| CMakeLists.txt | ||