mlir-hlo

Commit Graph

Author	SHA1	Message	Date
Adrian Kuegel	4c282fb542	Avoid Broadcast op if all shapes are (known to) be equal. The rank specialization case for shapes which are either of the same shape or a scalar doesn't need to compute the final result shape. PiperOrigin-RevId: 380129316	2021-06-18 00:22:35 -07:00
A. Unique TensorFlower	9f47ff607b	[MLIR][KernelGen] Better rank specialization for clusters containing `mhlo.select` Merge `mhlo.select` into rank specialization clusters. Infer shape equalities correctly from `mhlo.select` (and also from `mhlo.clamp`). This allows to lower the relu kernel completely flattened. PiperOrigin-RevId: 379925793	2021-06-17 04:06:08 -07:00
A. Unique TensorFlower	82696f8598	[MLIR][HLO] Annotate `mhlo.clamp` and `mhlo.select` as element-wise broadcasting The operations allow for a limited form of broadcasting which allows some operands to be scalars. As such they are neither strictly `Elementwise`, nor `Broadcasting`. They do fulfill the requirements for `BroadcastingElementwise` though. PiperOrigin-RevId: 379719961	2021-06-16 07:59:26 -07:00
A. Unique TensorFlower	bd5752f0bf	[MLIR][HLO] Find shape equivalences and use them for better rank specialization Find shape equivalence classes among the operands and use them for better rank specialization. If all operands are known to be of the same shape, we can flatten them to rank one. If there are two shape equivalence classes, we can generalize the scalar rank specialization cases. PiperOrigin-RevId: 378844575	2021-06-11 04:00:26 -07:00
A. Unique TensorFlower	9f67417b41	[MLIR][HLO] Avoid duplicate cluster operands when merging When merging rank specialization clusters, avoid duplicating operands. A fewer number of operands usually allows better rank specialization. PiperOrigin-RevId: 378445946	2021-06-09 10:54:55 -07:00
A. Unique TensorFlower	b580722041	[MLIR][KernelGen] Merge rank specialization clusters Merge adjacent rank specialization clusters. Combine their operands, bodies, and results. PiperOrigin-RevId: 378433222	2021-06-09 10:07:47 -07:00
A. Unique TensorFlower	75a1c450ea	[MLIR][KernelGen] Fix Windows build failure Fix usage of default constructor. Instead, always use the parameterized constructor and make the maximum supported rank explicit. PiperOrigin-RevId: 377037155	2021-06-02 05:34:44 -07:00
A. Unique TensorFlower	d1828625ab	[MLIR][KernelGen] Make maximum supported rank in rank specialization configurable The maximum supported target rank of 5 is sufficient for all operations but `select`. Make the maximum target rank configurable in the rank specialization. This reduces the number of generated kernels for operations that don't require it. PiperOrigin-RevId: 376822496	2021-06-01 06:54:31 -07:00
A. Unique TensorFlower	c7c245eaf1	[MLIR][KernelGen] Add MLIR-generated Xlogy kernel Add the first MLIR-generated kernel that relies on an in-TF lowering. Fusion for this kernel relies on the generalized rank specialization for operation groups. PiperOrigin-RevId: 376805435	2021-06-01 04:48:18 -07:00
A. Unique TensorFlower	4ebcebf31c	[MLIR][HLO] Exploit scalar properties in rank specialization lowering Take advantage of the fact that scalars are already ranked and that they are neutral elements to broadcasting. Do not reshape scalars, do not consider them for broadcasting, and materialize ranked operations on scalars accordingly. PiperOrigin-RevId: 375968371	2021-05-26 09:59:13 -07:00
A. Unique TensorFlower	cb46298a07	[MLIR][HLO] Support all smaller ranks in rank specialization cases Rank specialization cases can be applied to all argument tensors of smaller ranks than the expected maximum rank. This is crucial if all operands are effectively scalars and the maximum reduced rank is 0. PiperOrigin-RevId: 375712020	2021-05-25 08:38:53 -07:00
A. Unique TensorFlower	97e6103933	[MLIR][HLO] Reshape to scalars in rank specialization Scalars were incorrectly casted to scalar tensors when they have to be reshaped. PiperOrigin-RevId: 375049088	2021-05-21 03:12:16 -07:00
A. Unique TensorFlower	3daf65578a	[MLIR][HLO] Add scalar cases for binary rank specialization For rank specialization clusters that have only two operands, we can materialize two extra cases in which either of them is a scalar. This avoids redundant index computations in these cases. PiperOrigin-RevId: 375037390	2021-05-21 01:35:44 -07:00
A. Unique TensorFlower	c62fd89663	[MLIR][HLO] Add equal shapes case to rank specialization Also restructure lowering implementation to facilitate the addition or removal of special cases. PiperOrigin-RevId: 374626365	2021-05-19 05:38:42 -07:00
A. Unique TensorFlower	6af3d2df91	[MLIR][HLO] Add rank specialization with multiple non-scalar operands Add lowering pattern for rank specialization clusters with more than one non-scalar operand. The lowering resembles that of the `TransformUnrankedHlo` pass and switches cases for maximal ranks from 1 through 8. PiperOrigin-RevId: 374377002	2021-05-18 03:02:45 -07:00
A. Unique TensorFlower	474e419729	[MLIR][HLO] Generalize rank specialization with single operand The pattern can be generalized to also rank specialize operations with a single non-scalar operand. Also extract helper functions that can be reused in following specializations. PiperOrigin-RevId: 374198381	2021-05-17 08:12:55 -07:00
A. Unique TensorFlower	c514c73390	[MLIR][HLO] Extend rank specialization clustering pass Also cluster operations that operate on same shape operands. These implicitly satisfy the broadcasting semantics requirement. Also, add test cases for some cases that appear in the current MLIR-generated kernels. PiperOrigin-RevId: 374191950	2021-05-17 07:31:36 -07:00
A. Unique TensorFlower	ccd70d5717	[MLIR][HLO] Add `rank-specialization-to-scf` pass Currently the lowering is only implemented for the unary case. The n-ary case will follow. PiperOrigin-RevId: 374162772	2021-05-17 03:56:23 -07:00
A. Unique TensorFlower	313d24bc8f	[MLIR][HLO] Add `rank-specialization-cluster` pass Add a pass to cluster unranked C/HLO operations in one `chlo.rank_specialization_cluster` op. The C/HLO operations are moved to the body of the operation. Later passes can use this to rank-specialize all these operations together. PiperOrigin-RevId: 373336725	2021-05-12 03:46:01 -07:00

19 Commits