mlir-hlo

Commit Graph

Author	SHA1	Message	Date
Adrian Kuegel	62b357b601	Remove rank 1 specialization from TransformUnrankedHloPass. For binary ops, we already special-case rank 0 vs rank 1, and same shape. So we don't need to special-case a maximum rank of 1. PiperOrigin-RevId: 360881387	2021-03-04 04:04:11 -08:00
Geoffrey Martin-Noble	8687f3e4cf	Lower MHLO Dot to type-polymorphic linalg named ops The linalg named ops are now type polymorphic, so the type-monomorphic varieties are redundant (and will be deleted soon). PiperOrigin-RevId: 360509010	2021-03-02 14:00:58 -08:00
Adrian Kuegel	0683db3b24	Legalize MinimumBroadcastShapes op. Use it in TransformUnrankedHloPass, which allows to reduce the maximum rank for rank specialized broadcast from 6 to 5. PiperOrigin-RevId: 360415743	2021-03-02 06:39:01 -08:00
Jacques Pienaar	329b1fd071	Verify compatible shapes in unpack verification rather than exact Previously this would be too strict and fail if dynamic and static dims were compared. Dynamic/unknown are treated as "maybe equal" to a static value without further info, so at this layer don't flag as invalid unless truly are. PiperOrigin-RevId: 360189086	2021-03-01 08:00:16 -08:00
Benjamin Kramer	e19ccf975e	Filter static dimensions from dynamic_broadcast_in_dim's init_tensor Otherwise we'd generate invalid IR for those cases. PiperOrigin-RevId: 360144122	2021-03-01 03:03:54 -08:00
Adrian Kuegel	e6a1f5f0f9	Add MinimumBroadcastShapesOp to chlo dialect. This op is useful for rank specialization of broadcasts. Kernel Generator needs to generate one kernel for each rank, so if we can minimize the rank of the broadcast shape, we can support more cases with the same number of special-cased kernels. PiperOrigin-RevId: 360137827	2021-03-01 02:23:52 -08:00
Hanhan Wang	a8f99ee0f5	Fix the shape of linalg.init_tensor in conv op lowering. The output spatial dims are not the same as the input spatial dims. Only supports static output spatial dims for now. PiperOrigin-RevId: 359775479	2021-02-26 09:34:11 -08:00
Hanhan Wang	90f0d7f935	Add support for lowering mhlo.conv to Linalg on tensors. This pattern only works for normal convolutions. It does not work for depthwise convolutions. The Linalg conv ops are defined with static rank, so it only supports 1d/2d/3d cases, which are the most typical cases. This also refactors out the same check in lmhlo.conv lowering. PiperOrigin-RevId: 359503527	2021-02-25 05:59:08 -08:00
Hanhan Wang	45a1249fe2	Add support for lowering mhlo.pad to linalg.pad_tensor The change upstreams the pattern from IREE repo to MHLO repo. PiperOrigin-RevId: 359481543	2021-02-25 03:00:39 -08:00
Geoffrey Martin-Noble	89f7f2bd65	Lower integer matmuls to linalg PiperOrigin-RevId: 359306495	2021-02-24 09:45:07 -08:00
Hanhan Wang	475b4a06a5	Add support for lowering mhlo.slice to subtensor. PiperOrigin-RevId: 359297978	2021-02-24 09:06:09 -08:00
A. Unique TensorFlower	ac0552f127	[MLIR][HLO] Remove duplicate `PopulateTransformUnrankedHloPatterns` PiperOrigin-RevId: 359046173	2021-02-23 07:50:47 -08:00
Rahul Joshi	5adb7c6e12	[MLIR:LHLO] Add optional call target arg mapping to LMHLO CustomCall operations. - XLA:HLO -> LMHLO conversion drops all token arguments and return values, however custom calls that users write still expect to get buffer pointers for these token types. - To be able to support this, add an optional call target argument mapping attribute to LMHLO custom calls. When this attribute is present, it indicates the number of arguments and returns that the custom call expects and also indicates which LMHLO arg() or output() maps to which arg or result number of the custom call. PiperOrigin-RevId: 358826664	2021-02-22 08:43:00 -08:00
Benjamin Kramer	a9cc1dcfa0	[mlir][hlo] Add basic rank-specialization for select This just blows up everything to ranked (up to 6) and is probably quite slow. This is sufficient to make kernelgen compile SelectV2. PiperOrigin-RevId: 358777728	2021-02-22 02:41:12 -08:00
Benjamin Kramer	b42def4612	[mlir][hlo] Refactor rank specialization to allow an arbitrary number of inputs This actually simplifies the code a bit. PiperOrigin-RevId: 358201038	2021-02-18 09:53:03 -08:00
Benjamin Kramer	ca4034b56e	[mlir][hlo] Make select ready for dynamic shapes (ranked only for now) Move tf.SelectV2 broadcast lowering to a chlo.broadcast_select op, and lower it to broadcasts on mhlo from there. PiperOrigin-RevId: 358179975	2021-02-18 08:08:40 -08:00
Adrian Kuegel	37e31f8b26	Lower Expm1 kernel to math.ExpM1. PiperOrigin-RevId: 358152908	2021-02-18 04:54:23 -08:00
Richard Uhler	b579bd5d9e	Support dynamic-shaped operand in verification of BroadcastInDim. Verification of HLO_BroadcastInDimOp was previously failing or crashing if the operand had a dynamic shape or was unranked. Update the verification code to allow the operand to be unranked or have dynamic shape. PiperOrigin-RevId: 358056793	2021-02-17 16:18:09 -08:00
A. Unique TensorFlower	220deb3709	[MLIR][CHLO] Add legalization for `chlo.polygamma` to MHLO PiperOrigin-RevId: 357954624	2021-02-17 08:33:01 -08:00
A. Unique TensorFlower	81abaf364d	[MLIR][MHLO] Add polygamma op to the CHLO dialect PiperOrigin-RevId: 357724465	2021-02-16 08:32:33 -08:00
Adrian Kuegel	b594254c79	[mhlo] Lower int->bool to a comparison with zero This matches what TF (and C++) do in this case. PiperOrigin-RevId: 357566262	2021-02-15 06:38:09 -08:00
Benjamin Kramer	240a44de82	[mhlo] Lower int->int cast to sign extension instead of zero extension Signless does not mean unsigned here. Currently mhlo only has signed types. PiperOrigin-RevId: 357561712	2021-02-15 05:58:47 -08:00
Adrian Kuegel	8672735e9a	[mhlo] Lower float->bool to a comparison with zero This matches what TF (and C++) do in this case. PiperOrigin-RevId: 357553098	2021-02-15 04:36:36 -08:00
A. Unique TensorFlower	89d81adf6d	[mhlo] Lower float->bool to a comparison with zero This matches what TF (and C++) do in this case. PiperOrigin-RevId: 357541594	2021-02-15 03:11:56 -08:00
Benjamin Kramer	3e80d91e73	[mhlo] Lower float->bool to a comparison with zero This matches what TF (and C++) do in this case. PiperOrigin-RevId: 357534118	2021-02-15 02:17:19 -08:00
Adrian Kuegel	824bc9c425	Improve broadcast transformation to treat dynamic shapes with 1 element as scalar. A shape that contains exactly one element is effectively a scalar. This leads to a speedup in cases where we have a binary op with one operand that is effectively a scalar, because we can use the fast path. PiperOrigin-RevId: 357515552	2021-02-14 23:25:41 -08:00
A. Unique TensorFlower	4060a86fe2	Integrate LLVM at llvm/llvm-project@2bfe27da17 Updates LLVM usage to match [2bfe27da171e](https://github.com/llvm/llvm-project/commit/2bfe27da171e) PiperOrigin-RevId: 357196336	2021-02-12 08:32:03 -08:00
Tim Shen	6fa6974e8d	[XLA/GPU] Plumb through Bitcast op for LMHLO. Also remove BitcastOp. XLA bitcast requires the input buffer to alias the output buffer, which makes bitcast always a no-op. PiperOrigin-RevId: 356884383	2021-02-10 19:45:40 -08:00
Alexander Belyaev	36e04d92c0	[KERNEL_GEN] Add a pattern to bufferize `mhlo.reshape(<unranked_tensor>)`. PiperOrigin-RevId: 356720899	2021-02-10 06:32:21 -08:00
A. Unique TensorFlower	4a29ca3b1d	Add layout to mhlo::InfeedOp td. PiperOrigin-RevId: 356286875	2021-02-08 09:48:14 -08:00
Tres Popp	d086b8a0ec	Correct HLO atan2 lowering in cases of -inf and -0 inputs. This is being done by just removing the approximation and lowering to atan2 lib calls later to make the implementation the same as XLA. Note that if the approximation is brought back later, it can be fixed by changing the IR checking `less-than(X, 0)` to `less-than(copysign(X, 1), 0)` PiperOrigin-RevId: 356253941	2021-02-08 06:58:04 -08:00
A. Unique TensorFlower	2aa8a90c69	Integrate LLVM at llvm/llvm-project@a1a1d338e9 Updates LLVM usage to match [a1a1d338e99d](https://github.com/llvm/llvm-project/commit/a1a1d338e99d) PiperOrigin-RevId: 355927079	2021-02-05 14:20:29 -08:00
Rahul Joshi	b251712b1d	[XLA:GPU] Add conversion from HLO -> MLIR LMHLO for TriangularSolve - Also add layout attributes for inputs and output for error checking. PiperOrigin-RevId: 355863625	2021-02-05 09:18:02 -08:00
A. Unique TensorFlower	99bc05f2e4	Integrate LLVM at llvm/llvm-project@91e7a17133 Updates LLVM usage to match [91e7a1713332](https://github.com/llvm/llvm-project/commit/91e7a1713332) PiperOrigin-RevId: 355702100	2021-02-04 13:42:31 -08:00
Mahesh Ravishankar	44d0464d16	Use linalg.fill on tensors instead of tensor.generate in MHLO -> Linalg conversion. linalg.fill on tensors is a structured op that allows use tile + fuse to reduce the fill overhead. PiperOrigin-RevId: 355490400	2021-02-03 15:03:49 -08:00
Stephan Herhut	6cd1875ee4	Implement lowering of chlo::zeta to mhlo dialect. PiperOrigin-RevId: 355395581	2021-02-03 07:50:05 -08:00
A. Unique TensorFlower	04110a4b1c	Integrate LLVM at llvm/llvm-project@67dfe9c8d7 Updates LLVM usage to match [67dfe9c8d70c](https://github.com/llvm/llvm-project/commit/67dfe9c8d70c) PiperOrigin-RevId: 355235205	2021-02-02 13:09:20 -08:00
Tres Popp	ae722a883f	Improve performance of lowered chlo.pow with integers The new lowering takes 6 iterations of a loop always rather than iterating the exponent's number of times. PiperOrigin-RevId: 355131133	2021-02-02 03:28:38 -08:00
A. Unique TensorFlower	f40ccc5b4b	[MLIR][CHLO] Add `chlo.digamma` and lowering to MHLO PiperOrigin-RevId: 355122765	2021-02-02 02:10:17 -08:00
Adrian Kuegel	c2115f56c7	Integrate LLVM at llvm/llvm-project@8f7f2c4211 Updates LLVM usage to match [8f7f2c4211ca](https://github.com/llvm/llvm-project/commit/8f7f2c4211ca) PiperOrigin-RevId: 355120697	2021-02-02 01:54:32 -08:00
Adrian Kuegel	96f8771ed7	Add MLIR generated kernel for Angle kernel. This also requires a canonicalization pattern to remove a redundant dynamic reshape from rank 1 to rank 1. PiperOrigin-RevId: 355113135	2021-02-02 00:47:20 -08:00
Rahul Joshi	8e3890e8e8	[MLIR:HLO] Add AllGather and AllToAll operations to LMHLO dialect. - Use a common base class for AllReduce, AllGather, and AllToAll in the ODS spec. - Add basic verification for replica groups attribute. PiperOrigin-RevId: 354969654	2021-02-01 10:23:46 -08:00
Stephan Herhut	e61ef86fdb	Add zeta and broadcasting_zeta to chlo dialect. PiperOrigin-RevId: 354500879	2021-01-29 03:22:52 -08:00
Hanhan Wang	30ce82790d	Upstream mhlo.reduce lowering to Linalg to MHLO repo. In IREE, we use indexed generic op to handle the initial value. However, we lower it to a generic op that carries an init_tensor here, and leave the handle of initialization problem to later passes. PiperOrigin-RevId: 354294807	2021-01-28 05:46:09 -08:00
Lei Zhang	39589add22	Use the correct shape when converting mhlo.reshape If mhlo.reshape is not purely collapsing some consecutive operand dimensions into result dimensions, we will generate two linalg reshape op for it: the first one collapses all operand dimensions into one dimension, and the second one expands it to all result dimensions. For this case, the number of collapsed/expanded dimensions should be coming strictly from the operand/result. It is different from the case where we can generate one linalg reshape. For that case, the reassociation map should have rank equal to the largest among operand/result shape. PiperOrigin-RevId: 354293826	2021-01-28 05:37:54 -08:00
A. Unique TensorFlower	e0a7be7fb1	[MLIR][CHLO] Add `chlo.lgamma` and lowering to `hlo` PiperOrigin-RevId: 354287316	2021-01-28 04:35:03 -08:00
A. Unique TensorFlower	d77c9ad6fa	[MLIR][CHLO] Add `is_inf`, `is_pos_inf`, and `is_neg_inf` to CHLO dialect Also add the respective lowerings to MHLO. PiperOrigin-RevId: 354101955	2021-01-27 09:00:56 -08:00
Rahul Joshi	44deae2aa1	[MLIR:HLO] Extend AllReduce to support multiple inputs and results (to model tuples). - Instead of SameTypeOperands, add custom verification to check if operands and results pairwise have the same type. PiperOrigin-RevId: 353986341	2021-01-26 17:25:22 -08:00
Benjamin Kramer	f6b24a6d54	[mlir][hlo] Make min/max always propagate NaNs This is the right behavior for TF and JAX and matches what TF does on GPU. It doesn't match TF on CPU, but that's really a TF bug. PiperOrigin-RevId: 353657779	2021-01-25 09:04:16 -08:00
A. Unique TensorFlower	b1438eebcb	[mlir][hlo] Make min/max always propagate NaNs This is the right behavior for TF and JAX and matches what TF does on GPU. It doesn't match TF on CPU, but that's really a TF bug. PiperOrigin-RevId: 353628258	2021-01-25 05:43:15 -08:00

223 Commits