This is consistent with the design of LMHLO FusionOp and simplifies its usage.
Before this change, those redundant operands ended up unused, since all sub-regions can already capture the buffers they need.
PiperOrigin-RevId: 362381155
- Extract verification of the source-target pairs attached to collective permute into a common
helper function and use it to verify both the MHLO and LMHLO variants (example below).
- Change MlirGpuTestBase::ParseMlirModule to allow returning a failure, and use
that to update mlir_gpu_compile_test to check the new behavior.
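For reference, a sketch of the attribute in question (shapes and pair values here are illustrative):
// Each pair is (source replica, target replica); the shared helper can
// enforce the same well-formedness rules for both dialects, e.g. that the
// attribute is a rank-2 i64 tensor of pairs.
"mhlo.collective_permute"(%arg0) {
  source_target_pairs = dense<[[0, 1], [1, 2], [2, 3]]> : tensor<3x2xi64>
} : (tensor<128x32xf32>) -> tensor<128x32xf32>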
PiperOrigin-RevId: 362156962
For now, the pass only reifies the required shape computations. Moving
broadcasts will follow to allow for fusion across them.
PiperOrigin-RevId: 362033715
Previously this check was too strict and failed when dynamic and static dims were
compared. Without further information, a dynamic/unknown dim is treated as "maybe equal" to a static value, so at this layer don't flag the comparison as invalid unless it truly is.
PiperOrigin-RevId: 360189086
This op is useful for rank specialization of broadcasts. Kernel Generator
needs to generate one kernel for each rank, so if we can minimize the rank
of the broadcast shape, we can support more cases with the same number of
special-cased kernels.
PiperOrigin-RevId: 360137827
- Clean up the CMakeLists file to remove an unused argument and use a new function for
setting up the lmhlo and lmhlo_gpu dialect targets.
- Fix an inconsistently formatted copyright comment and the header include guards.
PiperOrigin-RevId: 358865838
- XLA:HLO -> LMHLO conversion drops all token arguments and return values; however,
custom calls that users write still expect to get buffer pointers for these token types.
- To support this, add an optional call target argument mapping attribute to
LMHLO custom calls. When present, this attribute indicates the number of
arguments and results that the custom call expects, and it indicates which LMHLO
arg() or output() maps to which argument or result number of the custom call (sketched below).
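A hypothetical sketch of such a mapping (the attribute and field names here are illustrative, not necessarily the exact ODS spelling):
// The target expects 3 args and 2 results; target arg 1 and target result 0
// are tokens, so they have no corresponding LMHLO buffer.
"lmhlo.custom_call"(%arg0, %arg1, %out0) {
  call_target_name = "foo",
  target_arg_mapping = {
    num_args = 3 : i64,
    num_results = 2 : i64,
    args_to_target_args = [0, 2],
    results_to_target_results = [1]
  }
} : (memref<8xf32>, memref<8xf32>, memref<8xf32>) -> ()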
PiperOrigin-RevId: 358826664
`mhlo.dot_general` and `mhlo.convolution` result element types may differ from the operand element types; see the `preferred_element_type` attribute, which allows an i8 x i8 -> i32 dot computation. The `mhlo` to HLO exporter should pass the result element type to the XLA builder to override XLA's shape inference.
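For example (dimension numbers abbreviated to a plain matmul; the attribute syntax may differ across MHLO versions):
// i8 x i8 -> i32 matmul: the i32 result element type must be passed to the
// XLA builder rather than re-derived from the i8 operands.
%0 = "mhlo.dot_general"(%lhs, %rhs) {
  dot_dimension_numbers = {
    lhs_batching_dimensions = dense<[]> : tensor<0xi64>,
    rhs_batching_dimensions = dense<[]> : tensor<0xi64>,
    lhs_contracting_dimensions = dense<1> : tensor<1xi64>,
    rhs_contracting_dimensions = dense<0> : tensor<1xi64>
  }
} : (tensor<2x3xi8>, tensor<3x4xi8>) -> tensor<2x4xi32>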
PiperOrigin-RevId: 358580718
Also remove BitcastOp. XLA bitcast requires the input buffer to alias the output buffer, which makes bitcast always a no-op.
PiperOrigin-RevId: 356884383
- Use a common base class for AllReduce, AllGather, and AllToAll in the ODS spec.
- Add basic verification for the replica groups attribute (see the example below).
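For example, a well-formed replica_groups attribute partitions the replicas into equal-sized groups (op and shapes here are illustrative):
// Four replicas in two groups of two; each row is one group.
%0 = "mhlo.all_gather"(%arg0) {
  all_gather_dim = 1 : i64,
  replica_groups = dense<[[0, 1], [2, 3]]> : tensor<2x2xi64>
} : (tensor<8x2xf32>) -> tensor<8x4xf32>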
PiperOrigin-RevId: 354969654
This is the right behavior for TF and JAX and matches what TF does on GPU. It
doesn't match TF on CPU, but that's really a TF bug.
PiperOrigin-RevId: 353657779
This is the right behavior for TF and JAX and matches what TF does on GPU. It
doesn't match TF on CPU, but that's really a TF bug.
PiperOrigin-RevId: 353628258
This is the right behavior for TF and JAX and matches what TF does on GPU. It
doesn't match TF on CPU, but that's really a TF bug.
PiperOrigin-RevId: 353624935
Allow for relative tolerance in unary kernel tests. For the cosh kernels,
this allows accepting an observed difference of 5.6e-8 between the kernel and
the `std::cosh` reference (32829984.568665262 vs. 32829984.568665318) in one of
the test cases, which amounts to a relative error of roughly 1.7e-15.
PiperOrigin-RevId: 351983698
For floating point operations, this uses std.pow.
For integer operations, this lowers to a loop.
This adds a dependency on scf.
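A rough sketch of the integer lowering under the simplest scheme, repeated multiplication via scf.for (the actual pass may use a different recurrence, e.g. square-and-multiply; names here are illustrative):
// Computes base^exp for a non-negative exponent by multiplying exp times.
func @ipow(%base: i32, %exp: index) -> i32 {
  %one = constant 1 : i32
  %lb = constant 0 : index
  %step = constant 1 : index
  %res = scf.for %i = %lb to %exp step %step iter_args(%acc = %one) -> (i32) {
    %next = muli %acc, %base : i32
    scf.yield %next : i32
  }
  return %res : i32
}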
PiperOrigin-RevId: 348537232
Shape inference for ops with complex element types needs to use the element type of the complex type as the result element type, not the full operand type.
Before:
"mhlo.abs"(%arg0) : (tensor<4xcomplex<f32>>) -> tensor<4xtensor<4xcomplex<f32>>>
After:
"mhlo.abs"(%arg0) : (tensor<4xcomplex<f32>>) -> tensor<4xf32>
PiperOrigin-RevId: 348123967
- MLIR MemRefs do not preserve layout information correctly when unit dimensions
are involved. However, operations like convolution that use cuDNN need the correct
layout to be preserved so that we do not end up creating a combination
of input/filter/output layouts that cuDNN does not support.
- Add these layouts to the convolution attributes as I32ArrayAttrs representing
the layout in "minor_to_major" form, similar to XLA (see the sketch below).
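For intuition, minor_to_major lists dimensions from fastest- to slowest-varying, as in XLA: a row-major rank-4 buffer is [3, 2, 1, 0], while an NCHW-shaped buffer laid out physically as NHWC would be (the attribute name here is illustrative):
// C (dim 1) varies fastest, then W (3), then H (2), then N (0).
#nhwc_minor_to_major = [1 : i32, 3 : i32, 2 : i32, 0 : i32]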
PiperOrigin-RevId: 348034757
It didn't return 0 for 0.0 and -0.0.
Currently we emit -0.0 for -0.0, which is correct according to the HLO dialect.
For the TF_SignOp we should emit 0.0 in that case; we will leave that as a TODO.
Enable the tests that work now, and add another one for Int64.
Also improve the registration code; we should not register the Int32 kernel.
PiperOrigin-RevId: 347981124
It didn't return 0 for 0.0 and -0.0.
Currently we emit -0.0 for -0.0, which is correct according to the HLO dialect.
For the TF_SignOp we should emit 0.0 in that case; we will leave that as a TODO.
Enable the tests that work now, and add another one for Int64.
Also improve the registration code; we should not register the Int32 kernel.
PiperOrigin-RevId: 347602378
It didn't return 0 for 0.0 and -0.0.
Currently we emit -0.0 for -0.0, which is correct according to the HLO dialect.
For the TF_SignOp we should emit 0.0 in that case; we will leave that as a TODO.
Enable the tests that work now, and add another one for Int64.
Also improve the registration code; we should not register the Int32 kernel.
PiperOrigin-RevId: 347590340
- Add this attribute to match the corresponding XLA HLO attribute on convolution
operations.
- A true value indicates a reversal of the corresponding kernel spatial dimension
(see the sketch below).
- Since the XLA builder does not support this attribute, use a custom HLO converter to map
from mlir::mhlo::ConvOp to XLA.
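A hypothetical sketch of the attribute's shape (the exact name and element type in the ODS may differ): one boolean per kernel spatial dimension, with true meaning that dimension is reversed:
// For a 2-D convolution: reverse the first kernel spatial dimension only.
#window_reversal = dense<[true, false]> : tensor<2xi1>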
PiperOrigin-RevId: 346891737
- Add a variant of the fused convolution that does not need a side input and side input scale.
- Rename the existing one to `ConvForwardFusedSideInputOp`.
- Update tests to exercise all variants of the convolution ops in the GPU dialect.
- Eliminate the unused `LHLO_ExtentBuffer` and change `LHLO_Buffer` to allow any integer element
type, to match what XLA can sometimes generate for scratch buffers.
PiperOrigin-RevId: 345701569
- Split out enum definitions in hlo dialect into a separate .td file (similar to structs)
and generate enum decl/defs for these enums.
- Also split out the LHLO GPU enums into a separate .td file and generate enum
decl/defs for these enums as well.
- Remove unused dialect from ConvolutionAttributes and generate lhlo_gpu enums.
- Add appropriate namespace for all the enums.
PiperOrigin-RevId: 345277240
- Map the custom call for GEMM in XLA HLO to the Gemm / Gemm-with-bias operations in the LHLO GPU
dialect.
- Make 'algorithm' an optional attribute to better match the XLA HLO backend config.
- Replace 'alpha' with 'alpha_real' and 'alpha_complex' to support complex GEMM correctly.
- Generate GemmThunk from the LHLO GPU Gemm operations.
PiperOrigin-RevId: 345250840
- Restructure LHLO GPU Cholesky to better match XLA HLO by eliminating the
untyped buffer and changing the is_upper attribute to is_lower.
- Change LhloDialectEmitter to emit the LHLO GPU Cholesky operation.
PiperOrigin-RevId: 343873516