PR #50191: [MLIR][DISC] Add RAL (Runtime abstraction layer) Dialect

Imported from GitHub PR https://github.com/tensorflow/tensorflow/pull/50191

DISC is a e2e flow, including both compiler side and runtime side. For
runtime side, we have different targeting environments (e.g. tensorflow,
pytorch, or sometimes even a standalone binary). In order to simplify
the design of the compiler side, we design a Runtime Abstraction Layer
(RAL) to sperate the compiler side and runtime side. Thus the compiler
side only need to target RAL itself and it is the responsibility of RAL
to handle the differences between different targeting environments.

One of the most important functions of RAL is to manage stateful
resources. To this end, it provides a context object, and hides all
stateful operations behind this context, thus the compiler side itself
doesn't need to care about the resource initialization. For example, a
kernel must be loaded before it can be launched on GPU. However, the
loading operation should only be taken once during the whole lifetime of
the context in order to achieve the best performance. Based on the
initialization-free interfaces provided by RAL, compiler side can focus
on its core optimization logic and lets the RAL to manage the resource
status.

The context mentioned above is passed as a parameter to the entry
function and all RAL APIs should always use the context as their first
argument. This CR also provides a pass to help to ensure this property.
The pass rewrites the entry function to make sure their first argument
is the context. For entry function, the pass also rewrites its inputs
and outputs. To be concrete, all the original inputs and outputs of the
entry function are received from and sent to RAL through a sequence of
RAL API calls correspondingly. The motivation behind this is to hide the
implementation details of I/Os. This design may also potentially enable
partial execution of the compiled module when some of the inputs are
ready.
Copybara import of the project:

--
c4f20a89aed71181e75bcc5265723b88bde23240 by Wenyi Zhao <reyizero@gmail.com>:

[MLIR][DISC] Add RAL (Runtime abstraction layer) Dialect

DISC is a e2e flow, including both compiler side and runtime side. For
runtime side, we have different targeting environments (e.g. tensorflow,
pytorch, or sometimes even a standalone binary). In order to simplify
the design of the compiler side, we design a Runtime Abstraction Layer
(RAL) to sperate the compiler side and runtime side. Thus the compiler
side only need to target RAL itself and it is the responsibility of RAL
to handle the differences between different targeting environments.

One of the most important functions of RAL is to manage stateful
resources. To this end, it provides a context object, and hides all
stateful operations behind this context, thus the compiler side itself
doesn't need to care about the resource initialization. For example, a
kernel must be loaded before it can be launched on GPU. However, the
loading operation should only be taken once during the whole lifetime of
the context in order to achieve the best performance. Based on the
initialization-free interfaces provided by RAL, compiler side can focus
on its core optimization logic and lets the RAL to manage the resource
status.

The context mentioned above is passed as a parameter to the entry
function and all RAL APIs should always use the context as their first
argument. This CR also provides a pass to help to ensure this property.
The pass rewrites the entry function to make sure their first argument
is the context. For entry function, the pass also rewrites its inputs
and outputs. To be concrete, all the original inputs and outputs of the
entry function are received from and sent to RAL through a sequence of
RAL API calls correspondingly. The motivation behind this is to hide the
implementation details of I/Os. This design may also potentially enable
partial execution of the compiled module when some of the inputs are
ready.

--
1991d4f80ab6087943956e1c0fec4940a22ab08d by Wenyi Zhao <reyizero@gmail.com>:

fix

PiperOrigin-RevId: 379317586
This commit is contained in:
Wenyi Zhao 2021-06-14 11:26:41 -07:00 committed by TensorFlow MLIR Team
parent a6011d0279
commit 23ebbb28d1
15 changed files with 579 additions and 0 deletions

100
BUILD
View File

@ -611,11 +611,107 @@ cc_library(
alwayslink = 1, alwayslink = 1,
) )
gentbl_cc_library(
name = "DiscRalPassIncGen",
strip_include_prefix = "include",
tbl_outs = [
(
[
"-gen-pass-decls",
"-name=RAL",
],
"include/mlir-hlo/Dialect/mhlo/transforms/disc_ral_passes.h.inc",
),
],
tblgen = "@llvm-project//mlir:mlir-tblgen",
td_file = "include/mlir-hlo/Dialect/mhlo/transforms/disc_ral_passes.td",
td_includes = [
"external/mlir-hlo/include",
"include",
],
deps = [
"@llvm-project//mlir:PassBaseTdFiles",
],
)
gentbl_cc_library(
name = "disc_ral_ops_inc_gen",
strip_include_prefix = "include",
tbl_outs = [
(
["-gen-op-decls"],
"include/mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.h.inc",
),
(
["-gen-op-defs"],
"include/mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.cc.inc",
),
],
tblgen = "@llvm-project//mlir:mlir-tblgen",
td_file = "include/mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.td",
td_includes = [
"external/mlir-hlo/include",
"include",
],
deps = [":hlo_ops_td_files"],
)
cc_library(
name = "disc_ral",
srcs = [
"include/mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.cc.inc",
"include/mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.h.inc",
"lib/Dialect/mhlo/IR/disc_ral_ops.cc",
],
hdrs = [
"include/mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.h",
],
includes = ["include"],
deps = [
":disc_ral_ops_inc_gen",
"@llvm-project//llvm:Support",
"@llvm-project//mlir:Analysis",
"@llvm-project//mlir:ControlFlowInterfaces",
"@llvm-project//mlir:CopyOpInterface",
"@llvm-project//mlir:IR",
"@llvm-project//mlir:InferTypeOpInterface",
"@llvm-project//mlir:LoopLikeInterface",
"@llvm-project//mlir:Pass",
"@llvm-project//mlir:SideEffects",
"@llvm-project//mlir:StandardOps",
"@llvm-project//mlir:Support",
"@llvm-project//mlir:TransformUtils",
"@llvm-project//mlir:Transforms",
"@llvm-project//mlir:ViewLikeInterface",
],
alwayslink = 1,
)
cc_library(
name = "ral_inject_execution_context",
srcs = ["lib/Dialect/mhlo/transforms/ral_inject_execution_context.cc"],
hdrs = ["include/mlir-hlo/Dialect/mhlo/transforms/passes.h"],
deps = [
":disc_ral",
":pass_details",
"@llvm-project//llvm:Support",
"@llvm-project//mlir:IR",
"@llvm-project//mlir:Pass",
"@llvm-project//mlir:SCFDialect",
"@llvm-project//mlir:Shape",
"@llvm-project//mlir:StandardOps",
"@llvm-project//mlir:TensorDialect",
"@llvm-project//mlir:Transforms",
],
alwayslink = 1,
)
cc_library( cc_library(
name = "hlo_dialect_registration", name = "hlo_dialect_registration",
srcs = ["lib/Dialect/mhlo/IR/init.cc"], srcs = ["lib/Dialect/mhlo/IR/init.cc"],
hdrs = ["include/mlir-hlo/Dialect/mhlo/IR/register.h"], hdrs = ["include/mlir-hlo/Dialect/mhlo/IR/register.h"],
deps = [ deps = [
":disc_ral",
":hlo", ":hlo",
":lhlo", ":lhlo",
":lhlo_gpu", ":lhlo_gpu",
@ -1159,6 +1255,7 @@ cc_library(
"//visibility:private", # This target is a private detail of pass implementations "//visibility:private", # This target is a private detail of pass implementations
], ],
deps = [ deps = [
":DiscRalPassIncGen",
":MhloPassIncGen", ":MhloPassIncGen",
"@llvm-project//mlir:Pass", "@llvm-project//mlir:Pass",
], ],
@ -1202,6 +1299,7 @@ cc_library(
"include/mlir-hlo/Dialect/mhlo/transforms/register_passes.h", "include/mlir-hlo/Dialect/mhlo/transforms/register_passes.h",
], ],
deps = [ deps = [
":DiscRalPassIncGen",
":LmhloPassIncGen", ":LmhloPassIncGen",
":MhloPassIncGen", ":MhloPassIncGen",
":broadcast_propagation", ":broadcast_propagation",
@ -1221,6 +1319,7 @@ cc_library(
":mhlo_control_flow_to_scf", ":mhlo_control_flow_to_scf",
":mhlo_fusion", ":mhlo_fusion",
":mhlo_to_mhlo_lowering_patterns", ":mhlo_to_mhlo_lowering_patterns",
":ral_inject_execution_context",
":rank_specialization", ":rank_specialization",
":sink_constants_to_control_flow", ":sink_constants_to_control_flow",
":test_passes", ":test_passes",
@ -1236,6 +1335,7 @@ cc_binary(
], ],
deps = [ deps = [
":all_passes", ":all_passes",
":disc_ral",
":hlo", ":hlo",
":lhlo", ":lhlo",
":lhlo_gpu", ":lhlo_gpu",

View File

@ -55,3 +55,13 @@ add_mlir_hlo_dialect_separate_files(lhlo_ops NO)
add_mlir_hlo_dialect_separate_files(lhlo_gpu_ops YES) add_mlir_hlo_dialect_separate_files(lhlo_gpu_ops YES)
add_mlir_interface(infer_fusibility_op_interface) add_mlir_interface(infer_fusibility_op_interface)
function(add_disc_ral_dialect dialect)
set(LLVM_TARGET_DEFINITIONS ${dialect}.td)
mlir_tablegen(${dialect}.h.inc -gen-op-decls)
mlir_tablegen(${dialect}.cc.inc -gen-op-defs)
add_public_tablegen_target(MLIR${dialect}IncGen)
add_dependencies(mlir-headers MLIR${dialect}IncGen)
endfunction()
add_disc_ral_dialect(disc_ral_ops)

View File

@ -0,0 +1,65 @@
/* Copyright 2021 The TensorFlow Authors. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
==============================================================================*/
// This file defines the operations used in the disc_ral dialect.
#ifndef TENSORFLOW_COMPILER_MLIR_HLO_INCLUDE_MLIR_HLO_DIALECT_MHLO_IR_DISC_RAL_OPS_H_
#define TENSORFLOW_COMPILER_MLIR_HLO_INCLUDE_MLIR_HLO_DIALECT_MHLO_IR_DISC_RAL_OPS_H_
#include "llvm/ADT/StringRef.h"
#include "mlir/Dialect/StandardOps/IR/Ops.h"
#include "mlir/IR/Attributes.h"
#include "mlir/IR/BuiltinTypes.h"
#include "mlir/IR/Dialect.h"
#include "mlir/IR/DialectImplementation.h"
#include "mlir/IR/Location.h"
#include "mlir/IR/MLIRContext.h"
#include "mlir/IR/OpDefinition.h"
#include "mlir/IR/Operation.h"
#include "mlir/IR/TypeUtilities.h"
#include "mlir/IR/Types.h"
#include "mlir/Interfaces/InferTypeOpInterface.h"
#include "mlir/Interfaces/SideEffectInterfaces.h"
namespace mlir {
class OpBuilder;
namespace disc_ral {
class RalDialect : public Dialect {
public:
explicit RalDialect(MLIRContext* context);
static StringRef getDialectNamespace() { return "disc_ral"; }
// Parses a type registered to this dialect.
Type parseType(DialectAsmParser& parser) const override;
// Prints a type registered to this dialect.
void printType(Type type, DialectAsmPrinter& os) const override;
};
class RalExecutionContextType
: public Type::TypeBase<RalExecutionContextType, Type, TypeStorage> {
public:
using Base::Base;
};
} // end namespace disc_ral
} // end namespace mlir
#define GET_OP_CLASSES
#include "mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.h.inc"
#endif // TENSORFLOW_COMPILER_MLIR_HLO_INCLUDE_MLIR_HLO_DIALECT_MHLO_IR_DISC_RAL_OPS_H_

View File

@ -0,0 +1,95 @@
/* Copyright 2021 The TensorFlow Authors. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
==============================================================================*/
// This is the operation definition file for DISC RAL ops.
#ifndef DISC_RAL_OPS
#define DISC_RAL_OPS
include "mlir/IR/OpBase.td"
def RAL_Dialect : Dialect {
let name = "disc_ral";
let cppNamespace = "::mlir::disc_ral";
}
class RAL_Op<string mnemonic, list<OpTrait> traits> :
Op<RAL_Dialect, mnemonic, traits> {
let verifier = [{ return Verify(*this); }];
}
def RAL_RalExecutionContextType : DialectType<RAL_Dialect,
CPred<"$_self.isa<::mlir::disc_ral::RalExecutionContextType>()">,
"context">,
BuildableType<"$_builder.getType<::mlir::disc_ral::RalExecutionContextType>()"> {
let description = [{
RalExecutionContextType corresponds to C++ class RalExecution defined in
disc.
}];
}
def RAL_RecvInputOp: RAL_Op<"recv_input", []> {
let summary = "receive input operator";
let description = [{
Fetch the specified input.
In DISC, inputs are not passed as parameter directly. Instead, users need to
explicitly fetch the individual input using this op.
}];
let arguments = (ins
RAL_RalExecutionContextType:$ctx,
Index:$input_idx
);
let results = (outs MemRefOf<[AnyType]>);
}
def RAL_SendOutputOp: RAL_Op<"send_output", []> {
let summary = "send output operator";
let description = [{
Send the specified output.
In DISC, outputs are not returned to the caller directly. Instead, users need to
explicitly send the individual output using this op.
}];
let arguments = (ins
RAL_RalExecutionContextType:$ctx,
Index:$output_idx,
MemRefOf<[AnyType]>:$result
);
let results = (outs);
}
def RAL_DispatchOp: RAL_Op<"dispatch", []> {
let summary = "Dispatch operator";
let description = [{
A dispatch op invokes code external to disc. The `args` are passed to the
external code, and the external code is expected to produce a result of the
given type. The exact mechanism is backend-specific.
`call_target_name` and `backend_config` can be arbitrary strings, but
`call_target_name` should be short as it may be used in labels.
`backend_config` can encode arbitrarily large amounts of information.
}];
let arguments = (ins
RAL_RalExecutionContextType:$ctx,
Variadic<AnyType>:$args,
StrAttr:$call_target_name,
DefaultValuedAttr<BoolAttr, "false">:$has_side_effect,
DefaultValuedAttr<StrAttr, "">:$backend_config
);
let results = (outs Variadic<AnyType>);
}
#endif // DISC_RAL_OPS

View File

@ -21,3 +21,7 @@ add_public_tablegen_target(MLIRMhloPassIncGen)
set(LLVM_TARGET_DEFINITIONS lmhlo_passes.td) set(LLVM_TARGET_DEFINITIONS lmhlo_passes.td)
mlir_tablegen(lmhlo_passes.h.inc -gen-pass-decls -name LMHLO) mlir_tablegen(lmhlo_passes.h.inc -gen-pass-decls -name LMHLO)
add_public_tablegen_target(MLIRLmhloPassIncGen) add_public_tablegen_target(MLIRLmhloPassIncGen)
set(LLVM_TARGET_DEFINITIONS disc_ral_passes.td)
mlir_tablegen(disc_ral_passes.h.inc -gen-pass-decls -name RAL)
add_public_tablegen_target(MLIRDiscRalPassIncGen)

View File

@ -27,4 +27,13 @@ namespace mhlo {
} // end namespace mhlo } // end namespace mhlo
} // end namespace mlir } // end namespace mlir
namespace mlir {
namespace disc_ral {
#define GEN_PASS_CLASSES
#include "mlir-hlo/Dialect/mhlo/transforms/disc_ral_passes.h.inc"
} // end namespace disc_ral
} // end namespace mlir
#endif // TENSORFLOW_COMPILER_MLIR_HLO_INCLUDE_MLIR_HLO_DIALECT_MHLO_TRANSFORMS_PASSDETAIL_H_ #endif // TENSORFLOW_COMPILER_MLIR_HLO_INCLUDE_MLIR_HLO_DIALECT_MHLO_TRANSFORMS_PASSDETAIL_H_

View File

@ -0,0 +1,25 @@
/* Copyright 2021 The TensorFlow Authors. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
==============================================================================*/
include "mlir/Pass/PassBase.td"
def RalInjectExecutionContextPass : Pass<"disc-ral-inject-execution-context", "ModuleOp"> {
let summary = "Inject DISC RAL execution context.";
let constructor = "createRalInjectExecutionContextPass()";
let options = [
Option<"entry_func_name_", "entry-func-name", "std::string",
/*default=*/"\"main\"", "Name of the entry function.">,
];
}

View File

@ -119,6 +119,13 @@ std::unique_ptr<OperationPass<FuncOp>> createLegalizeTensorLoadOpPass();
} // namespace lmhlo } // namespace lmhlo
namespace disc_ral {
std::unique_ptr<OperationPass<ModuleOp>> createRalInjectExecutionContextPass(
const std::string& entry_func_name = "main");
} // namespace disc_ral
} // namespace mlir } // namespace mlir
#endif // TENSORFLOW_COMPILER_MLIR_HLO_INCLUDE_MLIR_HLO_DIALECT_MHLO_TRANSFORMS_PASSES_H_ #endif // TENSORFLOW_COMPILER_MLIR_HLO_INCLUDE_MLIR_HLO_DIALECT_MHLO_TRANSFORMS_PASSES_H_

View File

@ -43,4 +43,15 @@ inline void registerAllLmhloPasses() { registerLMHLOPasses(); }
} // namespace lmhlo } // namespace lmhlo
} // namespace mlir } // namespace mlir
namespace mlir {
namespace disc_ral {
#define GEN_PASS_REGISTRATION
#include "mlir-hlo/Dialect/mhlo/transforms/disc_ral_passes.h.inc"
inline void registerAllDiscRalPasses() { registerRALPasses(); }
} // namespace disc_ral
} // namespace mlir
#endif // MLIR_HLO_DIALECT_MHLO_TRANSFORMS_REGISTER_PASSES_H_ #endif // MLIR_HLO_DIALECT_MHLO_TRANSFORMS_REGISTER_PASSES_H_

View File

@ -91,6 +91,17 @@ target_link_libraries(LmhloGPUDialect
HloOpsCommon HloOpsCommon
) )
add_mlir_dialect_library(DiscRalDialect
disc_ral_ops.cc
DEPENDS
MLIRdisc_ral_opsIncGen
)
target_link_libraries(DiscRalDialect
PUBLIC
MLIRIR
)
add_mlir_dialect_library(MhloRegisterDialects add_mlir_dialect_library(MhloRegisterDialects
init.cc init.cc
DEPENDS DEPENDS

View File

@ -0,0 +1,64 @@
/* Copyright 2021 The TensorFlow Authors. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
==============================================================================*/
// This file defines the operations used in the DISC RAL dialect.
#include "mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.h"
namespace mlir {
namespace disc_ral {
template <typename T>
static LogicalResult Verify(T op) {
return success();
}
//===----------------------------------------------------------------------===//
// ral Dialect Constructor
//===----------------------------------------------------------------------===//
RalDialect::RalDialect(MLIRContext* context)
: Dialect(getDialectNamespace(), context, TypeID::get<RalDialect>()) {
addOperations<
#define GET_OP_LIST
#include "mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.cc.inc"
>();
addTypes<RalExecutionContextType>();
context->loadDialect<memref::MemRefDialect>();
}
Type RalDialect::parseType(DialectAsmParser& parser) const {
StringRef data_type;
if (parser.parseKeyword(&data_type)) return Type();
if (data_type == "context") return RalExecutionContextType::get(getContext());
parser.emitError(parser.getNameLoc())
<< "unknown disc_ral type: " << data_type;
return nullptr;
}
void RalDialect::printType(Type type, DialectAsmPrinter& os) const {
if (type.isa<RalExecutionContextType>()) {
os << "context";
return;
}
os << "<unknown disc_ral type>";
}
} // namespace disc_ral
} // namespace mlir
#define GET_OP_CLASSES
#include "mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.cc.inc"

View File

@ -154,9 +154,26 @@ add_mlir_library(LmhloPasses
MLIRPass MLIRPass
) )
add_mlir_library(DiscRalPasses
ral_inject_execution_context.cc
DEPENDS
MLIRdisc_ral_opsIncGen
MLIRDiscRalPassIncGen
LINK_COMPONENTS
Core
LINK_LIBS PUBLIC
DiscRalDialect
MLIRIR
MLIRPass
)
add_library(AllMhloPasses INTERFACE) add_library(AllMhloPasses INTERFACE)
target_link_libraries(AllMhloPasses INTERFACE target_link_libraries(AllMhloPasses INTERFACE
ChloPasses ChloPasses
DiscRalPasses
MhloPasses MhloPasses
MhloToLhloConversion MhloToLhloConversion
MhloToStandard MhloToStandard

View File

@ -0,0 +1,139 @@
/* Copyright 2021 The TensorFlow Authors. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
==============================================================================*/
// This file implements logic for injecting execution context to the entry
// function.
//
// Below is an example. Before Conversion:
// ```
// func @main(%arg0: memref<?x?xf32>, %arg1: memref<?x?xf32>) ->
// memref<?x?xf32> {
// %0 = memref.alloc(...)
// "lmhlo.add"(%arg0, %arg1, %0) : (memref<?x?xf32>, memref<?x?xf32>,
// memref<?x?xf32>) -> memref<?x?xf32> return %0 : memref<?x?xf32>
// }
// ```
// After conversion:
// ```
// func @main(%ctx: !disc_ral.context) {
// %c0 = constant 0 : index
// %c1 = constant 1 : index
// "disc_ral.recv_input"(%ctx, %c0) : (!disc_ral.context, index) ->
// memref<?x?xf32> "disc_ral.recv_input"(%ctx, %c1) : (!disc_ral.context,
// index) -> memref<?x?xf32> %0 = memref.alloc(...) "lmhlo.add"(%arg0,
// %arg1, %0) : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>) ->
// memref<?x?xf32> "disc_ral.send_output"(%ctx, %c0, %0) :
// (!disc_ral.context, index, memref<?x?xf32>) -> ()
// }
// ```
// 1. rewrite entry function (supposed that no other function directly calls the
// entry function)
// - function signature rewrite
// - return-like ops rewrite.
// 2. Currently we suppose that functions except the entry function are inlined
// to the entry function. Thus, we don't rewrite all call ops and other
// functions a.t.m. Re-visit this assumption if necessary.
#include "mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.h"
#include "mlir-hlo/Dialect/mhlo/transforms/PassDetail.h"
#include "mlir/Dialect/StandardOps/IR/Ops.h"
#include "mlir/IR/Attributes.h"
#include "mlir/IR/Builders.h"
#include "mlir/IR/BuiltinOps.h"
#include "mlir/IR/BuiltinTypes.h"
#include "mlir/IR/Location.h"
#include "mlir/IR/MLIRContext.h"
#include "mlir/IR/Operation.h"
#include "mlir/Pass/Pass.h"
namespace mlir {
namespace disc_ral {
namespace {
struct RalInjectExecutionContextPass
: public RalInjectExecutionContextPassBase<RalInjectExecutionContextPass> {
explicit RalInjectExecutionContextPass(const std::string& entry_func_name)
: RalInjectExecutionContextPassBase<RalInjectExecutionContextPass>::
RalInjectExecutionContextPassBase() {
this->entry_func_name_ = entry_func_name;
}
void getDependentDialects(DialectRegistry& registry) const override {
registry.insert<RalDialect>();
}
void runOnOperation() override {
ModuleOp m = getOperation();
FuncOp main = m.lookupSymbol<FuncOp>(entry_func_name_);
if (!main) {
m.emitError("entry func: " + entry_func_name_ + " not found");
signalPassFailure();
}
Location loc = main.getLoc();
FunctionType funcType = main.getType();
OpBuilder b(&main.getBody());
Block* entry_block = &main.getBody().front();
Type ctx_type = RalExecutionContextType::get(b.getContext());
// 1. Prepend context to the entry block arguments
Value ctx = entry_block->insertArgument(0u, ctx_type);
// 2. remap original arguments to recv_input ops
for (auto&& en : llvm::enumerate(
llvm::zip(funcType.getInputs(),
entry_block->getArguments().drop_front(1)))) {
Value idx = b.create<ConstantIndexOp>(loc, en.index());
Type argType = std::get<0>(en.value());
Value oldArgument = std::get<1>(en.value());
Value newInput = b.create<RecvInputOp>(loc, argType, ctx, idx);
oldArgument.replaceAllUsesWith(newInput);
}
// 3. remap all return-like ops to send_output ops
for (auto& block : main.getBody()) {
if (block.empty()) continue;
Operation& operation = block.back();
if (!operation.hasTrait<OpTrait::ReturnLike>()) continue;
b.setInsertionPoint(&operation);
for (auto& en : llvm::enumerate(operation.getOperands())) {
Value idx = b.create<ConstantIndexOp>(loc, en.index());
b.create<SendOutputOp>(loc, ctx, idx, en.value());
}
operation.eraseOperands(0, operation.getNumOperands());
}
// 4. remove unused block arguments of entry block
for (int i = 0, e = funcType.getInputs().size(); i < e; ++i) {
// continue to remove the 1st (starting from zero) argument
entry_block->eraseArgument(1);
}
// 5. set entry func to new type
main.setType(b.getFunctionType({ctx_type}, {}));
}
};
} // namespace
std::unique_ptr<OperationPass<ModuleOp>> createRalInjectExecutionContextPass(
const std::string& entry_func_name) {
return std::make_unique<RalInjectExecutionContextPass>(entry_func_name);
}
} // namespace disc_ral
} // namespace mlir

View File

@ -0,0 +1,19 @@
// RUN: mlir-hlo-opt -disc-ral-inject-execution-context=entry-func-name=test \
// RUN: -canonicalize %s -o - | FileCheck %s
// CHECK-LABEL: func @test
// CHECK-SAME: (%[[CTX:.*]]: !disc_ral.context) {
func @test(%arg0: memref<?x?xf32>, %arg1: memref<?x?xf32>,
%arg2: memref<?x?xf32>, %arg3: memref<?x?xf32>) -> (memref<?x?xf32>, memref<?x?xf32>) {
// %[[T0:.*]] = "disc_ral.recv_input"(%[[CTX]], %c0) : (!disc_ral.context, index) -> memref<?x?xf32>
// %[[T1:.*]] = "disc_ral.recv_input"(%[[CTX]], %c1) : (!disc_ral.context, index) -> memref<?x?xf32>
// %[[T2:.*]] = "disc_ral.recv_input"(%[[CTX]], %c2) : (!disc_ral.context, index) -> memref<?x?xf32>
// %[[T3:.*]] = "disc_ral.recv_input"(%[[CTX]], %c3) : (!disc_ral.context, index) -> memref<?x?xf32>
// "lmhlo.abs"(%[[T0]], %[[T1]]) : (memref<?x?xf32>, memref<?x?xf32>) -> ()
// "lmhlo.add"(%[[T1]], %[[T2]], %[[T3]]) : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>) -> ()
// "disc_ral.send_output"(%[[CTX]], %c0, %[[T0]]) : (!disc_ral.context, index, memref<?x?xf32>) -> ()
// "disc_ral.send_output"(%[[CTX]], %c1, %[[T3]]) : (!disc_ral.context, index, memref<?x?xf32>) -> ()
"lmhlo.abs"(%arg0, %arg1) : (memref<?x?xf32>, memref<?x?xf32>) -> ()
"lmhlo.add"(%arg1, %arg2, %arg3) : (memref<?x?xf32>, memref<?x?xf32>, memref<?x?xf32>) -> ()
return %arg1, %arg3 : memref<?x?xf32>, memref<?x?xf32>
}

View File

@ -14,6 +14,7 @@ limitations under the License.
==============================================================================*/ ==============================================================================*/
#include "mlir-hlo/Dialect/mhlo/IR/chlo_ops.h" #include "mlir-hlo/Dialect/mhlo/IR/chlo_ops.h"
#include "mlir-hlo/Dialect/mhlo/IR/disc_ral_ops.h"
#include "mlir-hlo/Dialect/mhlo/IR/hlo_ops.h" #include "mlir-hlo/Dialect/mhlo/IR/hlo_ops.h"
#include "mlir-hlo/Dialect/mhlo/IR/lhlo_gpu_ops.h" #include "mlir-hlo/Dialect/mhlo/IR/lhlo_gpu_ops.h"
#include "mlir-hlo/Dialect/mhlo/IR/lhlo_ops.h" #include "mlir-hlo/Dialect/mhlo/IR/lhlo_ops.h"
@ -26,6 +27,7 @@ int main(int argc, char **argv) {
mlir::registerAllPasses(); mlir::registerAllPasses();
mlir::mhlo::registerAllMhloPasses(); mlir::mhlo::registerAllMhloPasses();
mlir::lmhlo::registerAllLmhloPasses(); mlir::lmhlo::registerAllLmhloPasses();
mlir::disc_ral::registerAllDiscRalPasses();
mlir::DialectRegistry registry; mlir::DialectRegistry registry;
mlir::registerAllDialects(registry); mlir::registerAllDialects(registry);
@ -33,6 +35,7 @@ int main(int argc, char **argv) {
registry.insert<mlir::chlo::HloClientDialect>(); registry.insert<mlir::chlo::HloClientDialect>();
registry.insert<mlir::lmhlo::LmhloDialect>(); registry.insert<mlir::lmhlo::LmhloDialect>();
registry.insert<mlir::lmhlo_gpu::LmhloGpuDialect>(); registry.insert<mlir::lmhlo_gpu::LmhloGpuDialect>();
registry.insert<mlir::disc_ral::RalDialect>();
return failed(mlir::MlirOptMain(argc, argv, "MLIR HLO pass driver\n", return failed(mlir::MlirOptMain(argc, argv, "MLIR HLO pass driver\n",
registry, registry,