Added op fusion for mean_stddev_normalization ops such as layernorm and
instance norm
Type: New Feature
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
If an Executable object is not bound to a concrete DeviceID,
it runs on the first device by default.
When multiple executables are run on multiple devices, this is not the
expected behavior. Fixed by attaching the device ID to CompileOption.
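
A minimal sketch of the idea, not the actual patch. CompileOption,
Executable, the device_id field and ResolveDevice below are hypothetical
stand-ins written for this note and are not the real tim::vx types:

```cpp
#include <cassert>

// Hypothetical stand-ins for illustration; not the real tim::vx types.
constexpr int kUnboundDevice = -1;

struct CompileOption {
  int device_id = kUnboundDevice;  // assumed field carrying the target device
};

struct Executable {
  CompileOption option;
};

// Picks the device an executable runs on: the one named in its
// CompileOption, or device 0 when none was set (the old implicit default).
int ResolveDevice(const Executable& exe) {
  return exe.option.device_id == kUnboundDevice ? 0 : exe.option.device_id;
}

// Two executables bound to different devices resolve to different devices
// instead of both falling back to device 0.
void DemoResolveDevice() {
  Executable a, b;
  a.option.device_id = 0;
  b.option.device_id = 1;
  assert(ResolveDevice(a) != ResolveDevice(b));
}
```

Carrying the device ID in the option keeps the binding explicit for each
executable, while an unset value still falls back to the first device.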
Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>
Convert the float16 bias tensor to float32 to meet the requirements of
NN convolution in the driver
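
For reference, an element-wise float16 to float32 widening can be
sketched as below. This is an illustration under stated assumptions, not
the code in this change: HalfToFloat and WidenBias are hypothetical
helpers, and the bias is assumed to be available as raw IEEE 754
binary16 bit patterns.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper: decodes one IEEE 754 binary16 bit pattern.
float HalfToFloat(std::uint16_t h) {
  std::uint32_t sign = (static_cast<std::uint32_t>(h) & 0x8000u) << 16;
  std::uint32_t exp = (h >> 10) & 0x1Fu;
  std::uint32_t mant = h & 0x3FFu;
  std::uint32_t bits;
  if (exp == 0 && mant == 0) {
    bits = sign;  // signed zero
  } else if (exp == 0) {
    // Subnormal half: shift the mantissa up until the implicit bit appears.
    std::uint32_t shift = 0;
    while ((mant & 0x400u) == 0) {
      mant <<= 1;
      ++shift;
    }
    mant &= 0x3FFu;
    bits = sign | ((113u - shift) << 23) | (mant << 13);
  } else if (exp == 0x1Fu) {
    bits = sign | 0x7F800000u | (mant << 13);  // infinity or NaN
  } else {
    bits = sign | ((exp + 112u) << 23) | (mant << 13);  // rebias 15 -> 127
  }
  float f;
  std::memcpy(&f, &bits, sizeof(f));
  return f;
}

// Hypothetical helper: widens a whole bias buffer to float32.
std::vector<float> WidenBias(const std::vector<std::uint16_t>& bias_fp16) {
  std::vector<float> out;
  out.reserve(bias_fp16.size());
  for (std::uint16_t h : bias_fp16) out.push_back(HalfToFloat(h));
  return out;
}

void DemoWidenBias() {
  std::vector<std::uint16_t> bias;
  bias.push_back(0x3C00);  // 1.0 in binary16
  bias.push_back(0xC000);  // -2.0 in binary16
  std::vector<float> wide = WidenBias(bias);
  assert(wide[0] == 1.0f);
  assert(wide[1] == -2.0f);
}
```

Every binary16 value is exactly representable in float32, so the
widening itself loses no precision.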
Caution: requires Clang 15.0 or newer
Type: Code Improvement
Issue: bugzilla id:32785 | jira id VIVD-744
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
Added a new tim::vx::Context::CreateGraph API that takes a CompileOption.
CompileOption currently has only one option:
relax_mode : Run float32 mode with bfloat16
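
How the driver applies relax_mode is not described here. The sketch
below only illustrates what representing a float32 value in bfloat16
costs in precision. FloatToBFloat16 and BFloat16ToFloat are hypothetical
helpers, and plain truncation is assumed for simplicity (real hardware
may round instead):

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helper: keeps the upper 16 bits of a float32 (sign,
// 8 exponent bits, 7 mantissa bits). Truncation, not rounding.
std::uint16_t FloatToBFloat16(float f) {
  std::uint32_t bits;
  std::memcpy(&bits, &f, sizeof(bits));
  return static_cast<std::uint16_t>(bits >> 16);
}

// Hypothetical helper: widens bfloat16 back to float32 by zero-filling
// the dropped mantissa bits.
float BFloat16ToFloat(std::uint16_t b) {
  std::uint32_t bits = static_cast<std::uint32_t>(b) << 16;
  float f;
  std::memcpy(&f, &bits, sizeof(f));
  return f;
}

void DemoBFloat16() {
  // Values with short mantissas survive the round trip unchanged.
  assert(BFloat16ToFloat(FloatToBFloat16(1.0f)) == 1.0f);
  // Others lose their low mantissa bits: 3.14159265f becomes 3.140625f.
  assert(BFloat16ToFloat(FloatToBFloat16(3.14159265f)) == 3.140625f);
}
```

bfloat16 keeps the float32 exponent range, so the trade-off is
precision rather than range.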
Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>
Operations are held through a shared pointer. In an application, an
operation is created as:
auto op = graph->CreateOperation();
Users naively assume the operation has been registered with the graph
and do not keep the op alive locally.
If the graph is run in a function other than the one that created the
operation, the operation will already have been destroyed.
So the operation should be stored in the graph.
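
The ownership problem can be sketched as follows. Graph and Operation
here are hypothetical stand-ins written for this note, not the real
tim::vx classes; the only point is that the graph keeps its own
shared_ptr to each op it creates:

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical stand-ins for illustration; not the real tim::vx classes.
struct Operation {
  int id = 0;
};

class Graph {
 public:
  std::shared_ptr<Operation> CreateOperation() {
    auto op = std::make_shared<Operation>();
    op->id = static_cast<int>(ops_.size());
    ops_.push_back(op);  // the graph keeps its own reference
    return op;
  }
  std::size_t OpCount() const { return ops_.size(); }
  long OwnersOf(std::size_t i) const { return ops_[i].use_count(); }

 private:
  std::vector<std::shared_ptr<Operation>> ops_;
};

// Creates an op and lets the local handle go out of scope on return.
void BuildGraph(Graph& graph) {
  auto op = graph.CreateOperation();
  (void)op;
}

// Runs in a different function from BuildGraph; the op must still exist.
void RunGraph(const Graph& graph) {
  assert(graph.OpCount() == 1);
  assert(graph.OwnersOf(0) == 1);  // only the graph holds it now
}
```

Without the push_back in CreateOperation, the op would be destroyed as
soon as BuildGraph returned, which is the failure described above.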
Signed-off-by: Jia <juku.jia@verisilicon.com>