Commit Graph

192 Commits

Author SHA1 Message Date
Chen Feiyue bb10884f98
Added scalar type support (#655)
Added SetScalar api to support scalar input
Added 2 cases for scalar index Gather

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-11-06 09:58:03 +08:00
xie-oritek 10081790ee
Add ScatterND_Update operator (#652)
* Add ScatterND_Update operator

* Remove ScatterNDUpdate shape param

* Rename ScatterND_Update to ScatterND_ONNX_V16

* Fix ScatterND_ONNX_V16 rename problem

---------

Co-authored-by: unknown <z0026@china.oritek.com.cn>
2023-10-11 09:12:40 +08:00
Chen Feiyue 61ea0091ca
Fixed unsupported float16 bias in fc (#646)
Resolve the issue of underlying hardware not supporting float16 bias in fc
by converting bias type to float32

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-09-13 09:44:21 +08:00
Antkillerfarm 98966dac9c
build fix for export Swap Handle API (#643)
PR #635 build error fix

Type: bug fix

Signed-off-by: Tang Jing <jing.tang@verisilicon.com>
2023-08-30 14:25:45 +08:00
chxin66 01235266c5
fixed tensor cache mismatch issue (#644)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-30 14:23:20 +08:00
Antkillerfarm 3bbe2ef9ec
export Swap Handle API (#635)
export vsi_nn_SwapHandle & vsi_nn_SwapTensorHandle &
vsi_nn_SwapTensorHandleWithCache for TIM-VX usage.

Type: New Feature

Signed-off-by: Tang Jing <jing.tang@verisilicon.com>
2023-08-28 09:15:43 +08:00
xie-oritek 7fc264a9e6
Refine Tensor::SetShape api to avoid compile warning using const ref (#640)
* Move int4/uint4 to the end of DataType

* Refine api Tensor::SetShape, using const ref avoid compile warning
2023-08-25 00:47:24 +08:00
MercuryChen 6f34b66ae4
Split replayer code from tracer.h (#642)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature

* Refine api trace code and document
Add missing traced apis of tim::vx::Quantization

Type: Code improvement

* Split Api relayer code out of tracer.
To enable compile replayer code in machine which can't access high version boost libs.

Type: Code improvement
2023-08-25 00:41:45 +08:00
xie-oritek 265e74ff16
Add int4/uint4 definition (#638) 2023-08-22 17:37:38 +08:00
xie-oritek 54af5c2216
Add CumSum&LRN operator to trace module (#639) 2023-08-22 16:53:59 +08:00
xie-oritek bab571b569
Fix data missing when use trace::Graph::CreateTensor (#636) 2023-08-22 16:53:24 +08:00
Chen Feiyue 9bb3e7c68b
Fixed misleading test case bug in deconv1d (#633)
Correct erros of deconv1d unittest
Added hint in the header indicating that padtype is not supported yet
Added 2 cases for deconv1d

Type: Code Improvement
Issue: github issue #585

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-17 21:26:54 +08:00
MercuryChen cf2efc63fd
Refine api trace code and document (#634)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature

* Refine api trace code and document
Add missing traced apis of tim::vx::Quantization

Type: Code improvement
2023-08-17 21:16:34 +08:00
Chen Feiyue af50cc5e3f
Added general Float16 support (#631)
Added Float16 type definition from third-party
Refine float16 bias handlling in conv2d
Refine float16 case in conv2d
Caution: Headers of float16 only be included when build unit_test

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-12 10:04:16 +08:00
Chen Feiyue 35e50d7692
Added op fusion for mean_stddev_normalization (#629)
Added op fusion for mean_stddev_normalization ops such as layernorm and
instance norm

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-09 22:10:45 +08:00
Sven 821864a582
Fixed IExecutable object not bind with DeviceID (#624)
If Executable object doesn't bind with a concrete DeviceID,
it will go first device by default.

When run multi executable with multi device, the behavior is not
expected. Fixed by attach device id with CompileOption.

Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>
2023-07-24 22:45:54 +08:00
MercuryChen 315adcf076
Integrate api trace into tim-vx source as an experimental feature. (#623)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature
2023-07-19 18:40:48 +08:00
Chen Feiyue 0885a0d797
Remove confusing comment in depthwise conv test (#621)
Remove wrong layout comment for depthwise conv unit test
Add comment of layout condition in basic class for depthwise conv

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-17 09:43:34 +08:00
Chen Feiyue 62c6b6560c
Added axis param for TopK (#610)
Topk support specifying dimensions with later internal ovxlib

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-12 09:54:07 +08:00
chxin66 ea8046ec9c
Added roi_align layoutinfer & cases (#615)
* Added roi_align layoutinfer & cases

Type: New feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Update instancenorm op spec .json

Type: bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Added roi_pool layoutinfer & fixed case bug

Type: new feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

---------

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-08 23:39:56 +08:00
Chen Feiyue 33f3a4f176
Enable float16 bias convolution model runs on NN (#612)
Convert float16 bias tensor to float32 to meet condition of NN
convolution in driver

Caution: Clang version requires minimum 15.0

Type: Code Improvement
Issue: bugzilla id:32785 | jira id VIVD-744

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-30 09:41:28 +08:00
chxin66 34812fe40e
Added case for gather (#599)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-26 09:15:08 +08:00
Chen Feiyue 75882d4195
Added new_axis_mask param for stridedslice (#600)
Add another constructor for stridedslice when new_axis_mask is set

The layout inference need to reconstruct the axis mapping when
new_axis_mask is set(TODO)

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-25 09:24:41 +08:00
Chen Feiyue fbfbdd7c83
Added axis support for layernorm (#602)
Layernormolization can handle non zero axis now
Added case to verify layernorm with axis 2
Modify layernorm opjson

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-15 21:45:46 +08:00
Chen Feiyue aa7b3a6f8f
Added api json for each op to support acuity (#596)
Record constructor form of each operation as a json file to support acuity to call
timvx op

Type:  Documentation

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-02 07:51:10 +08:00
chxin66 ea8adc456a
fixed instance norm bug & add its layoutinfer (#593)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-31 12:55:42 +08:00
chxin66 4f92e58155
optimization for tiny_yolov4 (#591)
Type: code improvment

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-23 14:28:47 +08:00
shijie001 51faf286c2
Fixed LayerNormalization eps bug (#589) 2023-05-22 14:13:44 +08:00
Chen Feiyue 3f83db534d
Added missed ops include header (#584)
Include cumsum, mod, maxpool3d, grucell, UnidirectionalSequanceGRU header file in
ops.h

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-16 15:59:16 +08:00
Chen Feiyue 3c372dd646
Refine UnidirectionalGRU and GRUCell (#587)
Refine unidirectional_gru and gru_cell code to avoid including ovxlib files
in header of some op
Introduce TranslateToVsibool function to support above code refinement

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-15 16:44:51 +08:00
Chen Feiyue b81f7979fa
Reload "==" operator for quantizations of two tensor (#583)
Reload operator "==" to check two quantization same or not

Type: New Feature
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-10 17:58:30 +09:00
SCUWQ 1543efe098
Add some tensor dtype convert APIs (#576)
For pre/post process.

Type: Code Refine

Co-authored-by: wangqian <wangqian@CNCDD9444.verisilicon.com>
2023-04-27 09:04:39 +08:00
liyuenan 27890719b6
Support remote platform by gRPC (#561)
* Support remote platform by gRPC

Signed-off-by: yuenan.li <yuenan.li@verisilicon.com>
2023-03-28 09:51:23 +08:00
Zhouheng Zheng e49f67b840
Remove tensor GetDataRef api (#569)
Co-authored-by: zhouheng.zheng <zhouheng.zheng@ouotlook.com>
2023-03-23 21:35:30 +08:00
Tang d778dfb82d update copyright information 2023-01-20 12:49:48 +08:00
meseraph 20f759e58a fix depth2space mode enum 2023-01-17 22:44:44 +08:00
meseraph cc34b5f0ea mapped pool1d 2022-12-30 10:52:28 +08:00
Chen Xin 7582b57edc Added pad_v2 & pad_v2 layout infer
And added 4 rank case

Type: Added new op
Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
2022-12-16 15:03:58 +08:00
Feiyue Chen 8d8f4b6e68 Added EmbeddingLookup & deprecate LshProjection
Added EmbeddingLookup and unit test
Changed LshProjection op status to Deprecated

Type: New Feature
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2022-12-05 09:59:56 +08:00
Qin.Chen 13da73bbe3 Fix maxpoolgrad, hide unused pool value output
Type: Bug Fix
2022-12-01 15:49:38 +08:00
Feiyue Chen dd7cd2504c Added HashtableLookup Op
Added HashtableLookup Op and unit test

Type: New Feature
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2022-12-01 10:55:21 +08:00
Qin.Chen 8cd5bd93ce Add BUILD_WITH_BAZEL option, marco of VSI_FEAT_OP_XXX should behind headers now. 2022-11-22 21:39:02 +08:00
meseraph 883334e1bb add rnn 2022-11-16 13:33:39 +08:00
Feiyue Chen 11fd278d7a Fixed BidirectionalSequenceLSTM bug
Fixed input error of  the backward direction
Fixed golden error of unit test

Type: Bug Fix
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2022-11-16 13:31:23 +08:00
Chen Xin 3fed6d6757 fixed bug when broadcast dimensions is negative
Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
2022-10-08 21:42:02 +08:00
Chen Xin 72f2c5b69e Supported composed layout infer & added unit test
Fixed fc layout infer in rnncell layout infer
2022-09-26 14:29:46 +08:00
Feiyue Chen 1802e558ad modified cumsum header && resolve conflict in README.md 2022-09-26 14:27:48 +08:00
Feiyue Chen 264e491d2a added cumsum op & added handle api after BindInput 2022-09-26 14:27:48 +08:00
Feiyue Chen 9cb37b920f added MaxPool3d op 2022-09-26 13:32:56 +08:00
Feiyue Chen 8b8d09aea3 added Rcp op & modified test_utils 2022-09-22 12:15:02 +08:00