Commit Graph

474 Commits

Author SHA1 Message Date
Chen Feiyue a24d2be9c3
Rebuild prebuil-sdk to adjust lower ubuntu env (#658)
Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-11-09 15:44:34 +08:00
Chen Feiyue bb10884f98
Added scalar type support (#655)
Added SetScalar api to support scalar input
Added 2 cases for scalar index Gather

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-11-06 09:58:03 +08:00
Chen Feiyue 1bb1e070f2
Update internal to 1.1.88 release (#657)
Internal ovxlib SHA 32fe479af5549e894bcd40de5740ae0dfd42bdb9

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-11-03 13:16:33 +08:00
xie-oritek 10081790ee
Add ScatterND_Update operator (#652)
* Add ScatterND_Update operator

* Remove ScatterNDUpdate shape param

* Rename ScatterND_Update to ScatterND_ONNX_V16

* Fix ScatterND_ONNX_V16 rename problem

---------

Co-authored-by: unknown <z0026@china.oritek.com.cn>
2023-10-11 09:12:40 +08:00
chxin66 363c369bf6
Fixed quant param lost in Bidirectional lstm (#649)
https://github.com/VeriSilicon/TIM-VX/issues/647

Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
2023-09-19 22:08:34 +08:00
Chen Feiyue 61ea0091ca
Fixed unsupported float16 bias in fc (#646)
Resolve the issue of underlying hardware not supporting float16 bias in fc
by converting bias type to float32

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-09-13 09:44:21 +08:00
Antkillerfarm 98966dac9c
build fix for export Swap Handle API (#643)
PR #635 build error fix

Type: bug fix

Signed-off-by: Tang Jing <jing.tang@verisilicon.com>
2023-08-30 14:25:45 +08:00
chxin66 01235266c5
fixed tensor cache mismatch issue (#644)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-30 14:23:20 +08:00
Zhouheng Zheng 5668856fc9
Fix the instance norm test input size bug (#645)
Only NNAPI instance norm spec have scalar gamma and beta, which can not
support by sp, rewrite it into tensor.

Type: Bug Fix

Co-authored-by: zhouheng.zheng <zhouheng.zheng@ouotlook.com>
2023-08-30 14:10:23 +08:00
Antkillerfarm 3bbe2ef9ec
export Swap Handle API (#635)
export vsi_nn_SwapHandle & vsi_nn_SwapTensorHandle &
vsi_nn_SwapTensorHandleWithCache for TIM-VX usage.

Type: New Feature

Signed-off-by: Tang Jing <jing.tang@verisilicon.com>
2023-08-28 09:15:43 +08:00
xie-oritek 7fc264a9e6
Refine Tensor::SetShape api to avoid compile warning using const ref (#640)
* Move int4/uint4 to the end of DataType

* Refine api Tensor::SetShape, using const ref avoid compile warning
2023-08-25 00:47:24 +08:00
MercuryChen 6f34b66ae4
Split replayer code from tracer.h (#642)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature

* Refine api trace code and document
Add missing traced apis of tim::vx::Quantization

Type: Code improvement

* Split Api relayer code out of tracer.
To enable compile replayer code in machine which can't access high version boost libs.

Type: Code improvement
2023-08-25 00:41:45 +08:00
xie-oritek 265e74ff16
Add int4/uint4 definition (#638) 2023-08-22 17:37:38 +08:00
xie-oritek 54af5c2216
Add CumSum&LRN operator to trace module (#639) 2023-08-22 16:53:59 +08:00
xie-oritek bab571b569
Fix data missing when use trace::Graph::CreateTensor (#636) 2023-08-22 16:53:24 +08:00
Chen Feiyue 9bb3e7c68b
Fixed misleading test case bug in deconv1d (#633)
Correct erros of deconv1d unittest
Added hint in the header indicating that padtype is not supported yet
Added 2 cases for deconv1d

Type: Code Improvement
Issue: github issue #585

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-17 21:26:54 +08:00
MercuryChen cf2efc63fd
Refine api trace code and document (#634)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature

* Refine api trace code and document
Add missing traced apis of tim::vx::Quantization

Type: Code improvement
2023-08-17 21:16:34 +08:00
Chen Feiyue 2f018cc088
Code refinement for mean-stddev-normalization fuse (#632)
1.Added copyright  && Added reference or const reference for functions
2.Rewrite function of determing whether there is a common input
3.Use std::remove_if instead of std::find before doing erase
4.Added security check to prevent access to deleted ops

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-15 13:15:03 +08:00
Chen Feiyue af50cc5e3f
Added general Float16 support (#631)
Added Float16 type definition from third-party
Refine float16 bias handlling in conv2d
Refine float16 case in conv2d
Caution: Headers of float16 only be included when build unit_test

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-12 10:04:16 +08:00
Chen Feiyue 35e50d7692
Added op fusion for mean_stddev_normalization (#629)
Added op fusion for mean_stddev_normalization ops such as layernorm and
instance norm

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-09 22:10:45 +08:00
chxin66 bff26a32c4
fix size compute bug in lrn (#626)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-07 13:20:35 +08:00
chxin66 6a5694e557
fixed prelu layoutinfer bug & added cases (#628)
Type: bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-07 13:17:46 +08:00
zhongzhuonan f0cf45fdaa
Create self-hosted.yml (#625)
* Create self-hosted.yml

* Update self-hosted.yml
2023-07-26 13:31:07 +08:00
Sven 821864a582
Fixed IExecutable object not bind with DeviceID (#624)
If Executable object doesn't bind with a concrete DeviceID,
it will go first device by default.

When run multi executable with multi device, the behavior is not
expected. Fixed by attach device id with CompileOption.

Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>
2023-07-24 22:45:54 +08:00
chxin66 680e8d59cb
Fixed conv2d grouped_conv2d deconv2d layoutinfer bug (#622)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-24 17:10:24 +08:00
MercuryChen 315adcf076
Integrate api trace into tim-vx source as an experimental feature. (#623)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature
2023-07-19 18:40:48 +08:00
Chen Feiyue 0885a0d797
Remove confusing comment in depthwise conv test (#621)
Remove wrong layout comment for depthwise conv unit test
Add comment of layout condition in basic class for depthwise conv

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-17 09:43:34 +08:00
Chen Feiyue 62c6b6560c
Added axis param for TopK (#610)
Topk support specifying dimensions with later internal ovxlib

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-12 09:54:07 +08:00
Chen Feiyue 18749f5d05
Fixed transient deconv1d generate wrong output shape bug (#619)
Fixed bug that when deconv1d ouput is set to be transient, actual output shape will be zero at dim 1.

Reason :internal typing error
Type: Bug Fix
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-08 23:40:32 +08:00
chxin66 ea8046ec9c
Added roi_align layoutinfer & cases (#615)
* Added roi_align layoutinfer & cases

Type: New feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Update instancenorm op spec .json

Type: bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Added roi_pool layoutinfer & fixed case bug

Type: new feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

---------

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-08 23:39:56 +08:00
Chen Feiyue 32c5a61601
Update prebuilt && internal for 23Q2 release (#617)
* Update prebuilt-sdk to 6.4.15 release

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>

* Update internal to 1.1.84 rel

Update internal to SHA 1e591108dddcbf6dd88d5eef97a7d8b3ffc19ce3

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>

---------

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-08 23:38:17 +08:00
chxin66 02d6d72946
fixed yolov4 build issue (#618)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-06 09:30:24 +08:00
chxin66 5d741e8ebe
Optimize compilation process for openssl (#613)
do not rebuild when openssl lib already exist

Type: Code refine

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-03 15:48:19 +08:00
Chen Feiyue 33f3a4f176
Enable float16 bias convolution model runs on NN (#612)
Convert float16 bias tensor to float32 to meet condition of NN
convolution in driver

Caution: Clang version requires minimum 15.0

Type: Code Improvement
Issue: bugzilla id:32785 | jira id VIVD-744

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-30 09:41:28 +08:00
chxin66 34812fe40e
Added case for gather (#599)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-26 09:15:08 +08:00
chxin66 233eb439e1
Fixed viplite driver build issue (#611)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-26 09:14:42 +08:00
Chen Feiyue 75882d4195
Added new_axis_mask param for stridedslice (#600)
Add another constructor for stridedslice when new_axis_mask is set

The layout inference need to reconstruct the axis mapping when
new_axis_mask is set(TODO)

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-25 09:24:41 +08:00
Chen Feiyue d823ef6fcb
Remove unused value in op layoutinfer (#607)
Remove unused value to make sure project build successfully with higher
version compiler such as clang15

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-25 09:23:09 +08:00
chxin66 bd89faddb1
Fixed openssl android build bug (#606)
android cross build fail when not specified LOCAL_OPENSSL

Type: bug fixed

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-19 21:57:46 +08:00
chxin66 26b4e53fe7
fixed reduce layoutinfer bug (#605)
Type: Bug fixed

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-19 21:56:08 +08:00
Chen Feiyue fbfbdd7c83
Added axis support for layernorm (#602)
Layernormolization can handle non zero axis now
Added case to verify layernorm with axis 2
Modify layernorm opjson

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-15 21:45:46 +08:00
chxin66 a64a0f7379
Added a case for resize_bilinear layoutinfer (#595)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-02 07:52:06 +08:00
Chen Feiyue aa7b3a6f8f
Added api json for each op to support acuity (#596)
Record constructor form of each operation as a json file to support acuity to call
timvx op

Type:  Documentation

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-02 07:51:10 +08:00
chxin66 ea8adc456a
fixed instance norm bug & add its layoutinfer (#593)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-31 12:55:42 +08:00
chxin66 4f92e58155
optimization for tiny_yolov4 (#591)
Type: code improvment

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-23 14:28:47 +08:00
shijie001 51faf286c2
Fixed LayerNormalization eps bug (#589) 2023-05-22 14:13:44 +08:00
chxin66 99606fd891
Add a case for local response norm (#590)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-20 16:57:21 +08:00
chxin66 ddcb00c11b
Fixed bug for pad test (#588)
Type: Bug fix

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-05-20 16:56:00 +08:00
Chen Feiyue 3f83db534d
Added missed ops include header (#584)
Include cumsum, mod, maxpool3d, grucell, UnidirectionalSequanceGRU header file in
ops.h

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-16 15:59:16 +08:00
Chen Feiyue 3c372dd646
Refine UnidirectionalGRU and GRUCell (#587)
Refine unidirectional_gru and gru_cell code to avoid including ovxlib files
in header of some op
Introduce TranslateToVsibool function to support above code refinement

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-15 16:44:51 +08:00