Commit Graph

513 Commits

Author SHA1 Message Date
MercuryChen 6f34b66ae4
Split replayer code from tracer.h (#642)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature

* Refine api trace code and document
Add missing traced apis of tim::vx::Quantization

Type: Code improvement

* Split Api relayer code out of tracer.
To enable compile replayer code in machine which can't access high version boost libs.

Type: Code improvement
2023-08-25 00:41:45 +08:00
xie-oritek 265e74ff16
Add int4/uint4 definition (#638) 2023-08-22 17:37:38 +08:00
xie-oritek 54af5c2216
Add CumSum&LRN operator to trace module (#639) 2023-08-22 16:53:59 +08:00
xie-oritek bab571b569
Fix data missing when use trace::Graph::CreateTensor (#636) 2023-08-22 16:53:24 +08:00
Chen Feiyue 9bb3e7c68b
Fixed misleading test case bug in deconv1d (#633)
Correct erros of deconv1d unittest
Added hint in the header indicating that padtype is not supported yet
Added 2 cases for deconv1d

Type: Code Improvement
Issue: github issue #585

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-17 21:26:54 +08:00
MercuryChen cf2efc63fd
Refine api trace code and document (#634)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature

* Refine api trace code and document
Add missing traced apis of tim::vx::Quantization

Type: Code improvement
2023-08-17 21:16:34 +08:00
Chen Feiyue 2f018cc088
Code refinement for mean-stddev-normalization fuse (#632)
1.Added copyright  && Added reference or const reference for functions
2.Rewrite function of determing whether there is a common input
3.Use std::remove_if instead of std::find before doing erase
4.Added security check to prevent access to deleted ops

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-15 13:15:03 +08:00
Chen Feiyue af50cc5e3f
Added general Float16 support (#631)
Added Float16 type definition from third-party
Refine float16 bias handlling in conv2d
Refine float16 case in conv2d
Caution: Headers of float16 only be included when build unit_test

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-12 10:04:16 +08:00
Chen Feiyue 35e50d7692
Added op fusion for mean_stddev_normalization (#629)
Added op fusion for mean_stddev_normalization ops such as layernorm and
instance norm

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-09 22:10:45 +08:00
chxin66 bff26a32c4
fix size compute bug in lrn (#626)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-07 13:20:35 +08:00
chxin66 6a5694e557
fixed prelu layoutinfer bug & added cases (#628)
Type: bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-07 13:17:46 +08:00
zhongzhuonan f0cf45fdaa
Create self-hosted.yml (#625)
* Create self-hosted.yml

* Update self-hosted.yml
2023-07-26 13:31:07 +08:00
Sven 821864a582
Fixed IExecutable object not bind with DeviceID (#624)
If Executable object doesn't bind with a concrete DeviceID,
it will go first device by default.

When run multi executable with multi device, the behavior is not
expected. Fixed by attach device id with CompileOption.

Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>
2023-07-24 22:45:54 +08:00
chxin66 680e8d59cb
Fixed conv2d grouped_conv2d deconv2d layoutinfer bug (#622)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-24 17:10:24 +08:00
MercuryChen 315adcf076
Integrate api trace into tim-vx source as an experimental feature. (#623)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature
2023-07-19 18:40:48 +08:00
Chen Feiyue 0885a0d797
Remove confusing comment in depthwise conv test (#621)
Remove wrong layout comment for depthwise conv unit test
Add comment of layout condition in basic class for depthwise conv

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-17 09:43:34 +08:00
Chen Feiyue 62c6b6560c
Added axis param for TopK (#610)
Topk support specifying dimensions with later internal ovxlib

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-12 09:54:07 +08:00
Chen Feiyue 18749f5d05
Fixed transient deconv1d generate wrong output shape bug (#619)
Fixed bug that when deconv1d ouput is set to be transient, actual output shape will be zero at dim 1.

Reason :internal typing error
Type: Bug Fix
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-08 23:40:32 +08:00
chxin66 ea8046ec9c
Added roi_align layoutinfer & cases (#615)
* Added roi_align layoutinfer & cases

Type: New feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Update instancenorm op spec .json

Type: bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Added roi_pool layoutinfer & fixed case bug

Type: new feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

---------

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-08 23:39:56 +08:00
Chen Feiyue 32c5a61601
Update prebuilt && internal for 23Q2 release (#617)
* Update prebuilt-sdk to 6.4.15 release

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>

* Update internal to 1.1.84 rel

Update internal to SHA 1e591108dddcbf6dd88d5eef97a7d8b3ffc19ce3

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>

---------

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-08 23:38:17 +08:00
chxin66 02d6d72946
fixed yolov4 build issue (#618)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-06 09:30:24 +08:00
chxin66 5d741e8ebe
Optimize compilation process for openssl (#613)
do not rebuild when openssl lib already exist

Type: Code refine

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-03 15:48:19 +08:00
Chen Feiyue 33f3a4f176
Enable float16 bias convolution model runs on NN (#612)
Convert float16 bias tensor to float32 to meet condition of NN
convolution in driver

Caution: Clang version requires minimum 15.0

Type: Code Improvement
Issue: bugzilla id:32785 | jira id VIVD-744

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-30 09:41:28 +08:00
chxin66 34812fe40e
Added case for gather (#599)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-26 09:15:08 +08:00
chxin66 233eb439e1
Fixed viplite driver build issue (#611)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-26 09:14:42 +08:00
Chen Feiyue 75882d4195
Added new_axis_mask param for stridedslice (#600)
Add another constructor for stridedslice when new_axis_mask is set

The layout inference need to reconstruct the axis mapping when
new_axis_mask is set(TODO)

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-25 09:24:41 +08:00
Chen Feiyue d823ef6fcb
Remove unused value in op layoutinfer (#607)
Remove unused value to make sure project build successfully with higher
version compiler such as clang15

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-25 09:23:09 +08:00
chxin66 bd89faddb1
Fixed openssl android build bug (#606)
android cross build fail when not specified LOCAL_OPENSSL

Type: bug fixed

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-19 21:57:46 +08:00
chxin66 26b4e53fe7
fixed reduce layoutinfer bug (#605)
Type: Bug fixed

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-19 21:56:08 +08:00
Chen Feiyue fbfbdd7c83
Added axis support for layernorm (#602)
Layernormolization can handle non zero axis now
Added case to verify layernorm with axis 2
Modify layernorm opjson

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-15 21:45:46 +08:00
chxin66 a64a0f7379
Added a case for resize_bilinear layoutinfer (#595)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-02 07:52:06 +08:00
Chen Feiyue aa7b3a6f8f
Added api json for each op to support acuity (#596)
Record constructor form of each operation as a json file to support acuity to call
timvx op

Type:  Documentation

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-02 07:51:10 +08:00
chxin66 ea8adc456a
fixed instance norm bug & add its layoutinfer (#593)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-31 12:55:42 +08:00
chxin66 4f92e58155
optimization for tiny_yolov4 (#591)
Type: code improvment

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-23 14:28:47 +08:00
shijie001 51faf286c2
Fixed LayerNormalization eps bug (#589) 2023-05-22 14:13:44 +08:00
chxin66 99606fd891
Add a case for local response norm (#590)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-20 16:57:21 +08:00
chxin66 ddcb00c11b
Fixed bug for pad test (#588)
Type: Bug fix

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-05-20 16:56:00 +08:00
Chen Feiyue 3f83db534d
Added missed ops include header (#584)
Include cumsum, mod, maxpool3d, grucell, UnidirectionalSequanceGRU header file in
ops.h

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-16 15:59:16 +08:00
Chen Feiyue 3c372dd646
Refine UnidirectionalGRU and GRUCell (#587)
Refine unidirectional_gru and gru_cell code to avoid including ovxlib files
in header of some op
Introduce TranslateToVsibool function to support above code refinement

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-15 16:44:51 +08:00
Chen Feiyue b81f7979fa
Reload "==" operator for quantizations of two tensor (#583)
Reload operator "==" to check two quantization same or not

Type: New Feature
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-10 17:58:30 +09:00
chxin66 308a967bcf
Support build openssl from local repo (#582)
Type: Code Improvement

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-05-05 23:38:39 +08:00
SCUWQ 1543efe098
Add some tensor dtype convert APIs (#576)
For pre/post process.

Type: Code Refine

Co-authored-by: wangqian <wangqian@CNCDD9444.verisilicon.com>
2023-04-27 09:04:39 +08:00
chxin66 c688ca6e81
Fixed ovxlib build error on android (#578)
Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-04-26 16:59:52 +08:00
Chen Feiyue 3c5ee7a46e
Refine prelu layout inference (#577)
In the past we reverse all inputs to default order pv and caused
unnecessary transpose operation.
In this commit  only const slope  will be handled and do transpose if necessary.

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-04-25 11:25:55 +08:00
Chen Feiyue 3a3c9fa5fa
Update readme in ops (#575)
Added missing ops which have already supported; Changed status of some
ops.

Type: Documentation

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-04-20 13:23:56 +08:00
chxin66 f1fd2246ae
Support tensor cache while create tensor (#574)
Support tensor cache while create tensor

Tensor can be shared between different operations, if tensor have 
identical data and quantization parameter, they should share same
low-level tensor object to save memory.

In tim-vx, introduce a tensor cache which key is md5sum and value is 
low-level tensor object. If up-coming tensor have same md5sum, the
cached tensor object reused for tensor creation.

Type: New feature

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-04-19 21:31:25 +08:00
Chen Feiyue 6e38e64a1a
Update internal & prebuilt-sdk for 23Q1 release (#573)
Update internal to 0e9393dbb4f653b9dfceaeaaa920d4deb8b27077
Update prebuilt-sdk to 6.4.14 release
Update cmakefiles to support above updates

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-04-18 22:19:16 +08:00
chxin66 a32f255d7f
Modified error to warning when check consumed tensor (#572)
If graph has free INPUT or OUTPUT, modified error to
warning when check in graph compile

Type: Code refine

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-04-10 13:28:51 +08:00
liyuenan 27890719b6
Support remote platform by gRPC (#561)
* Support remote platform by gRPC

Signed-off-by: yuenan.li <yuenan.li@verisilicon.com>
2023-03-28 09:51:23 +08:00
chxin66 c2755b90ea
Fixed l2normalization layout infer bug (#570)
And added a case

Type: Bug fix

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-03-27 15:00:25 +08:00