Commit Graph

456 Commits

Author SHA1 Message Date
Chen Feiyue af50cc5e3f
Added general Float16 support (#631)
Added Float16 type definition from third-party
Refine float16 bias handlling in conv2d
Refine float16 case in conv2d
Caution: Headers of float16 only be included when build unit_test

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-12 10:04:16 +08:00
Chen Feiyue 35e50d7692
Added op fusion for mean_stddev_normalization (#629)
Added op fusion for mean_stddev_normalization ops such as layernorm and
instance norm

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-08-09 22:10:45 +08:00
chxin66 bff26a32c4
fix size compute bug in lrn (#626)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-07 13:20:35 +08:00
chxin66 6a5694e557
fixed prelu layoutinfer bug & added cases (#628)
Type: bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-08-07 13:17:46 +08:00
zhongzhuonan f0cf45fdaa
Create self-hosted.yml (#625)
* Create self-hosted.yml

* Update self-hosted.yml
2023-07-26 13:31:07 +08:00
Sven 821864a582
Fixed IExecutable object not bind with DeviceID (#624)
If Executable object doesn't bind with a concrete DeviceID,
it will go first device by default.

When run multi executable with multi device, the behavior is not
expected. Fixed by attach device id with CompileOption.

Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>
2023-07-24 22:45:54 +08:00
chxin66 680e8d59cb
Fixed conv2d grouped_conv2d deconv2d layoutinfer bug (#622)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-24 17:10:24 +08:00
MercuryChen 315adcf076
Integrate api trace into tim-vx source as an experimental feature. (#623)
* Add support for different input dtype of MaxPoolGrad.

Type: Code improvement

* Integrate api trace into tim-vx source code, as part of experimeantal.

Type: New Feature
2023-07-19 18:40:48 +08:00
Chen Feiyue 0885a0d797
Remove confusing comment in depthwise conv test (#621)
Remove wrong layout comment for depthwise conv unit test
Add comment of layout condition in basic class for depthwise conv

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-17 09:43:34 +08:00
Chen Feiyue 62c6b6560c
Added axis param for TopK (#610)
Topk support specifying dimensions with later internal ovxlib

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-12 09:54:07 +08:00
Chen Feiyue 18749f5d05
Fixed transient deconv1d generate wrong output shape bug (#619)
Fixed bug that when deconv1d ouput is set to be transient, actual output shape will be zero at dim 1.

Reason :internal typing error
Type: Bug Fix
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-08 23:40:32 +08:00
chxin66 ea8046ec9c
Added roi_align layoutinfer & cases (#615)
* Added roi_align layoutinfer & cases

Type: New feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Update instancenorm op spec .json

Type: bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>

* Added roi_pool layoutinfer & fixed case bug

Type: new feature

Signed-off-by: Chen <jack.chen@verisilicon.com>

---------

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-08 23:39:56 +08:00
Chen Feiyue 32c5a61601
Update prebuilt && internal for 23Q2 release (#617)
* Update prebuilt-sdk to 6.4.15 release

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>

* Update internal to 1.1.84 rel

Update internal to SHA 1e591108dddcbf6dd88d5eef97a7d8b3ffc19ce3

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>

---------

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-07-08 23:38:17 +08:00
chxin66 02d6d72946
fixed yolov4 build issue (#618)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-06 09:30:24 +08:00
chxin66 5d741e8ebe
Optimize compilation process for openssl (#613)
do not rebuild when openssl lib already exist

Type: Code refine

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-07-03 15:48:19 +08:00
Chen Feiyue 33f3a4f176
Enable float16 bias convolution model runs on NN (#612)
Convert float16 bias tensor to float32 to meet condition of NN
convolution in driver

Caution: Clang version requires minimum 15.0

Type: Code Improvement
Issue: bugzilla id:32785 | jira id VIVD-744

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-30 09:41:28 +08:00
chxin66 34812fe40e
Added case for gather (#599)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-26 09:15:08 +08:00
chxin66 233eb439e1
Fixed viplite driver build issue (#611)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-26 09:14:42 +08:00
Chen Feiyue 75882d4195
Added new_axis_mask param for stridedslice (#600)
Add another constructor for stridedslice when new_axis_mask is set

The layout inference need to reconstruct the axis mapping when
new_axis_mask is set(TODO)

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-25 09:24:41 +08:00
Chen Feiyue d823ef6fcb
Remove unused value in op layoutinfer (#607)
Remove unused value to make sure project build successfully with higher
version compiler such as clang15

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-25 09:23:09 +08:00
chxin66 bd89faddb1
Fixed openssl android build bug (#606)
android cross build fail when not specified LOCAL_OPENSSL

Type: bug fixed

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-19 21:57:46 +08:00
chxin66 26b4e53fe7
fixed reduce layoutinfer bug (#605)
Type: Bug fixed

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-19 21:56:08 +08:00
Chen Feiyue fbfbdd7c83
Added axis support for layernorm (#602)
Layernormolization can handle non zero axis now
Added case to verify layernorm with axis 2
Modify layernorm opjson

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-15 21:45:46 +08:00
chxin66 a64a0f7379
Added a case for resize_bilinear layoutinfer (#595)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-06-02 07:52:06 +08:00
Chen Feiyue aa7b3a6f8f
Added api json for each op to support acuity (#596)
Record constructor form of each operation as a json file to support acuity to call
timvx op

Type:  Documentation

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-06-02 07:51:10 +08:00
chxin66 ea8adc456a
fixed instance norm bug & add its layoutinfer (#593)
Type: Bug fix

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-31 12:55:42 +08:00
chxin66 4f92e58155
optimization for tiny_yolov4 (#591)
Type: code improvment

Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-23 14:28:47 +08:00
shijie001 51faf286c2
Fixed LayerNormalization eps bug (#589) 2023-05-22 14:13:44 +08:00
chxin66 99606fd891
Add a case for local response norm (#590)
Signed-off-by: Chen <jack.chen@verisilicon.com>
Co-authored-by: Chen <jack.chen@verisilicon.com>
2023-05-20 16:57:21 +08:00
chxin66 ddcb00c11b
Fixed bug for pad test (#588)
Type: Bug fix

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-05-20 16:56:00 +08:00
Chen Feiyue 3f83db534d
Added missed ops include header (#584)
Include cumsum, mod, maxpool3d, grucell, UnidirectionalSequanceGRU header file in
ops.h

Type:  Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-16 15:59:16 +08:00
Chen Feiyue 3c372dd646
Refine UnidirectionalGRU and GRUCell (#587)
Refine unidirectional_gru and gru_cell code to avoid including ovxlib files
in header of some op
Introduce TranslateToVsibool function to support above code refinement

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-15 16:44:51 +08:00
Chen Feiyue b81f7979fa
Reload "==" operator for quantizations of two tensor (#583)
Reload operator "==" to check two quantization same or not

Type: New Feature
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-05-10 17:58:30 +09:00
chxin66 308a967bcf
Support build openssl from local repo (#582)
Type: Code Improvement

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-05-05 23:38:39 +08:00
SCUWQ 1543efe098
Add some tensor dtype convert APIs (#576)
For pre/post process.

Type: Code Refine

Co-authored-by: wangqian <wangqian@CNCDD9444.verisilicon.com>
2023-04-27 09:04:39 +08:00
chxin66 c688ca6e81
Fixed ovxlib build error on android (#578)
Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-04-26 16:59:52 +08:00
Chen Feiyue 3c5ee7a46e
Refine prelu layout inference (#577)
In the past we reverse all inputs to default order pv and caused
unnecessary transpose operation.
In this commit  only const slope  will be handled and do transpose if necessary.

Type: Code Improvement

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-04-25 11:25:55 +08:00
Chen Feiyue 3a3c9fa5fa
Update readme in ops (#575)
Added missing ops which have already supported; Changed status of some
ops.

Type: Documentation

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-04-20 13:23:56 +08:00
chxin66 f1fd2246ae
Support tensor cache while create tensor (#574)
Support tensor cache while create tensor

Tensor can be shared between different operations, if tensor have 
identical data and quantization parameter, they should share same
low-level tensor object to save memory.

In tim-vx, introduce a tensor cache which key is md5sum and value is 
low-level tensor object. If up-coming tensor have same md5sum, the
cached tensor object reused for tensor creation.

Type: New feature

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-04-19 21:31:25 +08:00
Chen Feiyue 6e38e64a1a
Update internal & prebuilt-sdk for 23Q1 release (#573)
Update internal to 0e9393dbb4f653b9dfceaeaaa920d4deb8b27077
Update prebuilt-sdk to 6.4.14 release
Update cmakefiles to support above updates

Type: New Feature

Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-04-18 22:19:16 +08:00
chxin66 a32f255d7f
Modified error to warning when check consumed tensor (#572)
If graph has free INPUT or OUTPUT, modified error to
warning when check in graph compile

Type: Code refine

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-04-10 13:28:51 +08:00
liyuenan 27890719b6
Support remote platform by gRPC (#561)
* Support remote platform by gRPC

Signed-off-by: yuenan.li <yuenan.li@verisilicon.com>
2023-03-28 09:51:23 +08:00
chxin66 c2755b90ea
Fixed l2normalization layout infer bug (#570)
And added a case

Type: Bug fix

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-03-27 15:00:25 +08:00
Zhouheng Zheng e49f67b840
Remove tensor GetDataRef api (#569)
Co-authored-by: zhouheng.zheng <zhouheng.zheng@ouotlook.com>
2023-03-23 21:35:30 +08:00
chxin66 6424ef104e
Fixed the IOtensor order difference between src_graph and infer_graph
* Fixed the IOtensor order difference between src_graph and infer_graph

Graph Input/Output tensor sequence may changed after graph
transformation(layout infer), it is difficult to get the IO mapping
between original graph and final graph.

Clients such as the Android Support Library create tensors using the
original input/output order, which may not be the same as the input
order of src_graph, the data can not be setup correctly.

Solution:
Decide the order of inputs/outputs while creating tensor not at binding to
operation. The order of binding could be change in each transform.

Type:Code improvement

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>

* Fixed maxpoolgrad maxpoolwithargmax2 cases

Some tensors created with wrong attr

Type: Bug fix

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>

---------

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
Co-authored-by: Chen Xin <jack.chen@verisilicon.com>
2023-03-21 09:21:15 +08:00
zhouheng.zheng 958b26e499 Fix mirror pad param mismatch 2023-03-21 09:20:00 +08:00
Chen Xin f0a0f1728a Added case for hardswish
Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
2023-03-06 09:37:13 +08:00
Chen Xin e71d537042 Fixed deconv2d layout infer bug
Type: Bug fix

Signed-off-by: Chen Xin <jack.chen@verisilicon.com>
2023-02-20 13:06:24 +08:00
ZhangXiang 1c6041c394 Introduce CMAKE option TIM_VX_DBG_DISABLE_TENSOR_HNDL=OFF
Enable/Disable tensorFromHandle usage

Signed-off-by: ZhangXiang <Xiang.Zhang@verisilicon.com>
2023-02-09 14:31:32 +08:00
Feiyue Chen ea4ba66b94 Refine Depth2Space op
Change default mode from CRD_mode to DCR_mode

Type: Code Improvement
Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>
2023-02-09 13:19:57 +08:00