TIM-VX

Commit Graph

Author	SHA1	Message	Date
Chen Feiyue	af50cc5e3f	Added general Float16 support (#631 ) Added Float16 type definition from third-party Refine float16 bias handlling in conv2d Refine float16 case in conv2d Caution: Headers of float16 only be included when build unit_test Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-12 10:04:16 +08:00
Chen Feiyue	35e50d7692	Added op fusion for mean_stddev_normalization (#629 ) Added op fusion for mean_stddev_normalization ops such as layernorm and instance norm Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-09 22:10:45 +08:00
chxin66	bff26a32c4	fix size compute bug in lrn (#626 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-08-07 13:20:35 +08:00
chxin66	6a5694e557	fixed prelu layoutinfer bug & added cases (#628 ) Type: bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-08-07 13:17:46 +08:00
zhongzhuonan	f0cf45fdaa	Create self-hosted.yml (#625 ) * Create self-hosted.yml * Update self-hosted.yml	2023-07-26 13:31:07 +08:00
Sven	821864a582	Fixed IExecutable object not bind with DeviceID (#624 ) If Executable object doesn't bind with a concrete DeviceID, it will go first device by default. When run multi executable with multi device, the behavior is not expected. Fixed by attach device id with CompileOption. Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>	2023-07-24 22:45:54 +08:00
chxin66	680e8d59cb	Fixed conv2d grouped_conv2d deconv2d layoutinfer bug (#622 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-24 17:10:24 +08:00
MercuryChen	315adcf076	Integrate api trace into tim-vx source as an experimental feature. (#623 ) * Add support for different input dtype of MaxPoolGrad. Type: Code improvement * Integrate api trace into tim-vx source code, as part of experimeantal. Type: New Feature	2023-07-19 18:40:48 +08:00
Chen Feiyue	0885a0d797	Remove confusing comment in depthwise conv test (#621 ) Remove wrong layout comment for depthwise conv unit test Add comment of layout condition in basic class for depthwise conv Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-17 09:43:34 +08:00
Chen Feiyue	62c6b6560c	Added axis param for TopK (#610 ) Topk support specifying dimensions with later internal ovxlib Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-12 09:54:07 +08:00
Chen Feiyue	18749f5d05	Fixed transient deconv1d generate wrong output shape bug (#619 ) Fixed bug that when deconv1d ouput is set to be transient, actual output shape will be zero at dim 1. Reason :internal typing error Type: Bug Fix Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-08 23:40:32 +08:00
chxin66	ea8046ec9c	Added roi_align layoutinfer & cases (#615 ) * Added roi_align layoutinfer & cases Type: New feature Signed-off-by: Chen <jack.chen@verisilicon.com> * Update instancenorm op spec .json Type: bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> * Added roi_pool layoutinfer & fixed case bug Type: new feature Signed-off-by: Chen <jack.chen@verisilicon.com> --------- Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-08 23:39:56 +08:00
Chen Feiyue	32c5a61601	Update prebuilt && internal for 23Q2 release (#617 ) * Update prebuilt-sdk to 6.4.15 release Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com> * Update internal to 1.1.84 rel Update internal to SHA 1e591108dddcbf6dd88d5eef97a7d8b3ffc19ce3 Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com> --------- Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-08 23:38:17 +08:00
chxin66	02d6d72946	fixed yolov4 build issue (#618 ) Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-06 09:30:24 +08:00
chxin66	5d741e8ebe	Optimize compilation process for openssl (#613 ) do not rebuild when openssl lib already exist Type: Code refine Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-03 15:48:19 +08:00
Chen Feiyue	33f3a4f176	Enable float16 bias convolution model runs on NN (#612 ) Convert float16 bias tensor to float32 to meet condition of NN convolution in driver Caution: Clang version requires minimum 15.0 Type: Code Improvement Issue: bugzilla id:32785 \| jira id VIVD-744 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-30 09:41:28 +08:00
chxin66	34812fe40e	Added case for gather (#599 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-26 09:15:08 +08:00
chxin66	233eb439e1	Fixed viplite driver build issue (#611 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-26 09:14:42 +08:00
Chen Feiyue	75882d4195	Added new_axis_mask param for stridedslice (#600 ) Add another constructor for stridedslice when new_axis_mask is set The layout inference need to reconstruct the axis mapping when new_axis_mask is set(TODO) Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-25 09:24:41 +08:00
Chen Feiyue	d823ef6fcb	Remove unused value in op layoutinfer (#607 ) Remove unused value to make sure project build successfully with higher version compiler such as clang15 Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-25 09:23:09 +08:00
chxin66	bd89faddb1	Fixed openssl android build bug (#606 ) android cross build fail when not specified LOCAL_OPENSSL Type: bug fixed Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-19 21:57:46 +08:00
chxin66	26b4e53fe7	fixed reduce layoutinfer bug (#605 ) Type: Bug fixed Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-19 21:56:08 +08:00
Chen Feiyue	fbfbdd7c83	Added axis support for layernorm (#602 ) Layernormolization can handle non zero axis now Added case to verify layernorm with axis 2 Modify layernorm opjson Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-15 21:45:46 +08:00
chxin66	a64a0f7379	Added a case for resize_bilinear layoutinfer (#595 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-02 07:52:06 +08:00
Chen Feiyue	aa7b3a6f8f	Added api json for each op to support acuity (#596 ) Record constructor form of each operation as a json file to support acuity to call timvx op Type: Documentation Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-02 07:51:10 +08:00
chxin66	ea8adc456a	fixed instance norm bug & add its layoutinfer (#593 ) Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-05-31 12:55:42 +08:00
chxin66	4f92e58155	optimization for tiny_yolov4 (#591 ) Type: code improvment Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-05-23 14:28:47 +08:00
shijie001	51faf286c2	Fixed LayerNormalization eps bug (#589 )	2023-05-22 14:13:44 +08:00
chxin66	99606fd891	Add a case for local response norm (#590 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-05-20 16:57:21 +08:00
chxin66	ddcb00c11b	Fixed bug for pad test (#588 ) Type: Bug fix Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-05-20 16:56:00 +08:00
Chen Feiyue	3f83db534d	Added missed ops include header (#584 ) Include cumsum, mod, maxpool3d, grucell, UnidirectionalSequanceGRU header file in ops.h Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-05-16 15:59:16 +08:00
Chen Feiyue	3c372dd646	Refine UnidirectionalGRU and GRUCell (#587 ) Refine unidirectional_gru and gru_cell code to avoid including ovxlib files in header of some op Introduce TranslateToVsibool function to support above code refinement Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-05-15 16:44:51 +08:00
Chen Feiyue	b81f7979fa	Reload "==" operator for quantizations of two tensor (#583 ) Reload operator "==" to check two quantization same or not Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-05-10 17:58:30 +09:00
chxin66	308a967bcf	Support build openssl from local repo (#582 ) Type: Code Improvement Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-05-05 23:38:39 +08:00
SCUWQ	1543efe098	Add some tensor dtype convert APIs (#576 ) For pre/post process. Type: Code Refine Co-authored-by: wangqian <wangqian@CNCDD9444.verisilicon.com>	2023-04-27 09:04:39 +08:00
chxin66	c688ca6e81	Fixed ovxlib build error on android (#578 ) Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-04-26 16:59:52 +08:00
Chen Feiyue	3c5ee7a46e	Refine prelu layout inference (#577 ) In the past we reverse all inputs to default order pv and caused unnecessary transpose operation. In this commit only const slope will be handled and do transpose if necessary. Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-04-25 11:25:55 +08:00
Chen Feiyue	3a3c9fa5fa	Update readme in ops (#575 ) Added missing ops which have already supported; Changed status of some ops. Type: Documentation Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-04-20 13:23:56 +08:00
chxin66	f1fd2246ae	Support tensor cache while create tensor (#574 ) Support tensor cache while create tensor Tensor can be shared between different operations, if tensor have identical data and quantization parameter, they should share same low-level tensor object to save memory. In tim-vx, introduce a tensor cache which key is md5sum and value is low-level tensor object. If up-coming tensor have same md5sum, the cached tensor object reused for tensor creation. Type: New feature Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-04-19 21:31:25 +08:00
Chen Feiyue	6e38e64a1a	Update internal & prebuilt-sdk for 23Q1 release (#573 ) Update internal to 0e9393dbb4f653b9dfceaeaaa920d4deb8b27077 Update prebuilt-sdk to 6.4.14 release Update cmakefiles to support above updates Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-04-18 22:19:16 +08:00
chxin66	a32f255d7f	Modified error to warning when check consumed tensor (#572 ) If graph has free INPUT or OUTPUT, modified error to warning when check in graph compile Type: Code refine Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-04-10 13:28:51 +08:00
liyuenan	27890719b6	Support remote platform by gRPC (#561 ) * Support remote platform by gRPC Signed-off-by: yuenan.li <yuenan.li@verisilicon.com>	2023-03-28 09:51:23 +08:00
chxin66	c2755b90ea	Fixed l2normalization layout infer bug (#570 ) And added a case Type: Bug fix Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-03-27 15:00:25 +08:00
Zhouheng Zheng	e49f67b840	Remove tensor GetDataRef api (#569 ) Co-authored-by: zhouheng.zheng <zhouheng.zheng@ouotlook.com>	2023-03-23 21:35:30 +08:00
chxin66	6424ef104e	Fixed the IOtensor order difference between src_graph and infer_graph * Fixed the IOtensor order difference between src_graph and infer_graph Graph Input/Output tensor sequence may changed after graph transformation(layout infer), it is difficult to get the IO mapping between original graph and final graph. Clients such as the Android Support Library create tensors using the original input/output order, which may not be the same as the input order of src_graph, the data can not be setup correctly. Solution: Decide the order of inputs/outputs while creating tensor not at binding to operation. The order of binding could be change in each transform. Type:Code improvement Signed-off-by: Chen Xin <jack.chen@verisilicon.com> * Fixed maxpoolgrad maxpoolwithargmax2 cases Some tensors created with wrong attr Type: Bug fix Signed-off-by: Chen Xin <jack.chen@verisilicon.com> --------- Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-03-21 09:21:15 +08:00
zhouheng.zheng	958b26e499	Fix mirror pad param mismatch	2023-03-21 09:20:00 +08:00
Chen Xin	f0a0f1728a	Added case for hardswish Signed-off-by: Chen Xin <jack.chen@verisilicon.com>	2023-03-06 09:37:13 +08:00
Chen Xin	e71d537042	Fixed deconv2d layout infer bug Type: Bug fix Signed-off-by: Chen Xin <jack.chen@verisilicon.com>	2023-02-20 13:06:24 +08:00
ZhangXiang	1c6041c394	Introduce CMAKE option TIM_VX_DBG_DISABLE_TENSOR_HNDL=OFF Enable/Disable tensorFromHandle usage Signed-off-by: ZhangXiang <Xiang.Zhang@verisilicon.com>	2023-02-09 14:31:32 +08:00
Feiyue Chen	ea4ba66b94	Refine Depth2Space op Change default mode from CRD_mode to DCR_mode Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-02-09 13:19:57 +08:00

1 2 3 4 5 ...

456 Commits All Branches Search

456 Commits

All Branches