TIM-VX

Commit Graph

Author	SHA1	Message	Date
xie-oritek	7fc264a9e6	Refine Tensor::SetShape api to avoid compile warning using const ref (#640 ) * Move int4/uint4 to the end of DataType * Refine api Tensor::SetShape, using const ref avoid compile warning	2023-08-25 00:47:24 +08:00
MercuryChen	6f34b66ae4	Split replayer code from tracer.h (#642 ) * Add support for different input dtype of MaxPoolGrad. Type: Code improvement * Integrate api trace into tim-vx source code, as part of experimeantal. Type: New Feature * Refine api trace code and document Add missing traced apis of tim::vx::Quantization Type: Code improvement * Split Api relayer code out of tracer. To enable compile replayer code in machine which can't access high version boost libs. Type: Code improvement	2023-08-25 00:41:45 +08:00
xie-oritek	265e74ff16	Add int4/uint4 definition (#638 )	2023-08-22 17:37:38 +08:00
xie-oritek	54af5c2216	Add CumSum&LRN operator to trace module (#639 )	2023-08-22 16:53:59 +08:00
xie-oritek	bab571b569	Fix data missing when use trace::Graph::CreateTensor (#636 )	2023-08-22 16:53:24 +08:00
Chen Feiyue	9bb3e7c68b	Fixed misleading test case bug in deconv1d (#633 ) Correct erros of deconv1d unittest Added hint in the header indicating that padtype is not supported yet Added 2 cases for deconv1d Type: Code Improvement Issue: github issue #585 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-17 21:26:54 +08:00
MercuryChen	cf2efc63fd	Refine api trace code and document (#634 ) * Add support for different input dtype of MaxPoolGrad. Type: Code improvement * Integrate api trace into tim-vx source code, as part of experimeantal. Type: New Feature * Refine api trace code and document Add missing traced apis of tim::vx::Quantization Type: Code improvement	2023-08-17 21:16:34 +08:00
Chen Feiyue	2f018cc088	Code refinement for mean-stddev-normalization fuse (#632 ) 1.Added copyright && Added reference or const reference for functions 2.Rewrite function of determing whether there is a common input 3.Use std::remove_if instead of std::find before doing erase 4.Added security check to prevent access to deleted ops Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-15 13:15:03 +08:00
Chen Feiyue	af50cc5e3f	Added general Float16 support (#631 ) Added Float16 type definition from third-party Refine float16 bias handlling in conv2d Refine float16 case in conv2d Caution: Headers of float16 only be included when build unit_test Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-12 10:04:16 +08:00
Chen Feiyue	35e50d7692	Added op fusion for mean_stddev_normalization (#629 ) Added op fusion for mean_stddev_normalization ops such as layernorm and instance norm Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-09 22:10:45 +08:00
chxin66	bff26a32c4	fix size compute bug in lrn (#626 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-08-07 13:20:35 +08:00
chxin66	6a5694e557	fixed prelu layoutinfer bug & added cases (#628 ) Type: bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-08-07 13:17:46 +08:00
zhongzhuonan	f0cf45fdaa	Create self-hosted.yml (#625 ) * Create self-hosted.yml * Update self-hosted.yml	2023-07-26 13:31:07 +08:00
Sven	821864a582	Fixed IExecutable object not bind with DeviceID (#624 ) If Executable object doesn't bind with a concrete DeviceID, it will go first device by default. When run multi executable with multi device, the behavior is not expected. Fixed by attach device id with CompileOption. Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>	2023-07-24 22:45:54 +08:00
chxin66	680e8d59cb	Fixed conv2d grouped_conv2d deconv2d layoutinfer bug (#622 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-24 17:10:24 +08:00
MercuryChen	315adcf076	Integrate api trace into tim-vx source as an experimental feature. (#623 ) * Add support for different input dtype of MaxPoolGrad. Type: Code improvement * Integrate api trace into tim-vx source code, as part of experimeantal. Type: New Feature	2023-07-19 18:40:48 +08:00
Chen Feiyue	0885a0d797	Remove confusing comment in depthwise conv test (#621 ) Remove wrong layout comment for depthwise conv unit test Add comment of layout condition in basic class for depthwise conv Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-17 09:43:34 +08:00
Chen Feiyue	62c6b6560c	Added axis param for TopK (#610 ) Topk support specifying dimensions with later internal ovxlib Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-12 09:54:07 +08:00
Chen Feiyue	18749f5d05	Fixed transient deconv1d generate wrong output shape bug (#619 ) Fixed bug that when deconv1d ouput is set to be transient, actual output shape will be zero at dim 1. Reason :internal typing error Type: Bug Fix Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-08 23:40:32 +08:00
chxin66	ea8046ec9c	Added roi_align layoutinfer & cases (#615 ) * Added roi_align layoutinfer & cases Type: New feature Signed-off-by: Chen <jack.chen@verisilicon.com> * Update instancenorm op spec .json Type: bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> * Added roi_pool layoutinfer & fixed case bug Type: new feature Signed-off-by: Chen <jack.chen@verisilicon.com> --------- Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-08 23:39:56 +08:00
Chen Feiyue	32c5a61601	Update prebuilt && internal for 23Q2 release (#617 ) * Update prebuilt-sdk to 6.4.15 release Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com> * Update internal to 1.1.84 rel Update internal to SHA 1e591108dddcbf6dd88d5eef97a7d8b3ffc19ce3 Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com> --------- Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-08 23:38:17 +08:00
chxin66	02d6d72946	fixed yolov4 build issue (#618 ) Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-06 09:30:24 +08:00
chxin66	5d741e8ebe	Optimize compilation process for openssl (#613 ) do not rebuild when openssl lib already exist Type: Code refine Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-03 15:48:19 +08:00
Chen Feiyue	33f3a4f176	Enable float16 bias convolution model runs on NN (#612 ) Convert float16 bias tensor to float32 to meet condition of NN convolution in driver Caution: Clang version requires minimum 15.0 Type: Code Improvement Issue: bugzilla id:32785 \| jira id VIVD-744 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-30 09:41:28 +08:00
chxin66	34812fe40e	Added case for gather (#599 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-26 09:15:08 +08:00
chxin66	233eb439e1	Fixed viplite driver build issue (#611 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-26 09:14:42 +08:00
Chen Feiyue	75882d4195	Added new_axis_mask param for stridedslice (#600 ) Add another constructor for stridedslice when new_axis_mask is set The layout inference need to reconstruct the axis mapping when new_axis_mask is set(TODO) Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-25 09:24:41 +08:00
Chen Feiyue	d823ef6fcb	Remove unused value in op layoutinfer (#607 ) Remove unused value to make sure project build successfully with higher version compiler such as clang15 Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-25 09:23:09 +08:00
chxin66	bd89faddb1	Fixed openssl android build bug (#606 ) android cross build fail when not specified LOCAL_OPENSSL Type: bug fixed Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-19 21:57:46 +08:00
chxin66	26b4e53fe7	fixed reduce layoutinfer bug (#605 ) Type: Bug fixed Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-19 21:56:08 +08:00
Chen Feiyue	fbfbdd7c83	Added axis support for layernorm (#602 ) Layernormolization can handle non zero axis now Added case to verify layernorm with axis 2 Modify layernorm opjson Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-15 21:45:46 +08:00
chxin66	a64a0f7379	Added a case for resize_bilinear layoutinfer (#595 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-02 07:52:06 +08:00
Chen Feiyue	aa7b3a6f8f	Added api json for each op to support acuity (#596 ) Record constructor form of each operation as a json file to support acuity to call timvx op Type: Documentation Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-02 07:51:10 +08:00
chxin66	ea8adc456a	fixed instance norm bug & add its layoutinfer (#593 ) Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-05-31 12:55:42 +08:00
chxin66	4f92e58155	optimization for tiny_yolov4 (#591 ) Type: code improvment Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-05-23 14:28:47 +08:00
shijie001	51faf286c2	Fixed LayerNormalization eps bug (#589 )	2023-05-22 14:13:44 +08:00
chxin66	99606fd891	Add a case for local response norm (#590 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-05-20 16:57:21 +08:00
chxin66	ddcb00c11b	Fixed bug for pad test (#588 ) Type: Bug fix Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-05-20 16:56:00 +08:00
Chen Feiyue	3f83db534d	Added missed ops include header (#584 ) Include cumsum, mod, maxpool3d, grucell, UnidirectionalSequanceGRU header file in ops.h Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-05-16 15:59:16 +08:00
Chen Feiyue	3c372dd646	Refine UnidirectionalGRU and GRUCell (#587 ) Refine unidirectional_gru and gru_cell code to avoid including ovxlib files in header of some op Introduce TranslateToVsibool function to support above code refinement Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-05-15 16:44:51 +08:00
Chen Feiyue	b81f7979fa	Reload "==" operator for quantizations of two tensor (#583 ) Reload operator "==" to check two quantization same or not Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-05-10 17:58:30 +09:00
chxin66	308a967bcf	Support build openssl from local repo (#582 ) Type: Code Improvement Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-05-05 23:38:39 +08:00
SCUWQ	1543efe098	Add some tensor dtype convert APIs (#576 ) For pre/post process. Type: Code Refine Co-authored-by: wangqian <wangqian@CNCDD9444.verisilicon.com>	2023-04-27 09:04:39 +08:00
chxin66	c688ca6e81	Fixed ovxlib build error on android (#578 ) Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-04-26 16:59:52 +08:00
Chen Feiyue	3c5ee7a46e	Refine prelu layout inference (#577 ) In the past we reverse all inputs to default order pv and caused unnecessary transpose operation. In this commit only const slope will be handled and do transpose if necessary. Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-04-25 11:25:55 +08:00
Chen Feiyue	3a3c9fa5fa	Update readme in ops (#575 ) Added missing ops which have already supported; Changed status of some ops. Type: Documentation Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-04-20 13:23:56 +08:00
chxin66	f1fd2246ae	Support tensor cache while create tensor (#574 ) Support tensor cache while create tensor Tensor can be shared between different operations, if tensor have identical data and quantization parameter, they should share same low-level tensor object to save memory. In tim-vx, introduce a tensor cache which key is md5sum and value is low-level tensor object. If up-coming tensor have same md5sum, the cached tensor object reused for tensor creation. Type: New feature Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-04-19 21:31:25 +08:00
Chen Feiyue	6e38e64a1a	Update internal & prebuilt-sdk for 23Q1 release (#573 ) Update internal to 0e9393dbb4f653b9dfceaeaaa920d4deb8b27077 Update prebuilt-sdk to 6.4.14 release Update cmakefiles to support above updates Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-04-18 22:19:16 +08:00
chxin66	a32f255d7f	Modified error to warning when check consumed tensor (#572 ) If graph has free INPUT or OUTPUT, modified error to warning when check in graph compile Type: Code refine Signed-off-by: Chen Xin <jack.chen@verisilicon.com> Co-authored-by: Chen Xin <jack.chen@verisilicon.com>	2023-04-10 13:28:51 +08:00
liyuenan	27890719b6	Support remote platform by gRPC (#561 ) * Support remote platform by gRPC Signed-off-by: yuenan.li <yuenan.li@verisilicon.com>	2023-03-28 09:51:23 +08:00

1 2 3 4 5 ...

514 Commits All Branches Search

514 Commits

All Branches