TIM-VX

Commit Graph

Author	SHA1	Message	Date
Chen Feiyue	9b945633d7	Fixed typing error and added missed opheader (#673 ) Correct file name of moments operation Added groupedconv1d header in ops.h Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2024-01-03 09:47:47 +08:00
Zhouheng Zheng	4f4f6cd6dc	Add json third party when support node trace db (#670 ) Type: Code Improvement	2023-12-20 21:26:42 +08:00
Yunshan	feaf06365b	Refine layout inference (#671 ) * Remove unnecessary compiler flags * Refactor CMakeLists.txt * Tweak CMakeLists.txt for libtim_internal * Tweak CMakeLists.txt for libtim-vx * Make TIM_VX_ENABLE_TEST defaults to OFF * Eliminate usage of include_directories * Fix CI unit test * Fix warnings relating to inheritance * Keep graph output order in layout inference Type: Code Improvement * Fix typos in layout inference Type: Code Improvement --------- authored-by: Xiaoran Weng <Xiaoran.Weng@verisilicon.com>	2023-12-20 21:26:16 +08:00
Zhouheng Zheng	622c472edf	Add uid() api for class operation (#668 ) Type: Code Improvement	2023-12-18 22:32:21 +08:00
chxin66	11d12f03a8	fix layoutinfer crash when logical op inputs are different rank (#667 ) Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-12-13 09:57:17 +08:00
chxin66	0dc7a3465e	fix const tensor align bug in AlignPermuteVectorForElementWise (#666 ) * fix const tensor align bug in AlignPermuteVectorForElementWise Signed-off-by: Chen <jack.chen@verisilicon.com> * fix build issue use android ndk Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> * Fix inappropriate comments for reduce layoutinfer Type: Code refine Signed-off-by: Chen <jack.chen@verisilicon.com> --------- Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-12-11 16:59:37 +08:00
chxin66	720f0a485a	fix crash when eletwise inputs are different rank (#665 ) Fix crash in AlignPermuteVectorForElmentWise() if inputs tensor have different rank Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com>	2023-12-06 17:15:15 +08:00
chxin66	e013cf0a65	fix slope shpae 1 crash issue (#663 ) graph compile will crash when shape is broadcast from 1 to 1,1,1,1 Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-12-05 09:35:10 +08:00
Chen Feiyue	4578f40953	Added 2 cases for stack (#664 ) Added int32 and uint8 ut for stack op Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-11-30 15:01:33 +08:00
Chen Feiyue	517397949d	Fix the instance norm test input size bug in layout infer test (#661 ) Correct gamma and beta size in InstanceNorm.nhwc case Type: Bug Fix Issue: 37103 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-11-22 09:20:27 +08:00
chxin66	74e2740daa	add a case for resize bilinear (#662 ) Type: Code improvement Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-11-21 21:39:42 +08:00
Chen Feiyue	8267effdfb	Refine RNNCell/HardSwish/Reduce_sum ut (#660 ) Modify tolerance in some of these op unit tests for StreamProcessor Type: Bug Fix Issue: 37103 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-11-21 16:18:22 +08:00
Chen Feiyue	4fde0badb2	Refine UnitTest which have acc issue or unspport issue in sp (#659 ) 1.Skip elementwise/relational op unittest if input/output have INF which sp cannot handle 2.Set different tolerance in Layernom/SoftMax/GRU unittest if SP supported Type: Bug Fix Issue: 37103 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-11-20 14:42:41 +08:00
Chen Feiyue	a24d2be9c3	Rebuild prebuil-sdk to adjust lower ubuntu env (#658 ) Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-11-09 15:44:34 +08:00
Chen Feiyue	bb10884f98	Added scalar type support (#655 ) Added SetScalar api to support scalar input Added 2 cases for scalar index Gather Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-11-06 09:58:03 +08:00
Chen Feiyue	1bb1e070f2	Update internal to 1.1.88 release (#657 ) Internal ovxlib SHA 32fe479af5549e894bcd40de5740ae0dfd42bdb9 Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-11-03 13:16:33 +08:00
xie-oritek	10081790ee	Add ScatterND_Update operator (#652 ) * Add ScatterND_Update operator * Remove ScatterNDUpdate shape param * Rename ScatterND_Update to ScatterND_ONNX_V16 * Fix ScatterND_ONNX_V16 rename problem --------- Co-authored-by: unknown <z0026@china.oritek.com.cn>	2023-10-11 09:12:40 +08:00
chxin66	363c369bf6	Fixed quant param lost in Bidirectional lstm (#649 ) https://github.com/VeriSilicon/TIM-VX/issues/647 Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com>	2023-09-19 22:08:34 +08:00
Chen Feiyue	61ea0091ca	Fixed unsupported float16 bias in fc (#646 ) Resolve the issue of underlying hardware not supporting float16 bias in fc by converting bias type to float32 Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-09-13 09:44:21 +08:00
Antkillerfarm	98966dac9c	build fix for export Swap Handle API (#643 ) PR #635 build error fix Type: bug fix Signed-off-by: Tang Jing <jing.tang@verisilicon.com>	2023-08-30 14:25:45 +08:00
chxin66	01235266c5	fixed tensor cache mismatch issue (#644 ) Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-08-30 14:23:20 +08:00
Zhouheng Zheng	5668856fc9	Fix the instance norm test input size bug (#645 ) Only NNAPI instance norm spec have scalar gamma and beta, which can not support by sp, rewrite it into tensor. Type: Bug Fix Co-authored-by: zhouheng.zheng <zhouheng.zheng@ouotlook.com>	2023-08-30 14:10:23 +08:00
Antkillerfarm	3bbe2ef9ec	export Swap Handle API (#635 ) export vsi_nn_SwapHandle & vsi_nn_SwapTensorHandle & vsi_nn_SwapTensorHandleWithCache for TIM-VX usage. Type: New Feature Signed-off-by: Tang Jing <jing.tang@verisilicon.com>	2023-08-28 09:15:43 +08:00
xie-oritek	7fc264a9e6	Refine Tensor::SetShape api to avoid compile warning using const ref (#640 ) * Move int4/uint4 to the end of DataType * Refine api Tensor::SetShape, using const ref avoid compile warning	2023-08-25 00:47:24 +08:00
MercuryChen	6f34b66ae4	Split replayer code from tracer.h (#642 ) * Add support for different input dtype of MaxPoolGrad. Type: Code improvement * Integrate api trace into tim-vx source code, as part of experimeantal. Type: New Feature * Refine api trace code and document Add missing traced apis of tim::vx::Quantization Type: Code improvement * Split Api relayer code out of tracer. To enable compile replayer code in machine which can't access high version boost libs. Type: Code improvement	2023-08-25 00:41:45 +08:00
xie-oritek	265e74ff16	Add int4/uint4 definition (#638 )	2023-08-22 17:37:38 +08:00
xie-oritek	54af5c2216	Add CumSum&LRN operator to trace module (#639 )	2023-08-22 16:53:59 +08:00
xie-oritek	bab571b569	Fix data missing when use trace::Graph::CreateTensor (#636 )	2023-08-22 16:53:24 +08:00
Chen Feiyue	9bb3e7c68b	Fixed misleading test case bug in deconv1d (#633 ) Correct erros of deconv1d unittest Added hint in the header indicating that padtype is not supported yet Added 2 cases for deconv1d Type: Code Improvement Issue: github issue #585 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-17 21:26:54 +08:00
MercuryChen	cf2efc63fd	Refine api trace code and document (#634 ) * Add support for different input dtype of MaxPoolGrad. Type: Code improvement * Integrate api trace into tim-vx source code, as part of experimeantal. Type: New Feature * Refine api trace code and document Add missing traced apis of tim::vx::Quantization Type: Code improvement	2023-08-17 21:16:34 +08:00
Chen Feiyue	2f018cc088	Code refinement for mean-stddev-normalization fuse (#632 ) 1.Added copyright && Added reference or const reference for functions 2.Rewrite function of determing whether there is a common input 3.Use std::remove_if instead of std::find before doing erase 4.Added security check to prevent access to deleted ops Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-15 13:15:03 +08:00
Chen Feiyue	af50cc5e3f	Added general Float16 support (#631 ) Added Float16 type definition from third-party Refine float16 bias handlling in conv2d Refine float16 case in conv2d Caution: Headers of float16 only be included when build unit_test Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-12 10:04:16 +08:00
Chen Feiyue	35e50d7692	Added op fusion for mean_stddev_normalization (#629 ) Added op fusion for mean_stddev_normalization ops such as layernorm and instance norm Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-08-09 22:10:45 +08:00
chxin66	bff26a32c4	fix size compute bug in lrn (#626 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-08-07 13:20:35 +08:00
chxin66	6a5694e557	fixed prelu layoutinfer bug & added cases (#628 ) Type: bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-08-07 13:17:46 +08:00
zhongzhuonan	f0cf45fdaa	Create self-hosted.yml (#625 ) * Create self-hosted.yml * Update self-hosted.yml	2023-07-26 13:31:07 +08:00
Sven	821864a582	Fixed IExecutable object not bind with DeviceID (#624 ) If Executable object doesn't bind with a concrete DeviceID, it will go first device by default. When run multi executable with multi device, the behavior is not expected. Fixed by attach device id with CompileOption. Signed-off-by: xiang.zhang <xiang.zhang@verisilicon.com>	2023-07-24 22:45:54 +08:00
chxin66	680e8d59cb	Fixed conv2d grouped_conv2d deconv2d layoutinfer bug (#622 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-24 17:10:24 +08:00
MercuryChen	315adcf076	Integrate api trace into tim-vx source as an experimental feature. (#623 ) * Add support for different input dtype of MaxPoolGrad. Type: Code improvement * Integrate api trace into tim-vx source code, as part of experimeantal. Type: New Feature	2023-07-19 18:40:48 +08:00
Chen Feiyue	0885a0d797	Remove confusing comment in depthwise conv test (#621 ) Remove wrong layout comment for depthwise conv unit test Add comment of layout condition in basic class for depthwise conv Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-17 09:43:34 +08:00
Chen Feiyue	62c6b6560c	Added axis param for TopK (#610 ) Topk support specifying dimensions with later internal ovxlib Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-12 09:54:07 +08:00
Chen Feiyue	18749f5d05	Fixed transient deconv1d generate wrong output shape bug (#619 ) Fixed bug that when deconv1d ouput is set to be transient, actual output shape will be zero at dim 1. Reason :internal typing error Type: Bug Fix Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-08 23:40:32 +08:00
chxin66	ea8046ec9c	Added roi_align layoutinfer & cases (#615 ) * Added roi_align layoutinfer & cases Type: New feature Signed-off-by: Chen <jack.chen@verisilicon.com> * Update instancenorm op spec .json Type: bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> * Added roi_pool layoutinfer & fixed case bug Type: new feature Signed-off-by: Chen <jack.chen@verisilicon.com> --------- Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-08 23:39:56 +08:00
Chen Feiyue	32c5a61601	Update prebuilt && internal for 23Q2 release (#617 ) * Update prebuilt-sdk to 6.4.15 release Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com> * Update internal to 1.1.84 rel Update internal to SHA 1e591108dddcbf6dd88d5eef97a7d8b3ffc19ce3 Type: Code Improvement Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com> --------- Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-07-08 23:38:17 +08:00
chxin66	02d6d72946	fixed yolov4 build issue (#618 ) Type: Bug fix Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-06 09:30:24 +08:00
chxin66	5d741e8ebe	Optimize compilation process for openssl (#613 ) do not rebuild when openssl lib already exist Type: Code refine Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-07-03 15:48:19 +08:00
Chen Feiyue	33f3a4f176	Enable float16 bias convolution model runs on NN (#612 ) Convert float16 bias tensor to float32 to meet condition of NN convolution in driver Caution: Clang version requires minimum 15.0 Type: Code Improvement Issue: bugzilla id:32785 \| jira id VIVD-744 Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-30 09:41:28 +08:00
chxin66	34812fe40e	Added case for gather (#599 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-26 09:15:08 +08:00
chxin66	233eb439e1	Fixed viplite driver build issue (#611 ) Signed-off-by: Chen <jack.chen@verisilicon.com> Co-authored-by: Chen <jack.chen@verisilicon.com>	2023-06-26 09:14:42 +08:00
Chen Feiyue	75882d4195	Added new_axis_mask param for stridedslice (#600 ) Add another constructor for stridedslice when new_axis_mask is set The layout inference need to reconstruct the axis mapping when new_axis_mask is set(TODO) Type: New Feature Signed-off-by: Feiyue Chen <Feiyue.Chen@verisilicon.com>	2023-06-25 09:24:41 +08:00

487 Commits