306 Commits (c90e0b54bea08b46b656da8f69aafac353e28279)

Author SHA1 Message Date
  Megvii Engine Team c90e0b54be perf(arm): optimize arm uint16 relayout with n=4 3 years ago
  Megvii Engine Team f6d9909460 feat(dnn): add elemwise multi type support i16xf32 and u8xf32 3 years ago
  Megvii Engine Team 6bb5409976 feat(dnn/src): add images2neibs kernel of opencl and related test 3 years ago
  Megvii Engine Team c96dbd29b8 fix(dnn/arm_common): support more monotonous case in arm typecvt for performance 3 years ago
  Megvii Engine Team 02d5f46d90 fix(mgb/x86): fix convbias crash on X86 3 years ago
  Megvii Engine Team 2696e4efaa feat(dnn): add float16 for remap backward 3 years ago
  Megvii Engine Team 11d75fecb5 feat(dnn/check_non_finite): add batch check_non_finite 3 years ago
  Megvii Engine Team 2318ea3f15 fix(dnn): fix naive average pooling overflow bug for int8 type 3 years ago
  Megvii Engine Team ba2f0c2e48 fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases 3 years ago
  Megvii Engine Team b59e8ccf24 fix(mgb): fix cambricon bangc copybara 3 years ago
  Megvii Engine Team 3116e128c5 fix(ci/integration_test): fix benchmark torch version 3 years ago
  Megvii Engine Team c85631aa77 feat(dnn): use ref ptr interface for all backends 3 years ago
  Megvii Engine Team 89186edc5d fix(dnn): correct reduce/argmxx/fakequant calculation with nan 3 years ago
  Megvii Engine Team 68cdabd288 feat(opr): indexing_multi_axis_vec support nd index 3 years ago
  Megvii Engine Team a1cba6cc27 fix(dnn): fix convbias crash on X86 3 years ago
  Megvii Engine Team 9b4cd92ba3 fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256 3 years ago
  Megvii Engine Team c48d58daa8 feat(dnn/arm_common): add N1HW like elemwise broadcast mode 3 years ago
  Megvii Engine Team 26634db7a8 fix(dnn): support relayout for non-contiguous layout 3 years ago
  Megvii Engine Team 056fd6bc59 feat(dnn/arm64): support stride_m in arm64 relayout 3 years ago
  liuke b0ba6d3201 Merge pull request #207 from togetherwhenyouwant:feat-x86-matmul-6x16x2 3 years ago
  Megvii Engine Team 10af44abba fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout 3 years ago
  Megvii Engine Team 5885b137fa feat(dnn/arm): support layout like NHWC channel like broadcast on arm 3 years ago
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 3 years ago
  zjl d2184af3b2 feat(dnn/src/x86/matmul): add matmul_6x16 for x86 3 years ago
  Megvii Engine Team 177dec94c5 feat(mgb/opr): add bgr2gray mode for cvtcolor opr 3 years ago
  Megvii Engine Team f5cb21ed3a fix(mgb/opr): add non finite check 3 years ago
  Megvii Engine Team bde5cf3564 feat(dnn): add resize linear for arm 3 years ago
  Megvii Engine Team 3344b580a9 feat(dnn): add elemwise for nchw88+fp16 3 years ago
  Megvii Engine Team 682c74df27 feat(dnn): add direct nchw88 fp16 conv 3 years ago
  Megvii Engine Team 3d3666b6e0 test(dnn/bn): add compatible configs for NHWC BN 3 years ago
  Megvii Engine Team 3977b7aa0b feat(mgb/shuffle): add shuffle opr 3 years ago
  Megvii Engine Team 17371e79b9 fix(dnn/reduce): fix reduce_mean o16c32 is incorrect for large tensor 3 years ago
  Megvii Engine Team c33126ab5c feat(mgb/gopt): add reformat manager 3 years ago
  Megvii Engine Team 8b40f57738 feat(mgb/dnn): add conv1x1 algo for matrix mul 3 years ago
  Megvii Engine Team d69b59035d feat(dnn): add a get_all_algorithms_safe interface 3 years ago
  Megvii Engine Team 103d7f33ba refactor(dnn/rocm): update hip license header 4 years ago
  Megvii Engine Team 5aa52d3863 feat(dnn/rocm): add adaptive pooling opr 3 years ago
  Megvii Engine Team 323a4642e6 feat(dnn/rocm): add topk opr 3 years ago
  Megvii Engine Team f4784f4af1 feat(dnn/rocm): add argsort opr 3 years ago
  Megvii Engine Team 8b94f49328 fix(dnn/cuda): fix elemwise and relayout int4 bug when last shape is 1 3 years ago
  Megvii Engine Team bc9cfc277a feat(mgb): add arm resize nchwxx and naive nearest interp 3 years ago
  Megvii Engine Team 722aecd437 feat(mgb): support fp16 nhwc backward 3 years ago
  Megvii Engine Team 0708bc780c fix(dnn/cuda): disallow implicit dtype conversion in cublaslt matmul algos 3 years ago
  Megvii Engine Team 1e83ab638e feat(dnn): add channelwise conv for fp16 nchw88 3 years ago
  Megvii Engine Team 4c13bc7e1b feat(dnn/cuda): add nhwc int8 deconv 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team 67575d582c feat(mge/opr): add interpolate bilinear mode 3 years ago
  Megvii Engine Team 0558b2123d feat(mge/opr): add interpolate nearest mode 3 years ago
  Megvii Engine Team c25125e3d2 perf(dnn/cuda): sass int8 epilogue remove shared load 3 years ago
  Megvii Engine Team c9d060307f feat(dnn/common): add named tensor shape 4 years ago