553 Commits (v1.7.1.m1)

Author SHA1 Message Date
  Megvii Engine Team c96dbd29b8 fix(dnn/arm_common): support more monotonous case in arm typecvt for performance 3 years ago
  Megvii Engine Team ead611e11d perf(dnn): slightly improve arm neon transcendental function performance 3 years ago
  Megvii Engine Team 0d16952470 fix(mgb/cuda): fix conv error when the input tensor is too large 3 years ago
  Megvii Engine Team 02d5f46d90 fix(mgb/x86): fix convbias crash on X86 3 years ago
  Megvii Engine Team accb2d8d47 fix(mgb/serialize): fix flatbuffer compatibility issues 3 years ago
  Megvii Engine Team 5e07e1e0f9 fix(dnn/falback): let cpu be able to execute int4 model 3 years ago
  Megvii Engine Team 2696e4efaa feat(dnn): add float16 for remap backward 3 years ago
  Megvii Engine Team 1f0cc891b0 feat(dnn): enable eye to support bool 3 years ago
  Megvii Engine Team 11d75fecb5 feat(dnn/check_non_finite): add batch check_non_finite 3 years ago
  Megvii Engine Team 2318ea3f15 fix(dnn): fix naive average pooling overflow bug for int8 type 3 years ago
  Megvii Engine Team 2d54ad185b feat(lite): add global layout transform interface for load and run 3 years ago
  Megvii Engine Team ba2f0c2e48 fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases 3 years ago
  Megvii Engine Team 30976c239f fix(mgb/gopt): fix global layout transform 3 years ago
  Megvii Engine Team ca7cec7a5d fix(mgb/gopt): minor fixes for global layout transform 3 years ago
  Megvii Engine Team fe93013a6e feat(mgb/gopt): global layout transform support nchw_nchwxx hybrid mode 3 years ago
  Megvii Engine Team 3d45d35241 feat(mgb/gopt): profiler support checking algo availability 3 years ago
  Megvii Engine Team b59e8ccf24 fix(mgb): fix cambricon bangc copybara 3 years ago
  Megvii Engine Team 3116e128c5 fix(ci/integration_test): fix benchmark torch version 3 years ago
  Megvii Engine Team c85631aa77 feat(dnn): use ref ptr interface for all backends 3 years ago
  Megvii Engine Team d90cb7763c feat(src/core): record support change ptr basic 3 years ago
  Megvii Engine Team 89186edc5d fix(dnn): correct reduce/argmxx/fakequant calculation with nan 3 years ago
  Megvii Engine Team 68cdabd288 feat(opr): indexing_multi_axis_vec support nd index 3 years ago
  Megvii Engine Team a1cba6cc27 fix(dnn): fix convbias crash on X86 3 years ago
  Megvii Engine Team 9b4cd92ba3 fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256 3 years ago
  Megvii Engine Team 23c1fda7e6 perf(arm_common): optimize sigmoid 3 years ago
  Megvii Engine Team 25ec2530ba feat(whl/api/lar): enable megengine dll on Windows 3 years ago
  Megvii Engine Team c48d58daa8 feat(dnn/arm_common): add N1HW like elemwise broadcast mode 3 years ago
  Megvii Engine Team 26634db7a8 fix(dnn): support relayout for non-contiguous layout 3 years ago
  Megvii Engine Team 056fd6bc59 feat(dnn/arm64): support stride_m in arm64 relayout 3 years ago
  Megvii Engine Team c50858ee13 fix(dnn): specialize pow to make it consistent 3 years ago
  Megvii Engine Team 849f0ece9d fix(dnn): drop batched matmul cublas algo when batch is 1 3 years ago
  Megvii Engine Team b5bf56e0ee style(dnn): add bypass of clang-format for dnn foreach_opr macro 3 years ago
  Megvii Engine Team 5af52746f7 fix(mgb): fix bug caused by conv filter size is too big 3 years ago
  liuke b0ba6d3201 Merge pull request #207 from togetherwhenyouwant:feat-x86-matmul-6x16x2 3 years ago
  Megvii Engine Team 10af44abba fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout 3 years ago
  Megvii Engine Team 5885b137fa feat(dnn/arm): support layout like NHWC channel like broadcast on arm 3 years ago
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 3 years ago
  Megvii Engine Team bfb30dcb81 chore(format): fix compile bugs after code format 3 years ago
  Megvii Engine Team eeccf2bc0d ci(check): add clang-format in check stage 3 years ago
  zjl d2184af3b2 feat(dnn/src/x86/matmul): add matmul_6x16 for x86 3 years ago
  Megvii Engine Team 177dec94c5 feat(mgb/opr): add bgr2gray mode for cvtcolor opr 3 years ago
  Megvii Engine Team f5cb21ed3a fix(mgb/opr): add non finite check 3 years ago
  Megvii Engine Team bde5cf3564 feat(dnn): add resize linear for arm 3 years ago
  Megvii Engine Team 8cb201868e fix(mgb): fix fastrun cache serialization method 3 years ago
  Megvii Engine Team 563239d38f feat(dnn): add arm_common nchw44 cwconv3x3s1p1 and cwconv5x5s1p2 3 years ago
  Megvii Engine Team 3344b580a9 feat(dnn): add elemwise for nchw88+fp16 3 years ago
  Megvii Engine Team 682c74df27 feat(dnn): add direct nchw88 fp16 conv 3 years ago
  Megvii Engine Team fca195351c feat(gopt): add nhwc fuse conv typecvt optpass 3 years ago
  Megvii Engine Team 2fc7358517 Revert "feat(dnn/apicache): add generic apicache" 3 years ago
  Megvii Engine Team de363c04af Revert "perf(cuda/conv): cache serval cudnn api" 3 years ago