128 Commits (release-1.0)

Author SHA1 Message Date
  Megvii Engine Team edb32495c6 feat(dnn/opr): add megdnn adaptive pooling opr 4 years ago
  Megvii Engine Team 310c805f20 fix(dnn/cuda): use kernel parameter instead of user constant memory 4 years ago
  Megvii Engine Team 3a03fa7a50 fix(dnn/cuda): disable pascal sass conv2d 4 years ago
  Megvii Engine Team a5fad7d07c feat(dnn): add compile for riscv64 4 years ago
  Megvii Engine Team 76fa71573b feat(dnn/cuda): add cutlass nchw4 convolution 4 years ago
  Megvii Engine Team 5b6ebeb563 fix(mgb): append json file for dump and ready for midout open source 4 years ago
  Megvii Engine Team 16324e3076 feat(dnn/cuda): add remap backward 5 years ago
  Megvii Engine Team bd73dabbe2 fix(dnn/build): add CUDNN_INCLUDE_DIR to the megdnn_test target 4 years ago
  Megvii Engine Team 343335932a fix(dnn/arm): fix read invalid data in arm kernel 4 years ago
  Megvii Engine Team 6e882c1a86 feat(whl/imperative): compat for build python whl imperative and legacy runtime 4 years ago
  Megvii Engine Team 7f857bd471 feat(mgb/rocm): add cmake for rocm and fix compile errors and bn 4 years ago
  Megvii Engine Team 9510136223 fix(mgb/rocm): remove begin-internal of rocm 4 years ago
  Megvii Engine Team 0380811218 feat(dnn/arm_common): add nchw44 8x8x16 stride1 stride2 4 years ago
  Megvii Engine Team 00ef677249 fix(mgb): remove internal for cambricon and atlas 4 years ago
  Megvii Engine Team aeffcd5897 feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution 4 years ago
  Megvii Engine Team 6e70fa7a11 feat(dnn/arm): add fp32 asm gemm for a53 a55 and i8i8i16 gemm for a72 a53 4 years ago
  Megvii Engine Team b778d22523 feat(mgb/fallback): add conv1x1_gemv, conv1x1 and im2col 8x8x16/8x8x32 support bias 4 years ago
  Megvii Engine Team c357db0134 feat(mgb/arm_common): add 8x8x16 nchw44 max pooling 4 years ago
  Megvii Engine Team 7f5f375fda feat(dnn/arm): add armv7 nchw_nchw44 3x3s2 asm kernel 4 years ago
  Megvii Engine Team 3931099ea7 fix(dnn/test): fix nchw_nchw44 i8i8i16 benchmark 4 years ago
  Megvii Engine Team bcf5691ddf feat(dnn/arm): add nchw_nchw44 i8i8i16 2x2 3x3 5x5 7x7 s1 s2 conv 4 years ago
  Megvii Engine Team c7b6ef35c1 feat(dnn/cuda): add warp perspective backward mat idx 5 years ago
  Megvii Engine Team a773d07678 feat(dnn/arm_common): add nchw44 8x8x16 channel wise conv 4 years ago
  Megvii Engine Team e258812f12 feat(dnn): add bool dtype 4 years ago
  Megvii Engine Team 7ca3d579db feat(dnn): make mk4 and mk8 matmul for winograd both on aarch64 and armv7 supports n=1 4 years ago
  Megvii Engine Team f6018422fd perf(dnn/arm_common): add nchw44 winograd f73 5 years ago
  Megvii Engine Team e1e56988cd feat(dnn/fallback): add conv1x1 filter preprocess function 5 years ago
  Megvii Engine Team e05c795b45 refactor(dnn/arm): refactor direct algo in algo selection 4 years ago
  Megvii Engine Team 324af87807 feat(dnn/arm): add cpuinfo runtime check for x86 and arm 4 years ago
  Megvii Engine Team 8b183f2c70 test(dnn/testcase): fix a testcase bug 4 years ago
  Megvii Engine Team 14a32ae19b fix(cmake/cross-build): misc fix 4 years ago
  Megvii Engine Team edd7e16701 feat(dnn/fallback): add im2col filterpreprocess function 5 years ago
  Megvii Engine Team ef267dacf8 fix(megdnn_test/ev300): try run megdnn_test on ev300 board 4 years ago
  Megvii Engine Team eed54081ab feat(dnn/arm): add armv7 mk4 i8i8i16 gemm, optimized for A7 4 years ago
  Megvii Engine Team 4d56371e0b refactor(dnn/arm): split arm direct kernel to cut compile time 5 years ago
  Megvii Engine Team fc1ce273b7 fix(dnn/cuda): fix elemwise add cuda int8 bcast 4 years ago
  Megvii Engine Team 57bc36575f style(dnn/cuda): format cuda elemwise code 4 years ago
  Megvii Engine Team fff2cdc7bb feat(dnn/fallback): add winograd weight preprocess 5 years ago
  Megvii Engine Team d37229fa02 feat(dnn): optimize f23 and f63 nchw44 winograd 5 years ago
  Megvii Engine Team 3bd8ef3589 feat(mgb/compnode): add atlas compnode 5 years ago
  Megvii Engine Team 1e576e321b feat(dnn/aarch64-arm_common): add mat_idx warp perspective for aarch64/arm_common/naive 5 years ago
  Megvii Engine Team 714cb232bb feat(dnn): add gemv supports in conv1x1 for NCHW44 and NCHW44_DOT(aarch64 binary size grows 2KB) 5 years ago
  Megvii Engine Team b8b000db3b feat(dnn/fallback): fix fallback interface of weight preprocess 5 years ago
  Megvii Engine Team 5fb07c9964 fix(dnn/x86): fix cmake error for build x86 gtest 5 years ago
  Megvii Engine Team 763b57add7 fix(dnn/cuda): fix INTMAX overflow in warp_perspective_cuda 5 years ago
  Megvii Engine Team 2e6e570dfe feat(dnn/fallback): add armv7 im2col mk4-dot int8 and 5 years ago
  Megvii Engine Team 7886ff9af0 feat(dnn): add relayout_format for nchw to nchw4 and ic <=4 5 years ago
  Megvii Engine Team dedb7a3f14 feat(dnn/cuda): add cuda remap 5 years ago
  Megvii Engine Team 946a340c3d feat(ci/midout): opt midout and add midout ci 5 years ago
  Megvii Engine Team 44c381b6f4 Revert "feat(dnn/naive): workspacebundle support 2D" 5 years ago