235 Commits (a3550f91caa2e5415820b7cf602e5b80267bdf66)

Author SHA1 Message Date
  Megvii Engine Team 1217801133 perf(mge): add opdef for broadcast 4 years ago
  Megvii Engine Team 2a3f4d099a refactor(dnn/arm): refactor CPU heuristic algo selection 4 years ago
  Megvii Engine Team ba66e1d039 feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct 4 years ago
  Megvii Engine Team a9f98e9c66 refactor(meg/internal): move interal codes back to megbrain 4 years ago
  Megvii Engine Team 44b27f0d6e build(3516): fix some cpu flags build failed and fix 3516 ycm 4 years ago
  Megvii Engine Team 8764a6c8ff feat(dnn/cuda): add volta dp4a int8 sass kernel 4 years ago
  Megvii Engine Team 3635af6274 style(atlas): add comment for async d2d 4 years ago
  Megvii Engine Team d68d4d1d99 perf(atlas): use async d2d 4 years ago
  Megvii Engine Team 215f88f373 fix(dnn/argmxx): fix argmxx on inf 4 years ago
  Megvii Engine Team 92b12685db feat(dnn/aarch64): add aarch64 int8X8X16_mk4_k8x8x8 matmul, performance is better 4 years ago
  Megvii Engine Team 912d733ea9 fix(dnn): support bool for IndexingMultiAxisVec 4 years ago
  Megvii Engine Team edb32495c6 feat(dnn/opr): add megdnn adaptive pooling opr 4 years ago
  Megvii Engine Team 5a85c907e0 feat(mgb/opr): add megbrain adaptive pooling opr 4 years ago
  Megvii Engine Team 310c805f20 fix(dnn/cuda): use kernel parameter instead of user constant memory 4 years ago
  Megvii Engine Team b8ddca4c38 fix(atlas): add MGB_USE_ATLAS_ASYNC_API to enable async api 4 years ago
  Megvii Engine Team 95eb6ae380 feat(mgb/opr): let more ops support empty IO 4 years ago
  Megvii Engine Team 0307598a80 fix(dnn): keep consistent limit between deduce and compute 4 years ago
  Megvii Engine Team 75eebb7c42 feat(opr): use weight preprocess feature of MegDNN 4 years ago
  Megvii Engine Team cc952b2b92 fix(rocm): fix rocm megdnntest sleep and a cut code 4 years ago
  Megvii Engine Team 3a03fa7a50 fix(dnn/cuda): disable pascal sass conv2d 4 years ago
  Megvii Engine Team a5fad7d07c feat(dnn): add compile for riscv64 4 years ago
  Megvii Engine Team 3e11d89415 fix(dnn/dump): add more info for dump CD4 4 years ago
  Megvii Engine Team 76fa71573b feat(dnn/cuda): add cutlass nchw4 convolution 4 years ago
  Megvii Engine Team 1f3f4abc38 fix(dnn): fix compile warnings 4 years ago
  Megvii Engine Team 5b6ebeb563 fix(mgb): append json file for dump and ready for midout open source 4 years ago
  Megvii Engine Team 16324e3076 feat(dnn/cuda): add remap backward 5 years ago
  Megvii Engine Team bd73dabbe2 fix(dnn/build): add CUDNN_INCLUDE_DIR to the megdnn_test target 4 years ago
  Megvii Engine Team 343335932a fix(dnn/arm): fix read invalid data in arm kernel 4 years ago
  Megvii Engine Team 59dcd3b7f3 fix(mgb/build): do not install cutlass 4 years ago
  Megvii Engine Team 6e882c1a86 feat(whl/imperative): compat for build python whl imperative and legacy runtime 4 years ago
  Megvii Engine Team 7f857bd471 feat(mgb/rocm): add cmake for rocm and fix compile errors and bn 4 years ago
  Megvii Engine Team 199eefbd4c fix(dnn): generate mode files 4 years ago
  Megvii Engine Team 9510136223 fix(mgb/rocm): remove begin-internal of rocm 4 years ago
  Megvii Engine Team 6b380e8965 feat(mge/imperative): run oss test and restore cmake list build items 4 years ago
  Megvii Engine Team 0380811218 feat(dnn/arm_common): add nchw44 8x8x16 stride1 stride2 4 years ago
  Megvii Engine Team 00ef677249 fix(mgb): remove internal for cambricon and atlas 4 years ago
  Megvii Engine Team aeffcd5897 feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution 4 years ago
  Megvii Engine Team 9e5e32dee2 fix(dnn): restore opr_param_defs.py 4 years ago
  Megvii Engine Team d334b229b0 feat(imperative): add nms opr wrapper 4 years ago
  Megvii Engine Team bca00f2e22 fix(dnn): midout at where neccessary in megdnn 4 years ago
  Megvii Engine Team a1e6720756 feat(dnn): enable bool comparison 4 years ago
  Megvii Engine Team 8aa34e4a5d feat(imperative): add advance indexing with bool 4 years ago
  Megvii Engine Team 101b58d1ca fix(dnn): enable bool input to cond_take 4 years ago
  Megvii Engine Team 4a178a8dba feat(windows/cuda/cmake): support cmake cuda build on windows 4 years ago
  Megvii Engine Team 6aade1336d fix(dnn/fallback): disable im2col/conv1x1/conv1x1_gemv Quantized8Asymm in x86 4 years ago
  Megvii Engine Team 56381f808b fix(dnn/arm): use vcvtq_f32_s32 for all arm code 4 years ago
  Megvii Engine Team 1173205726 fix(gopt): nchw_nchwxx useable and opt pass use nchw_nchwxx_valid 4 years ago
  Megvii Engine Team eb18eba87d fix(gopt): fix nchw44 nchw44_dot gopt test 4 years ago
  Megvii Engine Team 40e79e9dab fix(dnn/x86): fix x86 matrix usable ignore format 4 years ago
  Megvii Engine Team 2272abe18d fix(mgb/fallback): disable nchw44 in conv1x1 and im2col in x86 4 years ago