2286 Commits (release-1.8)

Author SHA1 Message Date
  megvii-mge c42ce93705 feat(mge/third_party): update cutlass version 3 years ago
  温娟 9902ccfcb0 chore(release): bump version 3 years ago
  Megvii Engine Team 8e5410e41f feat(cuda): add fp16 compute 16 kernel 3 years ago
  Megvii Engine Team 472e2f9655 refactor(cuda): depthwish large kernel 3 years ago
  Megvii Engine Team e698ec20c2 feat(cuda): float16 depthwise large kernel conv compute fp32 3 years ago
  Megvii Engine Team 48406382ce feat(cuda): support float16 depthwise large kernel conv 3 years ago
  Megvii Engine Team 7042f76b34 perf(cuda): speedup conv backward data with small feature map and large filter size 3 years ago
  Megvii Engine Team 87a2aeebb1 perf(cuda): speedup chanwise conv with small feature map and large filter size 3 years ago
  Megvii Engine Team 2293385e93 feat(mge): add conv padding mode 3 years ago
  Megvii Engine Team afe9c4b50d feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
  Megvii Engine Team e8a169292f feat(dnn/cuda): add heuristic rule for implicit batched gemm large kernel dwconv2d kernels 3 years ago
  Megvii Engine Team 38067472d2 fix(dnn/cuda): fix ci 3 years ago
  Megvii Engine Team 1da58ae17a feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels 3 years ago
  Megvii Engine Team 96050073a2 feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl 3 years ago
  温娟 19fe2e94e7 chore(release): bump version 3 years ago
  Megvii Engine Team 1add4517ad test(trace): test subtensor on unknown shape 3 years ago
  Megvii Engine Team 54eef55871 fix(trace): assume result is not scalar when shape is valid 3 years ago
  Megvii Engine Team 84d99d1cc4 fix(traced_module): fix Module compatible issue and traced module getattr check 3 years ago
  Megvii Engine Team 275b63114d fix(imperative): fix use collections error from python3.10 3 years ago
  Megvii Engine Team 95ac055538 feat(dnn,mgb,imperative): add diag opr implement 3 years ago
  Megvii Engine Team 39d77fb55a feat(arm): add arm rnn_cell/lstm_cell/lstm optimized kernel 3 years ago
  Megvii Engine Team 3ddc32d3e3 feat(android/whl): support android whl 3 years ago
  Megvii Engine Team f509b1be9b fix(build): split elemwise_multi_type cpp 3 years ago
  Megvii Engine Team 3252016e05 Merge pull request #401 from LosReturn:patch-1 3 years ago
  Megvii Engine Team f7e034b506 feat(lite): add global layout transform python interface for lite 3 years ago
  Megvii Engine Team e70c07a223 feat(lite): add global layout transform c/c++ interface for lite 3 years ago
  Megvii Engine Team 86ee4638bf Merge pull request #402 from AA1HSHH:docstring-reshape 3 years ago
  Megvii Engine Team 3251f50114 fix(mgb/cuda-stub): add libcuda-wrap_11.4.h to fit the CUDA11.4 toolchain 3 years ago
  Megvii Engine Team 2c2df83051 fix(cmake): enable custom op when building develop to avoid the pytest fail 3 years ago
  Megvii Engine Team ee0b95e935 feat(dnn/elemwise/arm_common): support part of arm ternary elemwise multithread 3 years ago
  Megvii Engine Team 7ea104d788 Revert "fix(mge): replace _full_sync by sync" 3 years ago
  Megvii Engine Team cbbca5fb10 feat(mge): add softmax op use cudnn api 3 years ago
  Megvii Engine Team 1d2510b6d7 fix(module): fix module dumped in old version without _short_name attr 3 years ago
  Megvii Engine Team cf5e9488bb fix(traced_module): fix module trace transformation 3 years ago
  Megvii Engine Team 97c90d9137 feat(traced_module): add _exclude_from_trace 3 years ago
  Megvii Engine Team 30e565e5b8 fix(traced_module): fix error message 3 years ago
  Megvii Engine Team de8ffe0c12 refactor(imperative): unify interpreter option setting 3 years ago
  Megvii Engine Team 8b60bdfa10 fix(mge): replace _full_sync by sync 3 years ago
  Megvii Engine Team 20b42a8c3b fix(dnn): add naive lstm kernel 3 years ago
  Megvii Engine Team 2faa6ea5a9 Merge pull request #213 from kxz18:rnn 3 years ago
  Megvii Engine Team f5b8fec4ca fix(imperative): remove big tensor from host side 3 years ago
  Megvii Engine Team 68cde8734e fix(mge/imperative): support broadcast with None 3 years ago
  Megvii Engine Team 0bdd0b1467 refactor(dispatch): switch to new dispatch system 3 years ago
  Megvii Engine Team d3689c3f3c feat(imperative/python): add transformation manager 3 years ago
  Megvii Engine Team 9ce1f0f5d1 refactor(dispatch): implement grad 3 years ago
  Megvii Engine Team c609c031f1 refactor(dispatch): implement symbol 3 years ago
  Megvii Engine Team e32929dfd2 refactor(dispatch): implement scalar 3 years ago
  Megvii Engine Team 59084fa857 refactor(dispatch): implement lazy_eval 3 years ago
  Megvii Engine Team d2b67c2a88 refactor(dispatch): implement trace 3 years ago
  Megvii Engine Team 39ac606b9c refactor(dispatch): implement eval 3 years ago