793 Commits (dev-support-lite-fork-debug-mode)

Author SHA1 Message Date
  Megvii Engine Team 76fa71573b feat(dnn/cuda): add cutlass nchw4 convolution 4 years ago
  Megvii Engine Team a558d4a253 fix(mgb/atlas): remove unnessary setdevice 4 years ago
  Megvii Engine Team 1fe8a21299 fix(mge): fix sublinear memory in jit.trace 4 years ago
  Megvii Engine Team 651920c7ad fix(dnn): fix nchw88 winograd weight preprocess 4 years ago
  Megvii Engine Team d06f248d90 fix(whl/mgb/imperative): fix symbols conflict runtime crash 4 years ago
  Megvii Engine Team 0e82b959a1 feat(mge/imperative): add sublinear options 4 years ago
  Megvii Engine Team 50d5421aa7 fix(mgb/zmq): fix unused-result when compiling with c++17 4 years ago
  Megvii Engine Team 16324e3076 feat(dnn/cuda): add remap backward 5 years ago
  Megvii Engine Team 968f74ce88 chore(mgb): add no_force_inplace option to ComputingGraph 4 years ago
  Megvii Engine Team 6e882c1a86 feat(whl/imperative): compat for build python whl imperative and legacy runtime 4 years ago
  Megvii Engine Team 7f857bd471 feat(mgb/rocm): add cmake for rocm and fix compile errors and bn 4 years ago
  Megvii Engine Team 9510136223 fix(mgb/rocm): remove begin-internal of rocm 4 years ago
  Megvii Engine Team 6b380e8965 feat(mge/imperative): run oss test and restore cmake list build items 4 years ago
  Megvii Engine Team 2dbe8194ad fix(mge/opr): fix reduction static infer value 4 years ago
  Megvii Engine Team c20d4cc6dc feat(dnn): fix opt pass nchw44 can not dump resnet 4 years ago
  Megvii Engine Team 00ef677249 fix(mgb): remove internal for cambricon and atlas 4 years ago
  Megvii Engine Team bca00f2e22 fix(dnn): midout at where neccessary in megdnn 4 years ago
  Megvii Engine Team a1e6720756 feat(dnn): enable bool comparison 4 years ago
  Megvii Engine Team 4a178a8dba feat(windows/cuda/cmake): support cmake cuda build on windows 4 years ago
  Megvii Engine Team 1915593b6c fix(version_depend): add a fake version on dev 4 years ago
  Megvii Engine Team 56381f808b fix(dnn/arm): use vcvtq_f32_s32 for all arm code 4 years ago
  Megvii Engine Team 1173205726 fix(gopt): nchw_nchwxx useable and opt pass use nchw_nchwxx_valid 4 years ago
  Megvii Engine Team eb18eba87d fix(gopt): fix nchw44 nchw44_dot gopt test 4 years ago
  Megvii Engine Team eab7ab0530 fix(gopt): gen nchw_nchw44 when kernel is optimized 4 years ago
  Megvii Engine Team 777f3ea970 refactor(gopt): format code 4 years ago
  Megvii Engine Team 30ce3c60bd Revert "fix(mgb/opr): change EQ opr's backward_graph to nullptr instead of InvalidGrad" 4 years ago
  Megvii Engine Team 14e71b551b feat(imperative): add helper for dnn opr caller 4 years ago
  Megvii Engine Team 230ab45a1e fix(mgb/naive): fix naive convolution no dispatch kernel in handle 4 years ago
  Megvii Engine Team 1bce857cb8 fix(mgb/opr-mm): use comp_node of config as default in CollectiveComm 4 years ago
  Megvii Engine Team 27205461ae feat(mgb/opr-mm): add register info cache for multi-machine oprs 4 years ago
  Megvii Engine Team 96ec586d28 fix(dnn): fix bool cvt 4 years ago
  Megvii Engine Team f829f836b9 test(mgb/index): add empty index desc tests 4 years ago
  Megvii Engine Team e73f2799d0 fix(mgb/index): enable index desc empty 4 years ago
  Megvii Engine Team ff60fdb82d feat(dnn): add bool type cvt on gpu 4 years ago
  Megvii Engine Team e8571cca51 fix(mgb/cuda): fix cuda host alloc set device 4 years ago
  Megvii Engine Team f7b5eced23 refactor(mgb/opr-mm): set False as default value of local_grad 4 years ago
  Megvii Engine Team c7b6ef35c1 feat(dnn/cuda): add warp perspective backward mat idx 5 years ago
  Megvii Engine Team 09b5f3d434 fix(mgb/core): fix multi thread pool deactive and multi thread conflict 4 years ago
  Megvii Engine Team ef239f835f feat(windows/python_whl): make windows HAPPY for build megbrain python package 4 years ago
  Megvii Engine Team e258812f12 feat(dnn): add bool dtype 4 years ago
  Megvii Engine Team 734c498d27 perf(mgb/core): improve DevMemAlloc when it has single stream 4 years ago
  Megvii Engine Team 39bd66fc63 fix(mgb): fix TensorRT missing cudaSetDevice 4 years ago
  Megvii Engine Team ab9dfbcefc test(mgb): fix tensorrt tests missing cudaSetDevice 4 years ago
  Megvii Engine Team b43fb1a97c perf(mgb): add CUDA host memory allocator 4 years ago
  Megvii Engine Team 2afceb4187 fix(mgb/atlas): use dyn output alloc if enable dynamic batchsize 4 years ago
  Megvii Engine Team 6bcc6faec8 feat(mge/imperative/opr): modify batch_norm to support frozen BN 4 years ago
  Megvii Engine Team 54d18115b6 fix(imperative): fix grad of BatchNorm 4 years ago
  Megvii Engine Team 80c4705317 perf(mgb): use midout in megbrain to reduce binary size 4 years ago
  Megvii Engine Team 4348960c40 fix(mge/gopt): fix fp16 compute mode 4 years ago
  Megvii Engine Team 9f4060b050 fix(mgb/gopt): add ShuffleShuffleRemovePass assert 4 years ago