176 Commits (105e44504ad5d9ff34bee48d5ac2bc0df33da0b6)

Author SHA1 Message Date
  Megvii Engine Team a1e6720756 feat(dnn): enable bool comparison 4 years ago
  Megvii Engine Team 4a178a8dba feat(windows/cuda/cmake): support cmake cuda build on windows 4 years ago
  Megvii Engine Team 1915593b6c fix(version_depend): add a fake version on dev 4 years ago
  Megvii Engine Team 56381f808b fix(dnn/arm): use vcvtq_f32_s32 for all arm code 4 years ago
  Megvii Engine Team 1173205726 fix(gopt): nchw_nchwxx useable and opt pass use nchw_nchwxx_valid 4 years ago
  Megvii Engine Team eb18eba87d fix(gopt): fix nchw44 nchw44_dot gopt test 4 years ago
  Megvii Engine Team eab7ab0530 fix(gopt): gen nchw_nchw44 when kernel is optimized 4 years ago
  Megvii Engine Team 777f3ea970 refactor(gopt): format code 4 years ago
  Megvii Engine Team 30ce3c60bd Revert "fix(mgb/opr): change EQ opr's backward_graph to nullptr instead of InvalidGrad" 4 years ago
  Megvii Engine Team 14e71b551b feat(imperative): add helper for dnn opr caller 4 years ago
  Megvii Engine Team 230ab45a1e fix(mgb/naive): fix naive convolution no dispatch kernel in handle 4 years ago
  Megvii Engine Team 1bce857cb8 fix(mgb/opr-mm): use comp_node of config as default in CollectiveComm 4 years ago
  Megvii Engine Team 27205461ae feat(mgb/opr-mm): add register info cache for multi-machine oprs 4 years ago
  Megvii Engine Team 96ec586d28 fix(dnn): fix bool cvt 4 years ago
  Megvii Engine Team f829f836b9 test(mgb/index): add empty index desc tests 4 years ago
  Megvii Engine Team e73f2799d0 fix(mgb/index): enable index desc empty 4 years ago
  Megvii Engine Team ff60fdb82d feat(dnn): add bool type cvt on gpu 4 years ago
  Megvii Engine Team e8571cca51 fix(mgb/cuda): fix cuda host alloc set device 4 years ago
  Megvii Engine Team f7b5eced23 refactor(mgb/opr-mm): set False as default value of local_grad 4 years ago
  Megvii Engine Team c7b6ef35c1 feat(dnn/cuda): add warp perspective backward mat idx 5 years ago
  Megvii Engine Team 09b5f3d434 fix(mgb/core): fix multi thread pool deactive and multi thread conflict 4 years ago
  Megvii Engine Team ef239f835f feat(windows/python_whl): make windows HAPPY for build megbrain python package 4 years ago
  Megvii Engine Team e258812f12 feat(dnn): add bool dtype 4 years ago
  Megvii Engine Team 734c498d27 perf(mgb/core): improve DevMemAlloc when it has single stream 4 years ago
  Megvii Engine Team 39bd66fc63 fix(mgb): fix TensorRT missing cudaSetDevice 4 years ago
  Megvii Engine Team ab9dfbcefc test(mgb): fix tensorrt tests missing cudaSetDevice 4 years ago
  Megvii Engine Team b43fb1a97c perf(mgb): add CUDA host memory allocator 4 years ago
  Megvii Engine Team 2afceb4187 fix(mgb/atlas): use dyn output alloc if enable dynamic batchsize 4 years ago
  Megvii Engine Team 6bcc6faec8 feat(mge/imperative/opr): modify batch_norm to support frozen BN 4 years ago
  Megvii Engine Team 54d18115b6 fix(imperative): fix grad of BatchNorm 4 years ago
  Megvii Engine Team 80c4705317 perf(mgb): use midout in megbrain to reduce binary size 4 years ago
  Megvii Engine Team 4348960c40 fix(mge/gopt): fix fp16 compute mode 4 years ago
  Megvii Engine Team 9f4060b050 fix(mgb/gopt): add ShuffleShuffleRemovePass assert 4 years ago
  Megvii Engine Team 9b0978597c chore(core): remove index_sequence polyfill for pre-C++14 4 years ago
  Megvii Engine Team 074645d82e feat(core): add array_skip 4 years ago
  Megvii Engine Team 2faab90bd0 feat(core): add a naive if_constexpr for C++14 4 years ago
  Megvii Engine Team 8f42fe0d92 feat(core): import a std::apply polyfill 4 years ago
  Megvii Engine Team be5dde9da8 fix(mgb/opr-mm): fix group manager sync error 4 years ago
  Megvii Engine Team 2dc9190055 fix(fastrun/persistent_cache): fix fastrun crash 4 years ago
  Megvii Engine Team c1fb3c8c1a fix(mgb/gopt): fix ConvertF32ToF16Pass endpoints 4 years ago
  Megvii Engine Team 618faf6497 feat(mgb/profiler): dump static infer info 4 years ago
  Megvii Engine Team 5c224c71f0 refactor(mge/imperative): drop ComputingGraphImpl in proxy graph 4 years ago
  Megvii Engine Team 5e7d2a91c2 refactor(mgb): add TensorND::proxy_to_default_cpu 4 years ago
  Megvii Engine Team 9e9e8ca0fe fix(mgb): fix some compile errors 4 years ago
  Megvii Engine Team d782edf80f refactor(mgb): decouple node insertion from ComputingGraphImpl 5 years ago
  Megvii Engine Team d42cf4cd65 refactor(mgb): replace static_cast<ComputingGraphImpl*> with a checked version 5 years ago
  Megvii Engine Team 55844d3e24 fix(core): fix CPU compnode get_ram_status_byte free_memory error 5 years ago
  Megvii Engine Team 672d4ad0e0 fix(jit): more testcases on the grad of JITExecutor 5 years ago
  Megvii Engine Team bc95e873ef fix(jit): fix jit grad 5 years ago
  Megvii Engine Team 4cf193ab97 fix(mgb/opr-mm): fix device id in TestOprIORemote.IdentityWithGopt 4 years ago