666 Commits (c53cad2049d99c6ada2b0111a6a77d8682f0d83f)

Author SHA1 Message Date
  Megvii Engine Team 3e4e4c4604 feat(mgb/jit): add graph_opt_config and jit_config interfaces 4 years ago
  Megvii Engine Team 1c7d0802ab fix(cuda): remove cuda driver version check and runtime minor version 4 years ago
  tpoisonooo 7038a7f5d0 fix(quant): fix spell error 4 years ago
  Megvii Engine Team 355153e158 feat(mge/dtr): add DTR in computing graph 4 years ago
  Megvii Engine Team 76f4f97536 refactor(sublinear): add SeqModifierBase 4 years ago
  Megvii Engine Team f584416aa2 fix(dnn/bn): revise the conditions for inplace flag 4 years ago
  Megvii Engine Team 2eea00097c feat(mgb): add fast run batch size graph option 4 years ago
  Megvii Engine Team 47dcdf3e17 fix(mgb/core): fix dtype and resize modifiers for tensor 4 years ago
  Megvii Engine Team 29f7cdb84a fix(mgb/opr): correct nvof out shape computation 4 years ago
  Megvii Engine Team 03ab8136e7 fix(core): fix asan error cause by wild thread_pool ptr 4 years ago
  Megvii Engine Team 0fb9cc41e4 fix(gopt): fix nchw64 opt pass 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team adf75a291d perf(dnn/cuda): add sass int4 128x128 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago
  Megvii Engine Team c218d4b029 feat(dnn/cuda): fallback conv qs4 support channel not aligend to 64 4 years ago
  Megvii Engine Team ae6ff2c5a6 feat(mgb/gopt): add opt pass for nchw64 layout transform 4 years ago
  Megvii Engine Team 63a9bd30a8 feat(mgb/gopt): add an opt pass for padding channels to enable fast int8/int4 support on GPU 4 years ago
  Megvii Engine Team 858261af1f fix(python_module): fix conversion between numpy-ndarray and mgb tensor for qint4 and quint4 4 years ago
  Megvii Engine Team 3b9b87809d refactor(dnn): refactor lowbit tensor format 4 years ago
  Megvii Engine Team 2d6827c168 fix(mgb/windows): temporary workround on cuda-windows python exit 4 years ago
  Megvii Engine Team d2e33af52f fix(mgb): fix wrong set of strategy in lar 4 years ago
  Megvii Engine Team 8b7d8d290b fix(core): fix json dump when weight preprocess 4 years ago
  Megvii Engine Team ec65e1f9ba fix(build/windows): fix windows build: 4 years ago
  Megvii Engine Team b06b589960 feat(mgb): get static graph memory info 4 years ago
  Megvii Engine Team 3591ef1f6a fix(mgb): fix conv cudnnconvbackwarddata algo witch is not shake 4 years ago
  Megvii Engine Team 1525a02530 feat(mge/module): add python wrapper for unfold 4 years ago
  Megvii Engine Team 13b15fb08c feat(megbrain): add correlation opr 4 years ago
  Megvii Engine Team 9b69a02fb2 fix(imperative/tensor): init m_offset when constructing a Tensor with DeviceTensorND 4 years ago
  Megvii Engine Team 36b1ba052f fix(mgb/dnn): fix cudnn8.0.4 convbias with z 4 years ago
  Megvii Engine Team 6ab1c55d2c fix(mgb): fix fastrun workspace limit 4 years ago
  Megvii Engine Team 13e6ea349d feat(imperative/opr): rebase rng refactoring to dev & add python module 4 years ago
  Megvii Engine Team 40bab1ed66 feat(log): opt log, enable mgb sdk log at opt build 4 years ago
  Megvii Engine Team 00b48dfe7f Revert "perf(opr): use pin mem for param_pack_concat" 4 years ago
  Megvii Engine Team e18afa0b05 feat(mge/module): python wrapper for conv_transpose3d 4 years ago
  Megvii Engine Team f16c9eb9d7 refactor(mgb): simplify fast run proccess with storing algo desc instead of algo name 4 years ago
  Megvii Engine Team 5d637d0723 refactor(mgb): code refactor of fast run 4 years ago
  Megvii Engine Team f6bd4f59f7 fix(mgb): fix attribute check compatibility when profiling and read_from_cache in fast run 4 years ago
  Megvii Engine Team 1e6ef3771f feat(mgb/dnn): add accuracy shake checker 4 years ago
  Megvii Engine Team 4b141f8de4 fix(mgb): add usable-depend-on-shape attr 4 years ago
  Megvii Engine Team 15b647aee2 fix(externcopr): check loader imp dynmaic param 4 years ago
  Megvii Engine Team 3bd0df8e12 fix(rocm): enable var_releaser for rocm 4 years ago
  Megvii Engine Team 21c6c4371d perf(opr): use pin mem for param_pack_concat 4 years ago
  Megvii Engine Team 1a7112997c feat(opr-mm): add backend argument for remote send/recv 4 years ago
  Megvii Engine Team 1bec737d1a feat(distributed): support distributed opr for rocm 4 years ago
  Megvii Engine Team 6de3e4baa3 refactor(mgb/opr): make trt batch flag only depend on inputs dimension 4 years ago
  Megvii Engine Team ce610ca31a fix(mgb): fix attribute uncomplete filter when get_profile_result_from_cache in fast run 4 years ago
  Megvii Engine Team c9348b162f build(cuda): link to cuda_stub 4 years ago
  Megvii Engine Team 8585aa61ad fix(mgb): fix fast run crash when profile heuristic strategy 4 years ago
  Megvii Engine Team 62c394ca91 feat(cuda/comp_node): enable to query adjacent free blocks size 4 years ago
  Megvii Engine Team fd61f09540 feat(cuda/comp_node): enable to directly query memory status 4 years ago