793 Commits (dev-support-lite-fork-debug-mode)

Author SHA1 Message Date
  Megvii Engine Team bba04f02e5 feat(mgb/gopt): add fusion support for conv, astype(s4) and reformat 4 years ago
  Megvii Engine Team 6d686ff26f feat(gopt/inference): allow Float32 output dtype in EnableNCHW64Pass 4 years ago
  Megvii Engine Team 7d3df995cb feat(gopt/inference): allow Float32 output dtype in EnableNCHW4Pass 4 years ago
  Megvii Engine Team e6caa9ff89 feat(opr): add bn backward for inference mode 4 years ago
  Megvii Engine Team 77ead9377b fix(src/serialization): fix compatibility error of oss model 3 years ago
  Megvii Engine Team 07de15713c fix(mgb): remove static mem record from tee 4 years ago
  Megvii Engine Team 4e4497b903 refactor(mgb/dnn): x86 pooling rebase algochooser 3 years ago
  Megvii Engine Team 8a73193c2d feat(dtr): remove eviction threshold 3 years ago
  Megvii Engine Team 43098fb8f1 feat(mge): add SlidingWindowTranspose opr 4 years ago
  Megvii Engine Team 1eaf32cd78 fix(mgb): fix typo in message 4 years ago
  Megvii Engine Team 2cd9823210 fix(mgb/tensorrt): fix trt runtime, padding channel to a multiple of 4 when using kCHW4 IOFormat 3 years ago
  Megvii Engine Team b078dda90b feat(mge/random): add some random op and remove random/distrbution.py 4 years ago
  Megvii Engine Team f30c0e06a6 feat(mgb/opr): add lsq opr 4 years ago
  Megvii Engine Team 6cd01d5a74 feat(imperative/functional): let elemwise support empty IO & add some tests 4 years ago
  Megvii Engine Team dea5278172 feat(mgb/opr): let PowC & TypeCvt support empty IO 4 years ago
  Megvii Engine Team 2f68aeb9b6 feat(imperative/jit): let trace support empty IO 4 years ago
  Megvii Engine Team 809d5056cd feat(mge/distributed): enable pt shm allreduce 4 years ago
  Megvii Engine Team 88898e63a5 fix(mgb): replace if_constexpr with runtime function to avoid potential 4 years ago
  Megvii Engine Team 1cfdbc565c feat(dnn): add deterministic max pooling 4 years ago
  Megvii Engine Team 933dd9a497 feat(mge/distributed): add cuda env check before forked thread 4 years ago
  Megvii Engine Team 2a54196117 fix(tee): fix tee link 4 years ago
  Megvii Engine Team a5060a2bfe feat(mgb/opr): add check_has_inf kernel and opr 4 years ago
  Megvii Engine Team 3597a6dbd7 feat(dnn/arm): nchw_nchw44 conv support 1x1s1 4 years ago
  Megvii Engine Team 40085acbae fix(mgb): remove unnecessary cudnn8 warning 4 years ago
  Megvii Engine Team 54a4d70eb5 feat(src/serialization): add support of serializing metadata 4 years ago
  Megvii Engine Team 721091faf0 fix(core): fix thread local is not supported in ios 4 years ago
  Megvii Engine Team 62bd6c823b feat(cmake/debug): misc for build 4 years ago
  Megvii Engine Team 3e4e4c4604 feat(mgb/jit): add graph_opt_config and jit_config interfaces 4 years ago
  Megvii Engine Team 1c7d0802ab fix(cuda): remove cuda driver version check and runtime minor version 4 years ago
  tpoisonooo 7038a7f5d0 fix(quant): fix spell error 4 years ago
  Megvii Engine Team 355153e158 feat(mge/dtr): add DTR in computing graph 4 years ago
  Megvii Engine Team 76f4f97536 refactor(sublinear): add SeqModifierBase 4 years ago
  Megvii Engine Team f584416aa2 fix(dnn/bn): revise the conditions for inplace flag 4 years ago
  Megvii Engine Team 2eea00097c feat(mgb): add fast run batch size graph option 4 years ago
  Megvii Engine Team 47dcdf3e17 fix(mgb/core): fix dtype and resize modifiers for tensor 4 years ago
  Megvii Engine Team 29f7cdb84a fix(mgb/opr): correct nvof out shape computation 4 years ago
  Megvii Engine Team 03ab8136e7 fix(core): fix asan error cause by wild thread_pool ptr 4 years ago
  Megvii Engine Team 0fb9cc41e4 fix(gopt): fix nchw64 opt pass 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team adf75a291d perf(dnn/cuda): add sass int4 128x128 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago
  Megvii Engine Team c218d4b029 feat(dnn/cuda): fallback conv qs4 support channel not aligend to 64 4 years ago
  Megvii Engine Team ae6ff2c5a6 feat(mgb/gopt): add opt pass for nchw64 layout transform 4 years ago
  Megvii Engine Team 63a9bd30a8 feat(mgb/gopt): add an opt pass for padding channels to enable fast int8/int4 support on GPU 4 years ago
  Megvii Engine Team 858261af1f fix(python_module): fix conversion between numpy-ndarray and mgb tensor for qint4 and quint4 4 years ago
  Megvii Engine Team 3b9b87809d refactor(dnn): refactor lowbit tensor format 4 years ago
  Megvii Engine Team 2d6827c168 fix(mgb/windows): temporary workround on cuda-windows python exit 4 years ago
  Megvii Engine Team d2e33af52f fix(mgb): fix wrong set of strategy in lar 4 years ago
  Megvii Engine Team 8b7d8d290b fix(core): fix json dump when weight preprocess 4 years ago
  Megvii Engine Team ec65e1f9ba fix(build/windows): fix windows build: 4 years ago