2268 Commits (v1.5.0)
 

Author SHA1 Message Date
  Megvii Engine Team 1c7d0802ab fix(cuda): remove cuda driver version check and runtime minor version 4 years ago
  Megvii Engine Team b87af9f77f feat(dnn/cuda): topk support fp16 4 years ago
  Megvii Engine Team 34262d904c fix(imperative): fix the size of blob with offset 4 years ago
  Megvii Engine Team 787f187e7d fix(imperative/src): fix dot backward error 4 years ago
  megvii-mge f35687caa3
Merge pull request #181 from haolongzhangm/update-readme 3 years ago
  zhanghaolong bf2e2d29bd Update README.md 4 years ago
  Json Lee 8d76576fac fix typos 4 years ago
  huangxinda e67fefcdca ci(yml): enable try-import branch invoke ci on push 4 years ago
  tpoisonooo 7038a7f5d0 fix(quant): fix spell error 4 years ago
  Megvii Engine Team 4d72e7071d deps: update cutlass 4 years ago
  Megvii Engine Team 355153e158 feat(mge/dtr): add DTR in computing graph 4 years ago
  Megvii Engine Team 76f4f97536 refactor(sublinear): add SeqModifierBase 4 years ago
  Megvii Engine Team f584416aa2 fix(dnn/bn): revise the conditions for inplace flag 4 years ago
  Megvii Engine Team a9b60fbfb5 fix(ci/lite): reopen lite_test build by cmake 4 years ago
  Megvii Engine Team 2eea00097c feat(mgb): add fast run batch size graph option 4 years ago
  Megvii Engine Team 0ac642b5d5 fix(imperative): persistent cache write through on put 4 years ago
  Megvii Engine Team 47dcdf3e17 fix(mgb/core): fix dtype and resize modifiers for tensor 4 years ago
  Megvii Engine Team 29f7cdb84a fix(mgb/opr): correct nvof out shape computation 4 years ago
  Megvii Engine Team 71cc814eaf feat(ci): add aarch64 linux ci 4 years ago
  Megvii Engine Team 31a1f53817 feat(whl/opencl): enable OpenCL in python whl 4 years ago
  Megvii Engine Team b07f372835 feat(aarch64/whl): support aarch64 whl 4 years ago
  Megvii Engine Team d8ee0d7b5c fix(mge/distributed): fix the mutli dataloader test error 4 years ago
  Megvii Engine Team e275dfeca1 feat(imperative/python): support pooling mode "average" for avg pool2d module 4 years ago
  Megvii Engine Team 03ab8136e7 fix(core): fix asan error cause by wild thread_pool ptr 4 years ago
  Megvii Engine Team 24a3878130 feat(dnn/cuda): add nchw conv u4xs4 support 4 years ago
  Megvii Engine Team 606540bef4 feat(dnn/cuda): add nhwc 4bit warp perspective 4 years ago
  Megvii Engine Team 1e6019436c feat(dnn/cuda): add nhwc int4 pooling 4 years ago
  Megvii Engine Team 0fb9cc41e4 fix(gopt): fix nchw64 opt pass 4 years ago
  Megvii Engine Team e661ae904f feat(dnn/cuda): add base class for cutlass uint4 and int4 algos 4 years ago
  Megvii Engine Team 319436dd14 feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias 4 years ago
  Megvii Engine Team d28eba4ea5 feat(dnn/cuda): add cutlass impls for int4 conv bias 4 years ago
  Megvii Engine Team 14b65e4da7 feat(dnn/cuda): add reduce_filter_and_update_bias 4 years ago
  Megvii Engine Team 2d4e62ef58 feat(dnn/cuda): add cuda uint4 pooling 4 years ago
  Megvii Engine Team 19919384fc feat(dnn/cuda): add cuda uint warp perspective 4 years ago
  Megvii Engine Team 01354337a9 fix(mge/autodiff): fix incorrect handling of tuple dy 4 years ago
  Megvii Engine Team 5868d1fe4f fix(arm_common/pooling): check mode in pooling algo to avoid wrong use AVERAGE_COUNT_EXCLUDE_PADDING 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team 4a802d21ca feat(dnn/cuda): add conv u4xs4 sass kernel 4 years ago
  Megvii Engine Team adf75a291d perf(dnn/cuda): add sass int4 128x128 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago
  Megvii Engine Team c218d4b029 feat(dnn/cuda): fallback conv qs4 support channel not aligend to 64 4 years ago
  Megvii Engine Team 4fe68ac9ed feat(dnn/cuda): support transforming layout between nchw and nchw64 when channel not aligned to 64 4 years ago
  Megvii Engine Team ae6ff2c5a6 feat(mgb/gopt): add opt pass for nchw64 layout transform 4 years ago
  Megvii Engine Team 63a9bd30a8 feat(mgb/gopt): add an opt pass for padding channels to enable fast int8/int4 support on GPU 4 years ago
  Megvii Engine Team 56e863b7d4 fix(dnn/cuda): fix int4 epilogue stg bug 4 years ago
  Megvii Engine Team cff61a53d4 perf(dnn/cuda): optimize int4 sass conv main loop and epilogue without fuse_z 4 years ago
  Megvii Engine Team 12a0e61542 feat(dnn/cuda): add cuda elemwise int4 4 years ago
  Megvii Engine Team df1af59b5c feat(dnn): warp perspective support int4 4 years ago
  Megvii Engine Team 2398df079c feat(dnn/cuda): add cuda int4 pooling 4 years ago
  Megvii Engine Team 2a2a7f4552 test(mgb/opr): add testcase for conv bias int4 4 years ago