1946 Commits (369c2ccc5a4906e1e043c6e54321b4a33a14bed0)
 

Author SHA1 Message Date
  Megvii Engine Team 5f558042b2 fix(imperative/ops): use tblgen to generate FastpathCopy 3 years ago
  Megvii Engine Team bfc4e7a966 docs(mge): fix amp docstring problems 3 years ago
  Megvii Engine Team 0b764cf2d2 docs(mge/functional): add docs for megengine.functional.full_like 3 years ago
  Megvii Engine Team f141159088 refactor(mge): loose the error bound of fastrun 3 years ago
  Megvii Engine Team 1f0436967c refactor(mge/distributed): using nccl as default in distributed training 3 years ago
  Megvii Engine Team b17a02d44a feat(mge/distributed): deprecate get_device_count_by_fork 3 years ago
  Megvii Engine Team f8b0f2cb91 build(dnn/cutlass): fix build for cutlass 3 years ago
  konghuanjun 0fb4e9a9ca fix(ci): git set user and email 4 years ago
  huangxinda 6af4a32e17 feat(mge/third_party): update MegRay version 3 years ago
  huangxinda 093f7ae774 feat(mge/third_party): update cutlass version 3 years ago
  Megvii Engine Team c2daea3cba chore(release): bump version 3 years ago
  Megvii Engine Team 207a346351 chore(mge): run get_device_count("gpu") in subprocess 4 years ago
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 3 years ago
  liuke 0baf6b0d63 Merge pull request #175 from tpoisonooo:fix-spell-error 3 years ago
  Megvii Engine Team 239916a997 fix(mgb/gopt): fix testcase for enable nchw64 pass 4 years ago
  Megvii Engine Team 2ab5c53f1d feat(mgb/gopt): support nhwc conv in tensor reformat pass 4 years ago
  Megvii Engine Team 009c90a2fe feat(mgb/gopt): modify padding policy for 4bit conv bias oprs 4 years ago
  Megvii Engine Team 4eda338876 feat(dnn/cuda): generate cutlass kimpls using cmake and bazel 4 years ago
  Megvii Engine Team 8d248a6a9a fix(dnn/cuda): fix testcase for fallback nchw qs8 conv 4 years ago
  Megvii Engine Team 894a2407c2 feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc 4 years ago
  Megvii Engine Team 43c59204df refactor(dnn/cuda): refactor relayout format kernels 4 years ago
  Megvii Engine Team f41a808694 feat(dnn/cuda): add nhwc int4 conv support 4 years ago
  Megvii Engine Team 5a14a89224 refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv 4 years ago
  Megvii Engine Team b33217d8f0 refactor(dnn/cuda): refactor cutlass kernel generator for deconv operation 4 years ago
  Megvii Engine Team 4abf7bd36f refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels 4 years ago
  Megvii Engine Team b4687ce8da feat(dnn/cuda): add convolution with i8 input and u4 output 4 years ago
  Megvii Engine Team 00083d13b6 fix(dnn/cuda): fix recursive algo search for fallback_nchw_qs8 4 years ago
  Megvii Engine Team bba04f02e5 feat(mgb/gopt): add fusion support for conv, astype(s4) and reformat 4 years ago
  Megvii Engine Team 66f70578c2 feat(dnn/cuda): add convolution with i8 input and i4 output 4 years ago
  Megvii Engine Team 6d686ff26f feat(gopt/inference): allow Float32 output dtype in EnableNCHW64Pass 4 years ago
  Megvii Engine Team 7d3df995cb feat(gopt/inference): allow Float32 output dtype in EnableNCHW4Pass 4 years ago
  Megvii Engine Team 633016a962 fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst 4 years ago
  Megvii Engine Team e6caa9ff89 feat(opr): add bn backward for inference mode 4 years ago
  Xinda Huang c90fa087ea test(mge): delete test_external.py 3 years ago
  Megvii Engine Team b2944559a8 fix(imperative/module): remove ``__getattribute__`` method in module 4 years ago
  Megvii Engine Team 77ead9377b fix(src/serialization): fix compatibility error of oss model 3 years ago
  Megvii Engine Team 070c811732 fix(imperative): remove convert_inputs 3 years ago
  Megvii Engine Team f40df60242 docs(mge): refactor docs to remove warnings 3 years ago
  Megvii Engine Team 1040b77843 fix(mge/functional): fix F.topk(kth_only=True) 4 years ago
  Megvii Engine Team 551cc701c6 docs(distributed.functional): add return type for all_reduce_max (jira #MGE-2706) 3 years ago
  Megvii Engine Team 72ff7aeccb feat(docs): add docs for megengine.functional.ones_like(jira #MGE-2702) 3 years ago
  Megvii Engine Team 7c9569e4e5 fix(mge/random): fix random seed 3 years ago
  Megvii Engine Team 07de15713c fix(mgb): remove static mem record from tee 4 years ago
  Megvii Engine Team d7b6bfd56c test(mge/fakequant): use fixed input for lsq test to temperarily avoid precision error 3 years ago
  Megvii Engine Team 5cef74a77e feat(mge/amp): add GradScaler support 4 years ago
  Megvii Engine Team 1bf18252c4 feat(mge/amp): add mix precision autocast support 4 years ago
  Megvii Engine Team f12355f727 fix(imperative/grad): fix hardcode dtype in subtensor_grad_rule 4 years ago
  Megvii Engine Team 4e4497b903 refactor(mgb/dnn): x86 pooling rebase algochooser 3 years ago
  Megvii Engine Team a33c3b73bd refactor(mgb/dnn): arm pooling rebase algochooser 3 years ago
  Megvii Engine Team 8dea6b3c68 build(dnn): compat for more windows env 3 years ago