1670 Commits (release-1.5)
 

Author SHA1 Message Date
  Xu Xinran 5985392297 chore(release): bump version 3 years ago
  Megvii Engine Team 10bcf75767 feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88 3 years ago
  Megvii Engine Team ddba5c9674 fix(core): fix nr_threads is zero 3 years ago
  Megvii Engine Team 67f117882b perf(arm_common): add elemwise unary multithread support 3 years ago
  Megvii Engine Team 3afa3893d7 perf(arm_common): optimize arm common pooling 9x9 and 13x13 3 years ago
  Megvii Engine Team d16c5caf6b fix(mge/dump): fix dump device error with const 3 years ago
  Megvii Engine Team 2c4ff5431b fix(mgb): fix cudnn ConvolutionBackwardData 3 years ago
  Megvii Engine Team 7138e4fd02 feat(docs): add docs for megengine.functional.full 3 years ago
  Megvii Engine Team 0b4a767965 feat(mge/distributed): enable uint8 for collective communication 3 years ago
  Megvii Engine Team a22b2cf473 ci(copybara): add config files and fix format script 3 years ago
  Megvii Engine Team 287cab49c2 fix(mgb/sereg): fix rng operator compatibility 3 years ago
  Megvii Engine Team e3fc783642 fix(mgb/opr): fix nvof shape error 3 years ago
  Megvii Engine Team 3f3a256e0f fix(mge/functional): fix conv* dtype promotion 3 years ago
  Megvii Engine Team 536506c3f4 feat(functional): let interpolate support more modes 3 years ago
  Megvii Engine Team d811dc5478 docs(mge/distributed): add document for distributed.backend 3 years ago
  Megvii Engine Team 9526ee521b docs(distributed.functional): add return type for all_reduce_min 3 years ago
  Megvii Engine Team 2aba0378b9 refactor(mgb/dnn): fix group conv is_available 3 years ago
  Megvii Engine Team 4a92346b7a refactor(mgb): refactor group conv3d 3 years ago
  Megvii Engine Team 6ce212d2e0 refactor(mgb): refactor group conv 4 years ago
  XindaH febd0b1798 ci(fix): fail when git user name or email is empty 3 years ago
  Megvii Engine Team eb2dd018d9 build(fp16): fix fp16 build 3 years ago
  Megvii Engine Team f76a2cc2c6 feat(mge/opr): add silu and gelu 3 years ago
  Megvii Engine Team f2ac4c345b docs(distributed.functional.all_reduce_sum): googlestring and examples 3 years ago
  Megvii Engine Team 186bacfb71 fix(mge): recover bn freeze fastpath execution 3 years ago
  Megvii Engine Team 5f558042b2 fix(imperative/ops): use tblgen to generate FastpathCopy 3 years ago
  Megvii Engine Team bfc4e7a966 docs(mge): fix amp docstring problems 3 years ago
  Megvii Engine Team 0b764cf2d2 docs(mge/functional): add docs for megengine.functional.full_like 3 years ago
  Megvii Engine Team f141159088 refactor(mge): loose the error bound of fastrun 3 years ago
  Megvii Engine Team 1f0436967c refactor(mge/distributed): using nccl as default in distributed training 3 years ago
  Megvii Engine Team b17a02d44a feat(mge/distributed): deprecate get_device_count_by_fork 3 years ago
  Megvii Engine Team f8b0f2cb91 build(dnn/cutlass): fix build for cutlass 3 years ago
  konghuanjun 0fb4e9a9ca fix(ci): git set user and email 4 years ago
  huangxinda 6af4a32e17 feat(mge/third_party): update MegRay version 3 years ago
  huangxinda 093f7ae774 feat(mge/third_party): update cutlass version 3 years ago
  Megvii Engine Team c2daea3cba chore(release): bump version 3 years ago
  Megvii Engine Team 207a346351 chore(mge): run get_device_count("gpu") in subprocess 4 years ago
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 3 years ago
  liuke 0baf6b0d63 Merge pull request #175 from tpoisonooo:fix-spell-error 3 years ago
  Megvii Engine Team 239916a997 fix(mgb/gopt): fix testcase for enable nchw64 pass 4 years ago
  Megvii Engine Team 2ab5c53f1d feat(mgb/gopt): support nhwc conv in tensor reformat pass 4 years ago
  Megvii Engine Team 009c90a2fe feat(mgb/gopt): modify padding policy for 4bit conv bias oprs 4 years ago
  Megvii Engine Team 4eda338876 feat(dnn/cuda): generate cutlass kimpls using cmake and bazel 4 years ago
  Megvii Engine Team 8d248a6a9a fix(dnn/cuda): fix testcase for fallback nchw qs8 conv 4 years ago
  Megvii Engine Team 894a2407c2 feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc 4 years ago
  Megvii Engine Team 43c59204df refactor(dnn/cuda): refactor relayout format kernels 4 years ago
  Megvii Engine Team f41a808694 feat(dnn/cuda): add nhwc int4 conv support 4 years ago
  Megvii Engine Team 5a14a89224 refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv 4 years ago
  Megvii Engine Team b33217d8f0 refactor(dnn/cuda): refactor cutlass kernel generator for deconv operation 4 years ago
  Megvii Engine Team 4abf7bd36f refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels 4 years ago
  Megvii Engine Team b4687ce8da feat(dnn/cuda): add convolution with i8 input and u4 output 4 years ago