451 Commits (3eb0505f9b9db06160fc1f4d2d8724856da9d46b)

Author SHA1 Message Date
  Megvii Engine Team 3eb0505f9b feat(imperative): add support for quantized conv transpose2d 3 years ago
  Megvii Engine Team c68e669530 feat(bazel/windows/xp/sp2/inference): implement inference on windows xp 3 years ago
  Megvii Engine Team 3b452d8c16 feat(mgb): cuda conv support nhwc format and fp16 dtype 3 years ago
  Megvii Engine Team 10bcf75767 feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88 3 years ago
  Megvii Engine Team ddba5c9674 fix(core): fix nr_threads is zero 3 years ago
  Megvii Engine Team 67f117882b perf(arm_common): add elemwise unary multithread support 3 years ago
  Megvii Engine Team 3afa3893d7 perf(arm_common): optimize arm common pooling 9x9 and 13x13 3 years ago
  Megvii Engine Team 2c4ff5431b fix(mgb): fix cudnn ConvolutionBackwardData 3 years ago
  Megvii Engine Team 287cab49c2 fix(mgb/sereg): fix rng operator compatibility 3 years ago
  Megvii Engine Team 2aba0378b9 refactor(mgb/dnn): fix group conv is_available 3 years ago
  Megvii Engine Team 4a92346b7a refactor(mgb): refactor group conv3d 3 years ago
  Megvii Engine Team 6ce212d2e0 refactor(mgb): refactor group conv 4 years ago
  Megvii Engine Team f76a2cc2c6 feat(mge/opr): add silu and gelu 3 years ago
  Megvii Engine Team f8b0f2cb91 build(dnn/cutlass): fix build for cutlass 3 years ago
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 3 years ago
  Megvii Engine Team 239916a997 fix(mgb/gopt): fix testcase for enable nchw64 pass 4 years ago
  Megvii Engine Team 4eda338876 feat(dnn/cuda): generate cutlass kimpls using cmake and bazel 4 years ago
  Megvii Engine Team 8d248a6a9a fix(dnn/cuda): fix testcase for fallback nchw qs8 conv 4 years ago
  Megvii Engine Team 894a2407c2 feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc 4 years ago
  Megvii Engine Team 43c59204df refactor(dnn/cuda): refactor relayout format kernels 4 years ago
  Megvii Engine Team f41a808694 feat(dnn/cuda): add nhwc int4 conv support 4 years ago
  Megvii Engine Team 5a14a89224 refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv 4 years ago
  Megvii Engine Team b33217d8f0 refactor(dnn/cuda): refactor cutlass kernel generator for deconv operation 4 years ago
  Megvii Engine Team 4abf7bd36f refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels 4 years ago
  Megvii Engine Team b4687ce8da feat(dnn/cuda): add convolution with i8 input and u4 output 4 years ago
  Megvii Engine Team 00083d13b6 fix(dnn/cuda): fix recursive algo search for fallback_nchw_qs8 4 years ago
  Megvii Engine Team 66f70578c2 feat(dnn/cuda): add convolution with i8 input and i4 output 4 years ago
  Megvii Engine Team 7d3df995cb feat(gopt/inference): allow Float32 output dtype in EnableNCHW4Pass 4 years ago
  Megvii Engine Team 633016a962 fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst 4 years ago
  Megvii Engine Team 4e4497b903 refactor(mgb/dnn): x86 pooling rebase algochooser 3 years ago
  Megvii Engine Team a33c3b73bd refactor(mgb/dnn): arm pooling rebase algochooser 3 years ago
  Megvii Engine Team ea70d99b4d fix(mge/convbias): make fallback convbias support nhwcd4 layout 4 years ago
  Megvii Engine Team 43098fb8f1 feat(mge): add SlidingWindowTranspose opr 4 years ago
  Megvii Engine Team b078dda90b feat(mge/random): add some random op and remove random/distrbution.py 4 years ago
  Megvii Engine Team 83e4c9d7ab fix(opencl): open opencl topk test when opencl beyond 2.0 4 years ago
  Megvii Engine Team f30c0e06a6 feat(mgb/opr): add lsq opr 4 years ago
  Megvii Engine Team 25932352e9 refactor(mgb/dnn): rocm pooling rebase algochooser 4 years ago
  Megvii Engine Team 1cfdbc565c feat(dnn): add deterministic max pooling 4 years ago
  Megvii Engine Team 20ab82d00c fix(tee): fix tee crash 4 years ago
  Megvii Engine Team a5060a2bfe feat(mgb/opr): add check_has_inf kernel and opr 4 years ago
  Megvii Engine Team 3597a6dbd7 feat(dnn/arm): nchw_nchw44 conv support 1x1s1 4 years ago
  Megvii Engine Team d915c5a3fd refactor(mgb): make convolution3D handle noncontiguous tensors 4 years ago
  Megvii Engine Team d04cd67faf refactor(mgb): make conv-backward-filter handle noncontiguous tensors 4 years ago
  Megvii Engine Team 44376f702a refactor(mgb): make conv-backward-data handle noncontiguous tensors 4 years ago
  Megvii Engine Team 7b2a76d1ee refactor(mgb): make conv handle noncontiguous tensors 4 years ago
  Megvii Engine Team ca2828ddcb fix(dnn/x86): fix x86 int8 matmul ldc bug 4 years ago
  Megvii Engine Team 40085acbae fix(mgb): remove unnecessary cudnn8 warning 4 years ago
  Megvii Engine Team 62bd6c823b feat(cmake/debug): misc for build 4 years ago
  Megvii Engine Team b87af9f77f feat(dnn/cuda): topk support fp16 4 years ago
  Megvii Engine Team 2eea00097c feat(mgb): add fast run batch size graph option 4 years ago