242 Commits (f76a2cc2c6d96b1d563fefd3a46bd41f4ed2bc9b)

Author SHA1 Message Date
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 3 years ago
  Megvii Engine Team 8d248a6a9a fix(dnn/cuda): fix testcase for fallback nchw qs8 conv 4 years ago
  Megvii Engine Team 894a2407c2 feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc 4 years ago
  Megvii Engine Team f41a808694 feat(dnn/cuda): add nhwc int4 conv support 4 years ago
  Megvii Engine Team 633016a962 fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst 4 years ago
  Megvii Engine Team 43098fb8f1 feat(mge): add SlidingWindowTranspose opr 4 years ago
  Megvii Engine Team b078dda90b feat(mge/random): add some random op and remove random/distrbution.py 4 years ago
  Megvii Engine Team 83e4c9d7ab fix(opencl): open opencl topk test when opencl beyond 2.0 4 years ago
  Megvii Engine Team f30c0e06a6 feat(mgb/opr): add lsq opr 4 years ago
  Megvii Engine Team 1cfdbc565c feat(dnn): add deterministic max pooling 4 years ago
  Megvii Engine Team a5060a2bfe feat(mgb/opr): add check_has_inf kernel and opr 4 years ago
  Megvii Engine Team 3597a6dbd7 feat(dnn/arm): nchw_nchw44 conv support 1x1s1 4 years ago
  Megvii Engine Team d915c5a3fd refactor(mgb): make convolution3D handle noncontiguous tensors 4 years ago
  Megvii Engine Team d04cd67faf refactor(mgb): make conv-backward-filter handle noncontiguous tensors 4 years ago
  Megvii Engine Team 44376f702a refactor(mgb): make conv-backward-data handle noncontiguous tensors 4 years ago
  Megvii Engine Team 7b2a76d1ee refactor(mgb): make conv handle noncontiguous tensors 4 years ago
  Megvii Engine Team ca2828ddcb fix(dnn/x86): fix x86 int8 matmul ldc bug 4 years ago
  Megvii Engine Team b87af9f77f feat(dnn/cuda): topk support fp16 4 years ago
  Megvii Engine Team 71cc814eaf feat(ci): add aarch64 linux ci 4 years ago
  Megvii Engine Team 606540bef4 feat(dnn/cuda): add nhwc 4bit warp perspective 4 years ago
  Megvii Engine Team 1e6019436c feat(dnn/cuda): add nhwc int4 pooling 4 years ago
  Megvii Engine Team 319436dd14 feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias 4 years ago
  Megvii Engine Team d28eba4ea5 feat(dnn/cuda): add cutlass impls for int4 conv bias 4 years ago
  Megvii Engine Team 2d4e62ef58 feat(dnn/cuda): add cuda uint4 pooling 4 years ago
  Megvii Engine Team 19919384fc feat(dnn/cuda): add cuda uint warp perspective 4 years ago
  Megvii Engine Team 5868d1fe4f fix(arm_common/pooling): check mode in pooling algo to avoid wrong use AVERAGE_COUNT_EXCLUDE_PADDING 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team 4a802d21ca feat(dnn/cuda): add conv u4xs4 sass kernel 4 years ago
  Megvii Engine Team adf75a291d perf(dnn/cuda): add sass int4 128x128 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago
  Megvii Engine Team 4fe68ac9ed feat(dnn/cuda): support transforming layout between nchw and nchw64 when channel not aligned to 64 4 years ago
  Megvii Engine Team 56e863b7d4 fix(dnn/cuda): fix int4 epilogue stg bug 4 years ago
  Megvii Engine Team 12a0e61542 feat(dnn/cuda): add cuda elemwise int4 4 years ago
  Megvii Engine Team df1af59b5c feat(dnn): warp perspective support int4 4 years ago
  Megvii Engine Team 2398df079c feat(dnn/cuda): add cuda int4 pooling 4 years ago
  Megvii Engine Team e250afb08f feat(dnn/cuda): support conv_bias for nchw64 and qint4 4 years ago
  Megvii Engine Team 3b9b87809d refactor(dnn): refactor lowbit tensor format 4 years ago
  Megvii Engine Team 8fef78d06d feat(dnn/cuda): add relayout format when width is an odd number 4 years ago
  Megvii Engine Team 91d6160769 feat(dnn/common): add tensor format for low-bits tensor layout 4 years ago
  Megvii Engine Team 19a554d674 test(dnn/cuda): add testcase for transforming tensor layout between nchw and nchw64 4 years ago
  Megvii Engine Team 23032f50f2 feat(dnn/cuda): support float16 for index_incr_multi_axis_vec 4 years ago
  Megvii Engine Team 938944027d fix(mgb/dnn): fix cudnn8 convbias 4 years ago
  Megvii Engine Team 1525a02530 feat(mge/module): add python wrapper for unfold 4 years ago
  Megvii Engine Team 13b15fb08c feat(megbrain): add correlation opr 4 years ago
  Megvii Engine Team 1997b1a289 feat(dnn/cuda): add correlation kernel 4 years ago
  Megvii Engine Team c3f8cf04fa feat(dnn): add conv_bwd_data and conv_bwd_filter accuracy shake check 4 years ago
  Megvii Engine Team 1e6ef3771f feat(mgb/dnn): add accuracy shake checker 4 years ago
  Megvii Engine Team 78fff72a95 feat(dnn): add param_pack for rocm 4 years ago
  Megvii Engine Team ff755451d2 refactor(mgb): move algo's name from info to desc and delete some algo's unnecessary param() method 4 years ago
  Megvii Engine Team 756c1eb7f2 fix(mgb/dnn): add cuda float naive matmul algo 4 years ago