412 Commits (master)

Author SHA1 Message Date
  Megvii Engine Team 4c13bc7e1b feat(dnn/cuda): add nhwc int8 deconv 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team 67575d582c feat(mge/opr): add interpolate bilinear mode 3 years ago
  Megvii Engine Team 0558b2123d feat(mge/opr): add interpolate nearest mode 3 years ago
  Megvii Engine Team c25125e3d2 perf(dnn/cuda): sass int8 epilogue remove shared load 3 years ago
  Megvii Engine Team c9d060307f feat(dnn/common): add named tensor shape 4 years ago
  Megvii Engine Team ff0e6be7b9 fix(dnn/cuda): fix cutlass tensorop kernels 3 years ago
  Megvii Engine Team 336761253d feat(dnn/cuda): add tensorcore matmul for fp16 data type 3 years ago
  Megvii Engine Team eab6afab47 feat(mgb): add padding opr for megbrain 4 years ago
  Megvii Engine Team b18feaab33 feat(dnn/cuda): use cutlass remove shared load imma conv kernel 4 years ago
  Megvii Engine Team 1af350c6d2 feat(dnn): add fill kernel 3 years ago
  Megvii Engine Team 3eb0505f9b feat(imperative): add support for quantized conv transpose2d 3 years ago
  Megvii Engine Team 3b452d8c16 feat(mgb): cuda conv support nhwc format and fp16 dtype 3 years ago
  Megvii Engine Team 10bcf75767 feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88 3 years ago
  Megvii Engine Team ddba5c9674 fix(core): fix nr_threads is zero 3 years ago
  Megvii Engine Team 67f117882b perf(arm_common): add elemwise unary multithread support 3 years ago
  Megvii Engine Team 3afa3893d7 perf(arm_common): optimize arm common pooling 9x9 and 13x13 3 years ago
  Megvii Engine Team 2aba0378b9 refactor(mgb/dnn): fix group conv is_available 3 years ago
  Megvii Engine Team 4a92346b7a refactor(mgb): refactor group conv3d 3 years ago
  Megvii Engine Team 6ce212d2e0 refactor(mgb): refactor group conv 4 years ago
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 3 years ago
  Megvii Engine Team 8d248a6a9a fix(dnn/cuda): fix testcase for fallback nchw qs8 conv 4 years ago
  Megvii Engine Team 894a2407c2 feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc 4 years ago
  Megvii Engine Team f41a808694 feat(dnn/cuda): add nhwc int4 conv support 4 years ago
  Megvii Engine Team 633016a962 fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst 4 years ago
  Megvii Engine Team 43098fb8f1 feat(mge): add SlidingWindowTranspose opr 4 years ago
  Megvii Engine Team b078dda90b feat(mge/random): add some random op and remove random/distrbution.py 4 years ago
  Megvii Engine Team 83e4c9d7ab fix(opencl): open opencl topk test when opencl beyond 2.0 4 years ago
  Megvii Engine Team f30c0e06a6 feat(mgb/opr): add lsq opr 4 years ago
  Megvii Engine Team 1cfdbc565c feat(dnn): add deterministic max pooling 4 years ago
  Megvii Engine Team a5060a2bfe feat(mgb/opr): add check_has_inf kernel and opr 4 years ago
  Megvii Engine Team 3597a6dbd7 feat(dnn/arm): nchw_nchw44 conv support 1x1s1 4 years ago
  Megvii Engine Team d915c5a3fd refactor(mgb): make convolution3D handle noncontiguous tensors 4 years ago
  Megvii Engine Team d04cd67faf refactor(mgb): make conv-backward-filter handle noncontiguous tensors 4 years ago
  Megvii Engine Team 44376f702a refactor(mgb): make conv-backward-data handle noncontiguous tensors 4 years ago
  Megvii Engine Team 7b2a76d1ee refactor(mgb): make conv handle noncontiguous tensors 4 years ago
  Megvii Engine Team ca2828ddcb fix(dnn/x86): fix x86 int8 matmul ldc bug 4 years ago
  Megvii Engine Team b87af9f77f feat(dnn/cuda): topk support fp16 4 years ago
  Megvii Engine Team 71cc814eaf feat(ci): add aarch64 linux ci 4 years ago
  Megvii Engine Team 606540bef4 feat(dnn/cuda): add nhwc 4bit warp perspective 4 years ago
  Megvii Engine Team 1e6019436c feat(dnn/cuda): add nhwc int4 pooling 4 years ago
  Megvii Engine Team 319436dd14 feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias 4 years ago
  Megvii Engine Team d28eba4ea5 feat(dnn/cuda): add cutlass impls for int4 conv bias 4 years ago
  Megvii Engine Team 2d4e62ef58 feat(dnn/cuda): add cuda uint4 pooling 4 years ago
  Megvii Engine Team 19919384fc feat(dnn/cuda): add cuda uint warp perspective 4 years ago
  Megvii Engine Team 5868d1fe4f fix(arm_common/pooling): check mode in pooling algo to avoid wrong use AVERAGE_COUNT_EXCLUDE_PADDING 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team 4a802d21ca feat(dnn/cuda): add conv u4xs4 sass kernel 4 years ago
  Megvii Engine Team adf75a291d perf(dnn/cuda): add sass int4 128x128 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago