448 Commits (release-1.5)

Author SHA1 Message Date
  Megvii Engine Team 606540bef4 feat(dnn/cuda): add nhwc 4bit warp perspective 4 years ago
  Megvii Engine Team 1e6019436c feat(dnn/cuda): add nhwc int4 pooling 4 years ago
  Megvii Engine Team e661ae904f feat(dnn/cuda): add base class for cutlass uint4 and int4 algos 4 years ago
  Megvii Engine Team 319436dd14 feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias 4 years ago
  Megvii Engine Team d28eba4ea5 feat(dnn/cuda): add cutlass impls for int4 conv bias 4 years ago
  Megvii Engine Team 14b65e4da7 feat(dnn/cuda): add reduce_filter_and_update_bias 4 years ago
  Megvii Engine Team 2d4e62ef58 feat(dnn/cuda): add cuda uint4 pooling 4 years ago
  Megvii Engine Team 19919384fc feat(dnn/cuda): add cuda uint warp perspective 4 years ago
  Megvii Engine Team 5868d1fe4f fix(arm_common/pooling): check mode in pooling algo to avoid wrong use AVERAGE_COUNT_EXCLUDE_PADDING 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team 4a802d21ca feat(dnn/cuda): add conv u4xs4 sass kernel 4 years ago
  Megvii Engine Team adf75a291d perf(dnn/cuda): add sass int4 128x128 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago
  Megvii Engine Team c218d4b029 feat(dnn/cuda): fallback conv qs4 support channel not aligend to 64 4 years ago
  Megvii Engine Team 4fe68ac9ed feat(dnn/cuda): support transforming layout between nchw and nchw64 when channel not aligned to 64 4 years ago
  Megvii Engine Team ae6ff2c5a6 feat(mgb/gopt): add opt pass for nchw64 layout transform 4 years ago
  Megvii Engine Team 56e863b7d4 fix(dnn/cuda): fix int4 epilogue stg bug 4 years ago
  Megvii Engine Team cff61a53d4 perf(dnn/cuda): optimize int4 sass conv main loop and epilogue without fuse_z 4 years ago
  Megvii Engine Team 12a0e61542 feat(dnn/cuda): add cuda elemwise int4 4 years ago
  Megvii Engine Team df1af59b5c feat(dnn): warp perspective support int4 4 years ago
  Megvii Engine Team 2398df079c feat(dnn/cuda): add cuda int4 pooling 4 years ago
  Megvii Engine Team 2a2a7f4552 test(mgb/opr): add testcase for conv bias int4 4 years ago
  Megvii Engine Team 858261af1f fix(python_module): fix conversion between numpy-ndarray and mgb tensor for qint4 and quint4 4 years ago
  Megvii Engine Team e250afb08f feat(dnn/cuda): support conv_bias for nchw64 and qint4 4 years ago
  Megvii Engine Team 3b9b87809d refactor(dnn): refactor lowbit tensor format 4 years ago
  Megvii Engine Team c74660ea88 fix(dnn/cuda): fix invalid local read for relayout format kernel 4 years ago
  Megvii Engine Team 8fef78d06d feat(dnn/cuda): add relayout format when width is an odd number 4 years ago
  Megvii Engine Team 91d6160769 feat(dnn/common): add tensor format for low-bits tensor layout 4 years ago
  Megvii Engine Team 19a554d674 test(dnn/cuda): add testcase for transforming tensor layout between nchw and nchw64 4 years ago
  Megvii Engine Team 71c2f61254 feat(dnn/cuda): add relayout format to support layout transform between NCHW and NCHW64 4 years ago
  Megvii Engine Team df009e89e1 feat(dnn/cuda): add cuda conv bias impls for NCHW format tensors with qint4 data type 4 years ago
  Megvii Engine Team ed92207585 feat(dnn/cuda): add conv bias impl for int4 data type using sass language 4 years ago
  Megvii Engine Team 52b55564d7 refactor(dnn/cuda): refactor reorder filter and bias kernel to support conv imma with data type s4 4 years ago
  Megvii Engine Team 517cc6846a ci(gitlab-ci): add inline lineno checking in copybara linter 4 years ago
  Megvii Engine Team 23032f50f2 feat(dnn/cuda): support float16 for index_incr_multi_axis_vec 4 years ago
  Megvii Engine Team 938944027d fix(mgb/dnn): fix cudnn8 convbias 4 years ago
  Megvii Engine Team 3591ef1f6a fix(mgb): fix conv cudnnconvbackwarddata algo witch is not shake 4 years ago
  Megvii Engine Team 1525a02530 feat(mge/module): add python wrapper for unfold 4 years ago
  Megvii Engine Team 13b15fb08c feat(megbrain): add correlation opr 4 years ago
  Megvii Engine Team 1997b1a289 feat(dnn/cuda): add correlation kernel 4 years ago
  Megvii Engine Team acb000d07f fix(api_cache): fix serialization for conv_desc 4 years ago
  Megvii Engine Team c4032222fa fix(api_cache): lock api cache for thread safety 4 years ago
  Megvii Engine Team 5419a95d1e perf(cuda/conv): cache serval cudnn api 4 years ago
  Megvii Engine Team 19887942c8 feat(dnn/apicache): add generic apicache 4 years ago
  Megvii Engine Team e4af4225ec fix(cmake): fix cmake depends 4 years ago
  Megvii Engine Team 6bb6787d9a feat(mge): add a tool which can analyze the file generated by compare_binary_iodump.py 4 years ago
  Megvii Engine Team c3f8cf04fa feat(dnn): add conv_bwd_data and conv_bwd_filter accuracy shake check 4 years ago
  Megvii Engine Team f36e99d30b fix(build): fix naive build 4 years ago
  Megvii Engine Team 0a86a07096 fix(mgb/dnn): fix cub potential issues 4 years ago
  Megvii Engine Team 1e6ef3771f feat(mgb/dnn): add accuracy shake checker 4 years ago