Megvii Engine Team
|
d915c5a3fd
|
refactor(mgb): make convolution3D handle noncontiguous tensors
GitOrigin-RevId: 3d3c31b021
|
4 years ago |
Megvii Engine Team
|
d04cd67faf
|
refactor(mgb): make conv-backward-filter handle noncontiguous tensors
GitOrigin-RevId: 44c586f912
|
4 years ago |
Megvii Engine Team
|
44376f702a
|
refactor(mgb): make conv-backward-data handle noncontiguous tensors
GitOrigin-RevId: 0a8f66f9d3
|
4 years ago |
Megvii Engine Team
|
7b2a76d1ee
|
refactor(mgb): make conv handle noncontiguous tensors
GitOrigin-RevId: 86282709b3
|
4 years ago |
Megvii Engine Team
|
ca2828ddcb
|
fix(dnn/x86): fix x86 int8 matmul ldc bug
GitOrigin-RevId: 2502f99000
|
4 years ago |
Megvii Engine Team
|
40085acbae
|
fix(mgb): remove unnecessary cudnn8 warning
GitOrigin-RevId: 04cf1bfca9
|
4 years ago |
Megvii Engine Team
|
62bd6c823b
|
feat(cmake/debug): misc for build
* add asan build option
* fix cpuinfo build opt level
* fix host release build with out debug info
* opt "fix lite bazel/cmake symbols MR"
* other misc build opt
GitOrigin-RevId: 6ca286e195
|
4 years ago |
Megvii Engine Team
|
b87af9f77f
|
feat(dnn/cuda): topk support fp16
GitOrigin-RevId: c6610d4cf0
|
4 years ago |
Megvii Engine Team
|
2eea00097c
|
feat(mgb): add fast run batch size graph option
GitOrigin-RevId: 94e333ec80
|
4 years ago |
Megvii Engine Team
|
47dcdf3e17
|
fix(mgb/core): fix dtype and resize modifiers for tensor
GitOrigin-RevId: a9d95a4cd8
|
4 years ago |
Megvii Engine Team
|
71cc814eaf
|
feat(ci): add aarch64 linux ci
GitOrigin-RevId: 2c0d3a8cc2
|
4 years ago |
Megvii Engine Team
|
24a3878130
|
feat(dnn/cuda): add nchw conv u4xs4 support
GitOrigin-RevId: 5edba47bd9
|
4 years ago |
Megvii Engine Team
|
606540bef4
|
feat(dnn/cuda): add nhwc 4bit warp perspective
GitOrigin-RevId: fbec4a4a1f
|
4 years ago |
Megvii Engine Team
|
1e6019436c
|
feat(dnn/cuda): add nhwc int4 pooling
GitOrigin-RevId: 9cf14cde4e
|
4 years ago |
Megvii Engine Team
|
e661ae904f
|
feat(dnn/cuda): add base class for cutlass uint4 and int4 algos
GitOrigin-RevId: a4d42f032c
|
4 years ago |
Megvii Engine Team
|
319436dd14
|
feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias
GitOrigin-RevId: cf4536855a
|
4 years ago |
Megvii Engine Team
|
d28eba4ea5
|
feat(dnn/cuda): add cutlass impls for int4 conv bias
GitOrigin-RevId: 878bb8c955
|
4 years ago |
Megvii Engine Team
|
14b65e4da7
|
feat(dnn/cuda): add reduce_filter_and_update_bias
GitOrigin-RevId: 31b6e6b0ab
|
4 years ago |
Megvii Engine Team
|
2d4e62ef58
|
feat(dnn/cuda): add cuda uint4 pooling
GitOrigin-RevId: a728977206
|
4 years ago |
Megvii Engine Team
|
19919384fc
|
feat(dnn/cuda): add cuda uint warp perspective
GitOrigin-RevId: 2aec72010f
|
4 years ago |
Megvii Engine Team
|
5868d1fe4f
|
fix(arm_common/pooling): check mode in pooling algo to avoid wrong use AVERAGE_COUNT_EXCLUDE_PADDING
GitOrigin-RevId: 7a2d243db7
|
4 years ago |
Megvii Engine Team
|
86b69cacd0
|
fix(dnn): fixes for int4
GitOrigin-RevId: 845e164fd3
|
4 years ago |
Megvii Engine Team
|
4a802d21ca
|
feat(dnn/cuda): add conv u4xs4 sass kernel
GitOrigin-RevId: 4defcf5f1f
|
4 years ago |
Megvii Engine Team
|
adf75a291d
|
perf(dnn/cuda): add sass int4 128x128
GitOrigin-RevId: 1bc5482102
|
4 years ago |
Megvii Engine Team
|
8da2f698a3
|
feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64
GitOrigin-RevId: 39f29ec990
|
4 years ago |
Megvii Engine Team
|
c218d4b029
|
feat(dnn/cuda): fallback conv qs4 support channel not aligend to 64
GitOrigin-RevId: f0d080f35c
|
4 years ago |
Megvii Engine Team
|
4fe68ac9ed
|
feat(dnn/cuda): support transforming layout between nchw and nchw64 when channel not aligned to 64
GitOrigin-RevId: e9ecbcf2e2
|
4 years ago |
Megvii Engine Team
|
ae6ff2c5a6
|
feat(mgb/gopt): add opt pass for nchw64 layout transform
GitOrigin-RevId: adede7cef6
|
4 years ago |
Megvii Engine Team
|
56e863b7d4
|
fix(dnn/cuda): fix int4 epilogue stg bug
GitOrigin-RevId: e86da9a8a8
|
4 years ago |
Megvii Engine Team
|
cff61a53d4
|
perf(dnn/cuda): optimize int4 sass conv main loop and epilogue without fuse_z
GitOrigin-RevId: 4274e58d64
|
4 years ago |
Megvii Engine Team
|
12a0e61542
|
feat(dnn/cuda): add cuda elemwise int4
GitOrigin-RevId: 8a9aaec328
|
4 years ago |
Megvii Engine Team
|
df1af59b5c
|
feat(dnn): warp perspective support int4
GitOrigin-RevId: 826a43b349
|
4 years ago |
Megvii Engine Team
|
2398df079c
|
feat(dnn/cuda): add cuda int4 pooling
GitOrigin-RevId: 14ed4e6f00
|
4 years ago |
Megvii Engine Team
|
2a2a7f4552
|
test(mgb/opr): add testcase for conv bias int4
GitOrigin-RevId: e3fff5e30b
|
4 years ago |
Megvii Engine Team
|
858261af1f
|
fix(python_module): fix conversion between numpy-ndarray and mgb tensor for qint4 and quint4
GitOrigin-RevId: 7450c4f25e
|
4 years ago |
Megvii Engine Team
|
e250afb08f
|
feat(dnn/cuda): support conv_bias for nchw64 and qint4
GitOrigin-RevId: 1c65ba87d7
|
4 years ago |
Megvii Engine Team
|
3b9b87809d
|
refactor(dnn): refactor lowbit tensor format
GitOrigin-RevId: b646dc085b
|
4 years ago |
Megvii Engine Team
|
c74660ea88
|
fix(dnn/cuda): fix invalid local read for relayout format kernel
GitOrigin-RevId: 5a77b82212
|
4 years ago |
Megvii Engine Team
|
8fef78d06d
|
feat(dnn/cuda): add relayout format when width is an odd number
GitOrigin-RevId: f059f1f56d
|
4 years ago |
Megvii Engine Team
|
91d6160769
|
feat(dnn/common): add tensor format for low-bits tensor layout
GitOrigin-RevId: 0aa3753f37
|
4 years ago |
Megvii Engine Team
|
19a554d674
|
test(dnn/cuda): add testcase for transforming tensor layout between nchw and nchw64
GitOrigin-RevId: 75d579635a
|
4 years ago |
Megvii Engine Team
|
71c2f61254
|
feat(dnn/cuda): add relayout format to support layout transform between NCHW and NCHW64
GitOrigin-RevId: 1445ecfabe
|
4 years ago |
Megvii Engine Team
|
df009e89e1
|
feat(dnn/cuda): add cuda conv bias impls for NCHW format tensors with qint4 data type
GitOrigin-RevId: a0a08cf42c
|
4 years ago |
Megvii Engine Team
|
ed92207585
|
feat(dnn/cuda): add conv bias impl for int4 data type using sass language
GitOrigin-RevId: ae3d3e1c98
|
4 years ago |
Megvii Engine Team
|
52b55564d7
|
refactor(dnn/cuda): refactor reorder filter and bias kernel to support conv imma with data type s4
GitOrigin-RevId: 6827b73770
|
4 years ago |
Megvii Engine Team
|
517cc6846a
|
ci(gitlab-ci): add inline lineno checking in copybara linter
GitOrigin-RevId: 56c5068009
|
4 years ago |
Megvii Engine Team
|
23032f50f2
|
feat(dnn/cuda): support float16 for index_incr_multi_axis_vec
GitOrigin-RevId: c2ae93d568
|
4 years ago |
Megvii Engine Team
|
938944027d
|
fix(mgb/dnn): fix cudnn8 convbias
GitOrigin-RevId: 0fdbfd258c
|
4 years ago |
Megvii Engine Team
|
3591ef1f6a
|
fix(mgb): fix conv cudnnconvbackwarddata algo witch is not shake
GitOrigin-RevId: 379bfbe376
|
4 years ago |
Megvii Engine Team
|
1525a02530
|
feat(mge/module): add python wrapper for unfold
GitOrigin-RevId: 562103186f
|
4 years ago |