Megvii Engine Team
8da2f698a3
feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64
GitOrigin-RevId: 39f29ec990
4 years ago
Megvii Engine Team
c218d4b029
feat(dnn/cuda): fallback conv qs4 support channel not aligned to 64
GitOrigin-RevId: f0d080f35c
4 years ago
Megvii Engine Team
4fe68ac9ed
feat(dnn/cuda): support transforming layout between nchw and nchw64 when channel not aligned to 64
GitOrigin-RevId: e9ecbcf2e2
4 years ago
Megvii Engine Team
ae6ff2c5a6
feat(mgb/gopt): add opt pass for nchw64 layout transform
GitOrigin-RevId: adede7cef6
4 years ago
Megvii Engine Team
56e863b7d4
fix(dnn/cuda): fix int4 epilogue stg bug
GitOrigin-RevId: e86da9a8a8
4 years ago
Megvii Engine Team
cff61a53d4
perf(dnn/cuda): optimize int4 sass conv main loop and epilogue without fuse_z
GitOrigin-RevId: 4274e58d64
4 years ago
Megvii Engine Team
12a0e61542
feat(dnn/cuda): add cuda elemwise int4
GitOrigin-RevId: 8a9aaec328
4 years ago
Megvii Engine Team
df1af59b5c
feat(dnn): warp perspective support int4
GitOrigin-RevId: 826a43b349
4 years ago
Megvii Engine Team
2398df079c
feat(dnn/cuda): add cuda int4 pooling
GitOrigin-RevId: 14ed4e6f00
4 years ago
Megvii Engine Team
2a2a7f4552
test(mgb/opr): add testcase for conv bias int4
GitOrigin-RevId: e3fff5e30b
4 years ago
Megvii Engine Team
858261af1f
fix(python_module): fix conversion between numpy-ndarray and mgb tensor for qint4 and quint4
GitOrigin-RevId: 7450c4f25e
4 years ago
Megvii Engine Team
e250afb08f
feat(dnn/cuda): support conv_bias for nchw64 and qint4
GitOrigin-RevId: 1c65ba87d7
4 years ago
Megvii Engine Team
3b9b87809d
refactor(dnn): refactor lowbit tensor format
GitOrigin-RevId: b646dc085b
4 years ago
Megvii Engine Team
c74660ea88
fix(dnn/cuda): fix invalid local read for relayout format kernel
GitOrigin-RevId: 5a77b82212
4 years ago
Megvii Engine Team
8fef78d06d
feat(dnn/cuda): add relayout format when width is an odd number
GitOrigin-RevId: f059f1f56d
4 years ago
Megvii Engine Team
91d6160769
feat(dnn/common): add tensor format for low-bits tensor layout
GitOrigin-RevId: 0aa3753f37
4 years ago
Megvii Engine Team
19a554d674
test(dnn/cuda): add testcase for transforming tensor layout between nchw and nchw64
GitOrigin-RevId: 75d579635a
4 years ago
Megvii Engine Team
71c2f61254
feat(dnn/cuda): add relayout format to support layout transform between NCHW and NCHW64
GitOrigin-RevId: 1445ecfabe
4 years ago
Megvii Engine Team
df009e89e1
feat(dnn/cuda): add cuda conv bias impls for NCHW format tensors with qint4 data type
GitOrigin-RevId: a0a08cf42c
4 years ago
Megvii Engine Team
ed92207585
feat(dnn/cuda): add conv bias impl for int4 data type using sass language
GitOrigin-RevId: ae3d3e1c98
4 years ago
Megvii Engine Team
52b55564d7
refactor(dnn/cuda): refactor reorder filter and bias kernel to support conv imma with data type s4
GitOrigin-RevId: 6827b73770
4 years ago
Megvii Engine Team
517cc6846a
ci(gitlab-ci): add inline lineno checking in copybara linter
GitOrigin-RevId: 56c5068009
4 years ago
Megvii Engine Team
23032f50f2
feat(dnn/cuda): support float16 for index_incr_multi_axis_vec
GitOrigin-RevId: c2ae93d568
4 years ago
Megvii Engine Team
938944027d
fix(mgb/dnn): fix cudnn8 convbias
GitOrigin-RevId: 0fdbfd258c
4 years ago
Megvii Engine Team
3591ef1f6a
fix(mgb): fix conv cudnnconvbackwarddata algo which is not shake
GitOrigin-RevId: 379bfbe376
4 years ago
Megvii Engine Team
1525a02530
feat(mge/module): add python wrapper for unfold
GitOrigin-RevId: 562103186f
4 years ago
Megvii Engine Team
13b15fb08c
feat(megbrain): add correlation opr
GitOrigin-RevId: 6d44598891
4 years ago
Megvii Engine Team
1997b1a289
feat(dnn/cuda): add correlation kernel
GitOrigin-RevId: 25e58b61e6
4 years ago
Megvii Engine Team
acb000d07f
fix(api_cache): fix serialization for conv_desc
GitOrigin-RevId: 95dbc9c685
4 years ago
Megvii Engine Team
c4032222fa
fix(api_cache): lock api cache for thread safety
GitOrigin-RevId: 8a244677c3
4 years ago
Megvii Engine Team
5419a95d1e
perf(cuda/conv): cache several cudnn api
GitOrigin-RevId: 188c62cdd6
4 years ago
Megvii Engine Team
19887942c8
feat(dnn/apicache): add generic apicache
GitOrigin-RevId: 40b8ac2ab6
4 years ago
Megvii Engine Team
e4af4225ec
fix(cmake): fix cmake depends
Reason: some targets call python3 to generate headers. If
PYTHON_EXECUTABLE changes, those headers are invalidated, which
forces a rebuild of most objects because the headers are included
in most C/C++ sources. These targets do not depend on the python3
version (Python API): the output is the same with python3.5 or any
python3.x. So switch them from PYTHON_EXECUTABLE to
PYTHON3_EXECUTABLE_WITHOUT_VERSION. When
PYTHON_EXECUTABLE/PYTHON_LIBRARY/PYTHON_INCLUDE_DIR change,
the built objects can be reused.
GitOrigin-RevId: c2ffe3ead3
4 years ago
Megvii Engine Team
6bb6787d9a
feat(mge): add a tool which can analyze the file generated by compare_binary_iodump.py
GitOrigin-RevId: 9acab0a49f
4 years ago
Megvii Engine Team
c3f8cf04fa
feat(dnn): add conv_bwd_data and conv_bwd_filter accuracy shake check
GitOrigin-RevId: 4069e083d2
4 years ago
Megvii Engine Team
f36e99d30b
fix(build): fix naive build
GitOrigin-RevId: 0050ff5d9c
4 years ago
Megvii Engine Team
0a86a07096
fix(mgb/dnn): fix cub potential issues
Wrap cub with CUB_NS_PREFIX and remove dependency on Thrust to avoid potential linking issues
GitOrigin-RevId: 53893b0a39
4 years ago
Megvii Engine Team
1e6ef3771f
feat(mgb/dnn): add accuracy shake checker
GitOrigin-RevId: 0bb52078a1
4 years ago
Megvii Engine Team
a5a29826dc
feat(mgb/dnn): add accuracy_depend_on_batch attribute
GitOrigin-RevId: 2be1db43d0
4 years ago
Megvii Engine Team
4b141f8de4
fix(mgb): add usable-depend-on-shape attr
GitOrigin-RevId: 3a14fa6b6f
4 years ago
Megvii Engine Team
69a146c8c3
build(rocm): support rocm-3.9
GitOrigin-RevId: 85f8911736
4 years ago
Megvii Engine Team
a31b7c6ee3
build(rocm): partially support hcc compilation
GitOrigin-RevId: ca9f1f8e8e
4 years ago
Megvii Engine Team
621ae0a1e8
fix(dnn): replace kernel launch syntax with macro for hcc
GitOrigin-RevId: f9e69d4825
4 years ago
Megvii Engine Team
78fff72a95
feat(dnn): add param_pack for rocm
GitOrigin-RevId: 2180504c71
4 years ago
Megvii Engine Team
8163ed157d
fix(dnn/cuda): fix cutlass matmul splitk limit
GitOrigin-RevId: fc9a7c638c
4 years ago
Megvii Engine Team
ef9aa80074
fix(mgb/dnn): fix cuda naive matmul algo
GitOrigin-RevId: 79c9bba73b
4 years ago
Megvii Engine Team
2d18074a70
fix(mgb): fix spelling error
GitOrigin-RevId: acae00e0a5
4 years ago
Megvii Engine Team
ff755451d2
refactor(mgb): move algo's name from info to desc and delete some algo's unnecessary param() method
GitOrigin-RevId: 144ff547d1
4 years ago
Megvii Engine Team
756c1eb7f2
fix(mgb/dnn): add cuda float naive matmul algo
GitOrigin-RevId: db7f7fc057
4 years ago
Megvii Engine Team
8494a1529e
chore(scripts): clarify and fix default value of bit combined enum
GitOrigin-RevId: 3716bf9bb5
4 years ago