huangxinda
e67fefcdca
ci(yml): enable the try-import branch to invoke ci on push
4 years ago
Megvii Engine Team
4d72e7071d
deps: update cutlass
4 years ago
Megvii Engine Team
355153e158
feat(mge/dtr): add DTR in computing graph
GitOrigin-RevId: 8941810319
4 years ago
Megvii Engine Team
76f4f97536
refactor(sublinear): add SeqModifierBase
GitOrigin-RevId: 2d0393be6b
4 years ago
Megvii Engine Team
f584416aa2
fix(dnn/bn): revise the conditions for inplace flag
GitOrigin-RevId: 59a104bf6a
4 years ago
Megvii Engine Team
a9b60fbfb5
fix(ci/lite): reopen lite_test build by cmake
for some reason, lite_test needs to statically link lite
when cuda is enabled on gcc7 and gcc8; otherwise
cask_trt::AbiInfo::~AbiInfo is called twice at the
atexit stage, which leads to a double free at the
end of the test. gcc9 does not have this issue. For
compatibility with all CI envs, we use static linking.
GitOrigin-RevId: 1dc2115948
4 years ago
Megvii Engine Team
2eea00097c
feat(mgb): add fast run batch size graph option
GitOrigin-RevId: 94e333ec80
4 years ago
Megvii Engine Team
0ac642b5d5
fix(imperative): persistent cache write through on put
GitOrigin-RevId: f9408ae504
4 years ago
Megvii Engine Team
47dcdf3e17
fix(mgb/core): fix dtype and resize modifiers for tensor
GitOrigin-RevId: a9d95a4cd8
4 years ago
Megvii Engine Team
29f7cdb84a
fix(mgb/opr): correct nvof out shape computation
GitOrigin-RevId: 16bf086e92
4 years ago
Megvii Engine Team
71cc814eaf
feat(ci): add aarch64 linux ci
GitOrigin-RevId: 2c0d3a8cc2
4 years ago
Megvii Engine Team
31a1f53817
feat(whl/opencl): enable OpenCL in python whl
GitOrigin-RevId: a1c34ef40b
4 years ago
Megvii Engine Team
b07f372835
feat(aarch64/whl): support aarch64 whl
GitOrigin-RevId: 656a27d62b
4 years ago
Megvii Engine Team
d8ee0d7b5c
fix(mge/distributed): fix the multi dataloader test error
GitOrigin-RevId: 86c8925916
4 years ago
Megvii Engine Team
e275dfeca1
feat(imperative/python): support pooling mode "average" for avg pool2d module
GitOrigin-RevId: 9fe442129f
4 years ago
Megvii Engine Team
03ab8136e7
fix(core): fix asan error caused by wild thread_pool ptr
GitOrigin-RevId: b1c1b452cd
4 years ago
Megvii Engine Team
24a3878130
feat(dnn/cuda): add nchw conv u4xs4 support
GitOrigin-RevId: 5edba47bd9
4 years ago
Megvii Engine Team
606540bef4
feat(dnn/cuda): add nhwc 4bit warp perspective
GitOrigin-RevId: fbec4a4a1f
4 years ago
Megvii Engine Team
1e6019436c
feat(dnn/cuda): add nhwc int4 pooling
GitOrigin-RevId: 9cf14cde4e
4 years ago
Megvii Engine Team
0fb9cc41e4
fix(gopt): fix nchw64 opt pass
GitOrigin-RevId: dec18d1ab1
4 years ago
Megvii Engine Team
e661ae904f
feat(dnn/cuda): add base class for cutlass uint4 and int4 algos
GitOrigin-RevId: a4d42f032c
4 years ago
Megvii Engine Team
319436dd14
feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias
GitOrigin-RevId: cf4536855a
4 years ago
Megvii Engine Team
d28eba4ea5
feat(dnn/cuda): add cutlass impls for int4 conv bias
GitOrigin-RevId: 878bb8c955
4 years ago
Megvii Engine Team
14b65e4da7
feat(dnn/cuda): add reduce_filter_and_update_bias
GitOrigin-RevId: 31b6e6b0ab
4 years ago
Megvii Engine Team
2d4e62ef58
feat(dnn/cuda): add cuda uint4 pooling
GitOrigin-RevId: a728977206
4 years ago
Megvii Engine Team
19919384fc
feat(dnn/cuda): add cuda uint warp perspective
GitOrigin-RevId: 2aec72010f
4 years ago
Megvii Engine Team
01354337a9
fix(mge/autodiff): fix incorrect handling of tuple dy
GitOrigin-RevId: beca8e3711
4 years ago
Megvii Engine Team
5868d1fe4f
fix(arm_common/pooling): check mode in pooling algo to avoid wrongly using AVERAGE_COUNT_EXCLUDE_PADDING
GitOrigin-RevId: 7a2d243db7
4 years ago
Megvii Engine Team
86b69cacd0
fix(dnn): fixes for int4
GitOrigin-RevId: 845e164fd3
4 years ago
Megvii Engine Team
4a802d21ca
feat(dnn/cuda): add conv u4xs4 sass kernel
GitOrigin-RevId: 4defcf5f1f
4 years ago
Megvii Engine Team
adf75a291d
perf(dnn/cuda): add sass int4 128x128
GitOrigin-RevId: 1bc5482102
4 years ago
Megvii Engine Team
8da2f698a3
feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64
GitOrigin-RevId: 39f29ec990
4 years ago
Megvii Engine Team
c218d4b029
feat(dnn/cuda): fallback conv qs4 support channel not aligned to 64
GitOrigin-RevId: f0d080f35c
4 years ago
Megvii Engine Team
4fe68ac9ed
feat(dnn/cuda): support transforming layout between nchw and nchw64 when channel not aligned to 64
GitOrigin-RevId: e9ecbcf2e2
4 years ago
Megvii Engine Team
ae6ff2c5a6
feat(mgb/gopt): add opt pass for nchw64 layout transform
GitOrigin-RevId: adede7cef6
4 years ago
Megvii Engine Team
63a9bd30a8
feat(mgb/gopt): add an opt pass for padding channels to enable fast int8/int4 support on GPU
GitOrigin-RevId: 94c719bb5c
4 years ago
Megvii Engine Team
56e863b7d4
fix(dnn/cuda): fix int4 epilogue stg bug
GitOrigin-RevId: e86da9a8a8
4 years ago
Megvii Engine Team
cff61a53d4
perf(dnn/cuda): optimize int4 sass conv main loop and epilogue without fuse_z
GitOrigin-RevId: 4274e58d64
4 years ago
Megvii Engine Team
12a0e61542
feat(dnn/cuda): add cuda elemwise int4
GitOrigin-RevId: 8a9aaec328
4 years ago
Megvii Engine Team
df1af59b5c
feat(dnn): warp perspective support int4
GitOrigin-RevId: 826a43b349
4 years ago
Megvii Engine Team
2398df079c
feat(dnn/cuda): add cuda int4 pooling
GitOrigin-RevId: 14ed4e6f00
4 years ago
Megvii Engine Team
2a2a7f4552
test(mgb/opr): add testcase for conv bias int4
GitOrigin-RevId: e3fff5e30b
4 years ago
Megvii Engine Team
858261af1f
fix(python_module): fix conversion between numpy-ndarray and mgb tensor for qint4 and quint4
GitOrigin-RevId: 7450c4f25e
4 years ago
Megvii Engine Team
e250afb08f
feat(dnn/cuda): support conv_bias for nchw64 and qint4
GitOrigin-RevId: 1c65ba87d7
4 years ago
Megvii Engine Team
3b9b87809d
refactor(dnn): refactor lowbit tensor format
GitOrigin-RevId: b646dc085b
4 years ago
Megvii Engine Team
c74660ea88
fix(dnn/cuda): fix invalid local read for relayout format kernel
GitOrigin-RevId: 5a77b82212
4 years ago
Megvii Engine Team
8fef78d06d
feat(dnn/cuda): add relayout format when width is an odd number
GitOrigin-RevId: f059f1f56d
4 years ago
Megvii Engine Team
91d6160769
feat(dnn/common): add tensor format for low-bits tensor layout
GitOrigin-RevId: 0aa3753f37
4 years ago
Megvii Engine Team
19a554d674
test(dnn/cuda): add testcase for transforming tensor layout between nchw and nchw64
GitOrigin-RevId: 75d579635a
4 years ago
Megvii Engine Team
71c2f61254
feat(dnn/cuda): add relayout format to support layout transform between NCHW and NCHW64
GitOrigin-RevId: 1445ecfabe
4 years ago