Megvii Engine Team
|
1217801133
|
perf(mge): add opdef for broadcast
GitOrigin-RevId: 92f0af29eb
|
4 years ago |
Megvii Engine Team
|
2a3f4d099a
|
refactor(dnn/arm): refactor CPU heuristic algo selection
GitOrigin-RevId: 60d2646bb3
|
4 years ago |
Megvii Engine Team
|
ba66e1d039
|
feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct
GitOrigin-RevId: 581e31fc20
|
4 years ago |
Megvii Engine Team
|
a9f98e9c66
|
refactor(meg/internal): move internal codes back to megbrain
GitOrigin-RevId: b2dbda96be
|
4 years ago |
Megvii Engine Team
|
44b27f0d6e
|
build(3516): fix some cpu flags build failed and fix 3516 ycm
GitOrigin-RevId: a0c73fa18a
|
4 years ago |
Megvii Engine Team
|
8764a6c8ff
|
feat(dnn/cuda): add volta dp4a int8 sass kernel
GitOrigin-RevId: 9fefd39678
|
4 years ago |
Megvii Engine Team
|
3635af6274
|
style(atlas): add comment for async d2d
GitOrigin-RevId: 606a56ac4e
|
4 years ago |
Megvii Engine Team
|
d68d4d1d99
|
perf(atlas): use async d2d
GitOrigin-RevId: 55914631cb
|
4 years ago |
Megvii Engine Team
|
215f88f373
|
fix(dnn/argmxx): fix argmxx on inf
GitOrigin-RevId: 740f67b73a
|
4 years ago |
Megvii Engine Team
|
92b12685db
|
feat(dnn/aarch64): add aarch64 int8X8X16_mk4_k8x8x8 matmul, performance is better
GitOrigin-RevId: b6af21e8e3
|
4 years ago |
Megvii Engine Team
|
912d733ea9
|
fix(dnn): support bool for IndexingMultiAxisVec
GitOrigin-RevId: ddcfaa06b0
|
4 years ago |
Megvii Engine Team
|
edb32495c6
|
feat(dnn/opr): add megdnn adaptive pooling opr
GitOrigin-RevId: 563ce65479
|
4 years ago |
Megvii Engine Team
|
5a85c907e0
|
feat(mgb/opr): add megbrain adaptive pooling opr
GitOrigin-RevId: 82833f41d9
|
4 years ago |
Megvii Engine Team
|
310c805f20
|
fix(dnn/cuda): use kernel parameter instead of user constant memory
GitOrigin-RevId: 6080b24cc8
|
4 years ago |
Megvii Engine Team
|
b8ddca4c38
|
fix(atlas): add MGB_USE_ATLAS_ASYNC_API to enable async api
GitOrigin-RevId: ab821f4966
|
4 years ago |
Megvii Engine Team
|
95eb6ae380
|
feat(mgb/opr): let more ops support empty IO
GitOrigin-RevId: 84dddb4b23
|
4 years ago |
Megvii Engine Team
|
0307598a80
|
fix(dnn): keep consistent limit between deduce and compute
GitOrigin-RevId: 8de5f17ced
|
4 years ago |
Megvii Engine Team
|
75eebb7c42
|
feat(opr): use weight preprocess feature of MegDNN
GitOrigin-RevId: 779041f8a8
|
4 years ago |
Megvii Engine Team
|
cc952b2b92
|
fix(rocm): fix rocm megdnntest sleep and a cut code
GitOrigin-RevId: 26de5ca98b
|
4 years ago |
Megvii Engine Team
|
3a03fa7a50
|
fix(dnn/cuda): disable pascal sass conv2d
GitOrigin-RevId: 385d066595
|
4 years ago |
Megvii Engine Team
|
a5fad7d07c
|
feat(dnn): add compile for riscv64
GitOrigin-RevId: fa0c163527
|
4 years ago |
Megvii Engine Team
|
3e11d89415
|
fix(dnn/dump): add more info for dump CD4
GitOrigin-RevId: 5840afaacd
|
4 years ago |
Megvii Engine Team
|
76fa71573b
|
feat(dnn/cuda): add cutlass nchw4 convolution
GitOrigin-RevId: 93c9b212f4
|
4 years ago |
Megvii Engine Team
|
1f3f4abc38
|
fix(dnn): fix compile warnings
GitOrigin-RevId: 519a6c0c34
|
4 years ago |
Megvii Engine Team
|
5b6ebeb563
|
fix(mgb): append json file for dump and ready for midout open source
GitOrigin-RevId: 71ae7f1f4a
|
4 years ago |
Megvii Engine Team
|
16324e3076
|
feat(dnn/cuda): add remap backward
GitOrigin-RevId: 1b1bcf5db3
|
5 years ago |
Megvii Engine Team
|
bd73dabbe2
|
fix(dnn/build): add CUDNN_INCLUDE_DIR to the megdnn_test target
GitOrigin-RevId: 49d8842dd4
|
4 years ago |
Megvii Engine Team
|
343335932a
|
fix(dnn/arm): fix read invalid data in arm kernel
GitOrigin-RevId: f1c4cae667
|
4 years ago |
Megvii Engine Team
|
59dcd3b7f3
|
fix(mgb/build): do not install cutlass
GitOrigin-RevId: ba8b047659
|
4 years ago |
Megvii Engine Team
|
6e882c1a86
|
feat(whl/imperative): compat for build python whl imperative and legacy runtime
GitOrigin-RevId: 7f6629ae1f
|
4 years ago |
Megvii Engine Team
|
7f857bd471
|
feat(mgb/rocm): add cmake for rocm and fix compile errors and bn
GitOrigin-RevId: c73ed4adc3
|
4 years ago |
Megvii Engine Team
|
199eefbd4c
|
fix(dnn): generate mode files
GitOrigin-RevId: 9b1e840f00
|
4 years ago |
Megvii Engine Team
|
9510136223
|
fix(mgb/rocm): remove begin-internal of rocm
GitOrigin-RevId: 1523833fcb
|
4 years ago |
Megvii Engine Team
|
6b380e8965
|
feat(mge/imperative): run oss test and restore cmake list build items
GitOrigin-RevId: 11411b6964
|
4 years ago |
Megvii Engine Team
|
0380811218
|
feat(dnn/arm_common): add nchw44 8x8x16 stride1 stride2 2x2 3x3 5x5 7x7 directconv
GitOrigin-RevId: 3710182af1
|
4 years ago |
Megvii Engine Team
|
00ef677249
|
fix(mgb): remove internal for cambricon and atlas
GitOrigin-RevId: 861e349eb4
|
4 years ago |
Megvii Engine Team
|
aeffcd5897
|
feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution
GitOrigin-RevId: 9d6c48ed99
|
4 years ago |
Megvii Engine Team
|
9e5e32dee2
|
fix(dnn): restore opr_param_defs.py
GitOrigin-RevId: b92747cad3
|
4 years ago |
Megvii Engine Team
|
d334b229b0
|
feat(imperative): add nms opr wrapper
GitOrigin-RevId: d92241a234
|
4 years ago |
Megvii Engine Team
|
bca00f2e22
|
fix(dnn): midout at where necessary in megdnn
GitOrigin-RevId: 191334bd96
|
4 years ago |
Megvii Engine Team
|
a1e6720756
|
feat(dnn): enable bool comparison
GitOrigin-RevId: 735693b81e
|
4 years ago |
Megvii Engine Team
|
8aa34e4a5d
|
feat(imperative): add advance indexing with bool
also fix bug to deal with empty tensor in np2tensor
GitOrigin-RevId: 2bd8152c90
|
4 years ago |
Megvii Engine Team
|
101b58d1ca
|
fix(dnn): enable bool input to cond_take
GitOrigin-RevId: 532fa7c073
|
4 years ago |
Megvii Engine Team
|
4a178a8dba
|
feat(windows/cuda/cmake): support cmake cuda build on windows
GitOrigin-RevId: 4d9832e559
|
4 years ago |
Megvii Engine Team
|
6aade1336d
|
fix(dnn/fallback): disable im2col/conv1x1/conv1x1_gemv Quantized8Asymm in x86
GitOrigin-RevId: b094634254
|
4 years ago |
Megvii Engine Team
|
56381f808b
|
fix(dnn/arm): use vcvtq_f32_s32 for all arm code
GitOrigin-RevId: 27effe7d24
|
4 years ago |
Megvii Engine Team
|
1173205726
|
fix(gopt): nchw_nchwxx useable and opt pass use nchw_nchwxx_valid
GitOrigin-RevId: 60942aca5b
|
4 years ago |
Megvii Engine Team
|
eb18eba87d
|
fix(gopt): fix nchw44 nchw44_dot gopt test
GitOrigin-RevId: 06b38dcd30
|
4 years ago |
Megvii Engine Team
|
40e79e9dab
|
fix(dnn/x86): fix x86 matrix usable ignore format
GitOrigin-RevId: 40fe508aca
|
4 years ago |
Megvii Engine Team
|
2272abe18d
|
fix(mgb/fallback): disable nchw44 in conv1x1 and im2col in x86
GitOrigin-RevId: 603d2eb94a
|
4 years ago |