Megvii Engine Team
|
3bf73ff16f
|
feat(dnn): add cuda preprocess fusion
GitOrigin-RevId: d789c99e59
|
4 years ago |
Megvii Engine Team
|
86cf7490ec
|
feat(dnn/aarch64): add quantizeds4 matmul int4x4x16_k8x8x8
GitOrigin-RevId: 7812900244
|
4 years ago |
Megvii Engine Team
|
142f31a875
|
perf(dnn/cuda): change conv_bias heu, prefer dnn chanwise impl, dislike dnn batch gemm conv1x1
GitOrigin-RevId: 323bf6073a
|
4 years ago |
Megvii Engine Team
|
f214e14695
|
refactor(mgb/cuda): use single implementation of get_device_prop from utils
GitOrigin-RevId: 5cc95472b9
|
4 years ago |
Megvii Engine Team
|
54e79dd1d9
|
perf(mgb/cuda): do not call cudaGetDeviceProperties to avoid io traffic
GitOrigin-RevId: 6aa35928c8
|
4 years ago |
Megvii Engine Team
|
98a74e4a7b
|
refactor(dnn): refactor opr proxy in test
GitOrigin-RevId: a1d8682e6f
|
4 years ago |
Megvii Engine Team
|
7066ad5ba6
|
feat(dnn): add uint16 support
GitOrigin-RevId: f4c4b1c7b9
|
4 years ago |
Megvii Engine Team
|
a1877ee0fa
|
refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm
GitOrigin-RevId: 479718ac75
|
4 years ago |
Megvii Engine Team
|
6f5d0febf1
|
perf(dnn/cuda): enhance performance for pooling forward
GitOrigin-RevId: 55fb2a9b25
|
4 years ago |
Megvii Engine Team
|
0560a218af
|
chore(dnn/test): refactor megdnn arm_common test
GitOrigin-RevId: 4168910301
|
4 years ago |
Megvii Engine Team
|
6856ce9ce2
|
feat(dnn): support conv bias activation for nchw4 input tensor format and nchw output tensor format
GitOrigin-RevId: 29cd73f87b
|
4 years ago |
Megvii Engine Team
|
60c6d59fc9
|
feat(mbg/core): support bias preprocess in conv_bias
GitOrigin-RevId: d2e1e14d41
|
4 years ago |
Megvii Engine Team
|
ff8ef9eda7
|
docs(dnn): add comments of weight prerpocess interface
GitOrigin-RevId: bb496ed219
|
4 years ago |
Megvii Engine Team
|
1f75c7ade4
|
ci(midout): fix midout and reopen midout test
we just test function, do not check size
GitOrigin-RevId: dce5387e83
|
4 years ago |
Megvii Engine Team
|
1e71e0afe0
|
refactor(dnn): refactor deconv algo
GitOrigin-RevId: 422be792eb
|
4 years ago |
Megvii Engine Team
|
89ad33aeb3
|
feat(dnn/cuda): support weight preprocessing for cutlass algorithms
GitOrigin-RevId: 7b77579acd
|
4 years ago |
Megvii Engine Team
|
c03249c059
|
feat(dnn/opr): add megdnn fake quant opr
GitOrigin-RevId: 5a04b6da2f
|
4 years ago |
Megvii Engine Team
|
739f927c4c
|
feat(dnn/cuda): opt dp4a conv for small channel base on cutlass
GitOrigin-RevId: 2a74c35f27
|
4 years ago |
Megvii Engine Team
|
1f8e40753f
|
fix(mkl): fix windows mkl LOG compute exception
GitOrigin-RevId: cd2ebaaec1
|
4 years ago |
Megvii Engine Team
|
4aa277a203
|
refactor(dnn/cuda): misc
GitOrigin-RevId: 1f8f91a0cc
|
4 years ago |
Megvii Engine Team
|
f7b2bdae1a
|
refactor(dnn): refactor algorithm type interface
GitOrigin-RevId: 843d885f82
|
4 years ago |
Megvii Engine Team
|
18ec5341f2
|
refactor(dnn): remove unused costmodel in cuda
GitOrigin-RevId: b15f0607b9
|
4 years ago |
Megvii Engine Team
|
e39f938662
|
refactor(dnn): remove ProfileCache and matmul algo in x86
GitOrigin-RevId: 55a700d747
|
4 years ago |
Megvii Engine Team
|
89303cd829
|
feat(megdnn/rocm): add bn for rocm backend
GitOrigin-RevId: 8bd49599b2
|
4 years ago |
Megvii Engine Team
|
aea829c9fa
|
feat(megdnn/rocm): add average inclusive mode for pooling
GitOrigin-RevId: 08162bd1fb
|
4 years ago |
Megvii Engine Team
|
1217801133
|
perf(mge): add opdef for broadcast
GitOrigin-RevId: 92f0af29eb
|
4 years ago |
Megvii Engine Team
|
2a3f4d099a
|
refactor(dnn/arm): refactor CPU heuristic algo selection
GitOrigin-RevId: 60d2646bb3
|
4 years ago |
Megvii Engine Team
|
ba66e1d039
|
feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct
GitOrigin-RevId: 581e31fc20
|
4 years ago |
Megvii Engine Team
|
a9f98e9c66
|
refactor(meg/internal): move interal codes back to megbrain
GitOrigin-RevId: b2dbda96be
|
4 years ago |
Megvii Engine Team
|
44b27f0d6e
|
build(3516): fix some cpu flags build failed and fix 3516 ycm
GitOrigin-RevId: a0c73fa18a
|
4 years ago |
Megvii Engine Team
|
8764a6c8ff
|
feat(dnn/cuda): add volta dp4a int8 sass kernel
GitOrigin-RevId: 9fefd39678
|
4 years ago |
Megvii Engine Team
|
3635af6274
|
style(atlas): add comment for async d2d
GitOrigin-RevId: 606a56ac4e
|
4 years ago |
Megvii Engine Team
|
d68d4d1d99
|
perf(atlas): use async d2d
GitOrigin-RevId: 55914631cb
|
4 years ago |
Megvii Engine Team
|
215f88f373
|
fix(dnn/argmxx): fix argmxx on inf
GitOrigin-RevId: 740f67b73a
|
4 years ago |
Megvii Engine Team
|
92b12685db
|
feat(dnn/aarch64): add aarch64 int8X8X16_mk4_k8x8x8 matmul, performance is better
GitOrigin-RevId: b6af21e8e3
|
4 years ago |
Megvii Engine Team
|
912d733ea9
|
fix(dnn): support bool for IndexingMultiAxisVec
GitOrigin-RevId: ddcfaa06b0
|
4 years ago |
Megvii Engine Team
|
edb32495c6
|
feat(dnn/opr): add megdnn adaptive pooling opr
GitOrigin-RevId: 563ce65479
|
4 years ago |
Megvii Engine Team
|
5a85c907e0
|
feat(mgb/opr): add megbrain adaptive pooling opr
GitOrigin-RevId: 82833f41d9
|
4 years ago |
Megvii Engine Team
|
310c805f20
|
fix(dnn/cuda): use kernel parameter instead of user constant memory
GitOrigin-RevId: 6080b24cc8
|
4 years ago |
Megvii Engine Team
|
b8ddca4c38
|
fix(atlas): add MGB_USE_ATLAS_ASYNC_API to enable async api
GitOrigin-RevId: ab821f4966
|
4 years ago |
Megvii Engine Team
|
95eb6ae380
|
feat(mgb/opr): let more ops support empty IO
GitOrigin-RevId: 84dddb4b23
|
4 years ago |
Megvii Engine Team
|
0307598a80
|
fix(dnn): keep consistent limit between deduce and compute
GitOrigin-RevId: 8de5f17ced
|
4 years ago |
Megvii Engine Team
|
75eebb7c42
|
feat(opr): use weight preprocess feature of MegDNN
GitOrigin-RevId: 779041f8a8
|
4 years ago |
Megvii Engine Team
|
cc952b2b92
|
fix(rocm): fix rocm megdnntest sleep and a cut code
GitOrigin-RevId: 26de5ca98b
|
4 years ago |
Megvii Engine Team
|
3a03fa7a50
|
fix(dnn/cuda): disable pascal sass conv2d
GitOrigin-RevId: 385d066595
|
4 years ago |
Megvii Engine Team
|
a5fad7d07c
|
feat(dnn): add compile for riscv64
GitOrigin-RevId: fa0c163527
|
4 years ago |
Megvii Engine Team
|
3e11d89415
|
fix(dnn/dump): add more info for dump CD4
GitOrigin-RevId: 5840afaacd
|
4 years ago |
Megvii Engine Team
|
76fa71573b
|
feat(dnn/cuda): add cutlass nchw4 convolution
GitOrigin-RevId: 93c9b212f4
|
4 years ago |
Megvii Engine Team
|
1f3f4abc38
|
fix(dnn): fix compile warnings
GitOrigin-RevId: 519a6c0c34
|
4 years ago |
Megvii Engine Team
|
5b6ebeb563
|
fix(mgb): append json file for dump and ready for midout open source
GitOrigin-RevId: 71ae7f1f4a
|
4 years ago |