Megvii Engine Team | 369c2ccc5a | style(all): reformat c++ code | GitOrigin-RevId: 3ffd1b211f | 3 years ago
Megvii Engine Team | 11f022ff7c | feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt | GitOrigin-RevId: 229e1eb4be | 3 years ago
Megvii Engine Team | c25125e3d2 | perf(dnn/cuda): sass int8 epilogue remove shared load | GitOrigin-RevId: 2b49f5069b | 3 years ago
Megvii Engine Team | b18feaab33 | feat(dnn/cuda): use cutlass remove shared load imma conv kernel | GitOrigin-RevId: 0b5574f526 | 4 years ago
Megvii Engine Team | 3eb0505f9b | feat(imperative): add support for quantized conv transpose2d | GitOrigin-RevId: ffd6431299 | 3 years ago
Megvii Engine Team | 869a03271b | perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance | GitOrigin-RevId: d1b95a6f01 | 3 years ago
Megvii Engine Team | 8d248a6a9a | fix(dnn/cuda): fix testcase for fallback nchw qs8 conv | GitOrigin-RevId: 646440db59 | 4 years ago
Megvii Engine Team | 633016a962 | fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst | GitOrigin-RevId: 06f90f5cf3 | 4 years ago
Megvii Engine Team | 319436dd14 | feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias | GitOrigin-RevId: cf4536855a | 4 years ago
Megvii Engine Team | d28eba4ea5 | feat(dnn/cuda): add cutlass impls for int4 conv bias | GitOrigin-RevId: 878bb8c955 | 4 years ago
Megvii Engine Team | e250afb08f | feat(dnn/cuda): support conv_bias for nchw64 and qint4 | GitOrigin-RevId: 1c65ba87d7 | 4 years ago
Megvii Engine Team | f14e0c17e7 | feat(mgb): add recursive for fastrun and megdnn test | GitOrigin-RevId: 743846f645 | 4 years ago
Megvii Engine Team | 364afec033 | chore(mge): update copyright years | GitOrigin-RevId: 3c0690bcc1 | 4 years ago
Megvii Engine Team | c3a4b2225d | feat(dnn/cuda): add cutlass impls for fused convolution reformat operation | GitOrigin-RevId: 02ef559c3f | 4 years ago
Megvii Engine Team | 5f44203d7b | feat(dnn/cuda): add a cutlass impl for fusing convolution and dimshuffle | GitOrigin-RevId: 3fc6faef01 | 4 years ago
Megvii Engine Team | a1877ee0fa | refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm | GitOrigin-RevId: 479718ac75 | 4 years ago
Megvii Engine Team | 89ad33aeb3 | feat(dnn/cuda): support weight preprocessing for cutlass algorithms | GitOrigin-RevId: 7b77579acd | 4 years ago
Megvii Engine Team | 739f927c4c | feat(dnn/cuda): opt dp4a conv for small channel base on cutlass | GitOrigin-RevId: 2a74c35f27 | 4 years ago
Megvii Engine Team | 4aa277a203 | refactor(dnn/cuda): misc | GitOrigin-RevId: 1f8f91a0cc | 4 years ago
Megvii Engine Team | 310c805f20 | fix(dnn/cuda): use kernel parameter instead of user constant memory | GitOrigin-RevId: 6080b24cc8 | 4 years ago
Megvii Engine Team | 3a03fa7a50 | fix(dnn/cuda): disable pascal sass conv2d | GitOrigin-RevId: 385d066595 | 4 years ago
Megvii Engine Team | 76fa71573b | feat(dnn/cuda): add cutlass nchw4 convolution | GitOrigin-RevId: 93c9b212f4 | 4 years ago
Megvii Engine Team | aeffcd5897 | feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution | GitOrigin-RevId: 9d6c48ed99 | 4 years ago
Megvii Engine Team | 69fe5ab3b3 | feat(dnn/cuda): add conv2d-sass-kernel | GitOrigin-RevId: f284d5a4ce | 5 years ago
Megvii Engine Team | f91881ffdc | MegEngine: Initial commit of MegEngine. | GitOrigin-RevId: f0c8338beb | 5 years ago