Megvii Engine Team
3bda334798
fix(dnn/fallback): fix segfault caused by im2col/conv1x1 using
fallback naive matmul.
GitOrigin-RevId: 03ef904b11
4 years ago
Megvii Engine Team
87ff58f7fc
fix(megdnn): add algo for matmul/batchedmatrixmul of naive and opencl
GitOrigin-RevId: 2409b6ba16
4 years ago
Megvii Engine Team
a3caa5d3b7
fix(mgb(dnn)): fix convbias cudnnConvBiasActivation
GitOrigin-RevId: c0e44feffb
4 years ago
Megvii Engine Team
55042195d4
chore(winograd): add Convolutionv2 param
GitOrigin-RevId: 1a9e2ea340
4 years ago
Megvii Engine Team
409a877267
feat(dnn): add algo interface for rocm&fallback matmul and batched matrix mul
GitOrigin-RevId: dea03a0f7a
4 years ago
Megvii Engine Team
8f7f52ae4d
feat(jit): add memfwd in jit executor opr
GitOrigin-RevId: b58860bbe8
4 years ago
Megvii Engine Team
dfb2b2ce49
fix(dnn): change pooling window size smaller than padding constraint to log_error
GitOrigin-RevId: c3cda68f6d
4 years ago
Megvii Engine Team
d1fbec4fe2
feat(dnn/atlas): add atlas stub
GitOrigin-RevId: c63294378e
4 years ago
Megvii Engine Team
a85531dd0f
feat(mgb/opr): add tqt opr
GitOrigin-RevId: 49c62cd532
4 years ago
Megvii Engine Team
c3a4b2225d
feat(dnn/cuda): add cutlass impls for fused convolution reformat operation
GitOrigin-RevId: 02ef559c3f
4 years ago
Megvii Engine Team
5f44203d7b
feat(dnn/cuda): add a cutlass impl for fusing convolution and dimshuffle
GitOrigin-RevId: 3fc6faef01
4 years ago
Megvii Engine Team
61f917fb8e
feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle
GitOrigin-RevId: 51e025973f
4 years ago
Megvii Engine Team
eb826422c4
fix(dnn): forbid pooling window size smaller than padding
GitOrigin-RevId: 9ad61c409d
4 years ago
Megvii Engine Team
fc0fcd2f7f
chore(winograd): remove winograd transform code
GitOrigin-RevId: 78c3cfceae
4 years ago
Megvii Engine Team
d1adc9a22f
fix(dnn): fix opencl algo search
GitOrigin-RevId: 25997d0ef1
4 years ago
Megvii Engine Team
7e2b2dbffc
fix(dnn/test): delete large size in ARM_COMMON.FP32_GEVM
GitOrigin-RevId: 581ef43816
4 years ago
Megvii Engine Team
69e3e32240
feat(imperative): auto generated opdef header and python binding
GitOrigin-RevId: d2f22ad5fe
4 years ago
Megvii Engine Team
0398a7867f
fix(build/windows/cuda/llvm): fix windows bazel build with cuda
* Adapt to the new version llvm/clang-11
* fix windows bazel build with cuda
* add windows bazel build cuda ci
* opt windows bazel ci scripts
GitOrigin-RevId: 6ea7c66585
4 years ago
Megvii Engine Team
3bf73ff16f
feat(dnn): add cuda preprocess fusion
GitOrigin-RevId: d789c99e59
4 years ago
Megvii Engine Team
86cf7490ec
feat(dnn/aarch64): add quantizeds4 matmul int4x4x16_k8x8x8
GitOrigin-RevId: 7812900244
4 years ago
Megvii Engine Team
142f31a875
perf(dnn/cuda): change conv_bias heu, prefer dnn chanwise impl, dislike dnn batch gemm conv1x1
GitOrigin-RevId: 323bf6073a
4 years ago
Megvii Engine Team
f214e14695
refactor(mgb/cuda): use single implementation of get_device_prop from utils
GitOrigin-RevId: 5cc95472b9
4 years ago
Megvii Engine Team
54e79dd1d9
perf(mgb/cuda): do not call cudaGetDeviceProperties to avoid io traffic
GitOrigin-RevId: 6aa35928c8
4 years ago
Megvii Engine Team
98a74e4a7b
refactor(dnn): refactor opr proxy in test
GitOrigin-RevId: a1d8682e6f
4 years ago
Megvii Engine Team
7066ad5ba6
feat(dnn): add uint16 support
GitOrigin-RevId: f4c4b1c7b9
4 years ago
Megvii Engine Team
a1877ee0fa
refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm
GitOrigin-RevId: 479718ac75
4 years ago
Megvii Engine Team
6f5d0febf1
perf(dnn/cuda): enhance performance for pooling forward
GitOrigin-RevId: 55fb2a9b25
4 years ago
Megvii Engine Team
0560a218af
chore(dnn/test): refactor megdnn arm_common test
GitOrigin-RevId: 4168910301
4 years ago
Megvii Engine Team
6856ce9ce2
feat(dnn): support conv bias activation for nchw4 input tensor format and nchw output tensor format
GitOrigin-RevId: 29cd73f87b
4 years ago
Megvii Engine Team
60c6d59fc9
feat(mgb/core): support bias preprocess in conv_bias
GitOrigin-RevId: d2e1e14d41
4 years ago
Megvii Engine Team
ff8ef9eda7
docs(dnn): add comments for weight preprocess interface
GitOrigin-RevId: bb496ed219
4 years ago
Megvii Engine Team
1f75c7ade4
ci(midout): fix midout and reopen midout test
we just test function, do not check size
GitOrigin-RevId: dce5387e83
4 years ago
Megvii Engine Team
1e71e0afe0
refactor(dnn): refactor deconv algo
GitOrigin-RevId: 422be792eb
4 years ago
Megvii Engine Team
89ad33aeb3
feat(dnn/cuda): support weight preprocessing for cutlass algorithms
GitOrigin-RevId: 7b77579acd
4 years ago
Megvii Engine Team
c03249c059
feat(dnn/opr): add megdnn fake quant opr
GitOrigin-RevId: 5a04b6da2f
4 years ago
Megvii Engine Team
739f927c4c
feat(dnn/cuda): opt dp4a conv for small channel based on cutlass
GitOrigin-RevId: 2a74c35f27
4 years ago
Megvii Engine Team
1f8e40753f
fix(mkl): fix windows mkl LOG compute exception
GitOrigin-RevId: cd2ebaaec1
4 years ago
Megvii Engine Team
4aa277a203
refactor(dnn/cuda): misc
GitOrigin-RevId: 1f8f91a0cc
4 years ago
Megvii Engine Team
f7b2bdae1a
refactor(dnn): refactor algorithm type interface
GitOrigin-RevId: 843d885f82
4 years ago
Megvii Engine Team
18ec5341f2
refactor(dnn): remove unused costmodel in cuda
GitOrigin-RevId: b15f0607b9
4 years ago
Megvii Engine Team
e39f938662
refactor(dnn): remove ProfileCache and matmul algo in x86
GitOrigin-RevId: 55a700d747
4 years ago
Megvii Engine Team
89303cd829
feat(megdnn/rocm): add bn for rocm backend
GitOrigin-RevId: 8bd49599b2
4 years ago
Megvii Engine Team
aea829c9fa
feat(megdnn/rocm): add average inclusive mode for pooling
GitOrigin-RevId: 08162bd1fb
4 years ago
Megvii Engine Team
1217801133
perf(mge): add opdef for broadcast
GitOrigin-RevId: 92f0af29eb
4 years ago
Megvii Engine Team
2a3f4d099a
refactor(dnn/arm): refactor CPU heuristic algo selection
GitOrigin-RevId: 60d2646bb3
4 years ago
Megvii Engine Team
ba66e1d039
feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct
GitOrigin-RevId: 581e31fc20
4 years ago
Megvii Engine Team
a9f98e9c66
refactor(meg/internal): move internal code back to megbrain
GitOrigin-RevId: b2dbda96be
4 years ago
Megvii Engine Team
44b27f0d6e
build(3516): fix some cpu flags build failed and fix 3516 ycm
GitOrigin-RevId: a0c73fa18a
4 years ago
Megvii Engine Team
8764a6c8ff
feat(dnn/cuda): add volta dp4a int8 sass kernel
GitOrigin-RevId: 9fefd39678
4 years ago
Megvii Engine Team
3635af6274
style(atlas): add comment for async d2d
GitOrigin-RevId: 606a56ac4e
4 years ago