Megvii Engine Team
b90c1540db
fix(dnn/naive): fix midout for pooling
GitOrigin-RevId: 4edd99f3ec
5 years ago
Megvii Engine Team
df47637d03
fix(dnn/naive): fix midout for relayout_format
GitOrigin-RevId: 6ff9e2280e
5 years ago
Megvii Engine Team
f60ab501ef
fix(mgb/opt): nchw to nchw4 pass suppport ic less than 4
GitOrigin-RevId: a3c205f38f
5 years ago
Megvii Engine Team
8ec099221f
fix(dnn): fix Image2DPack4TensorFormat check
GitOrigin-RevId: b9a8ae4e1a
5 years ago
Megvii Engine Team
28d85838ef
feat(dnn): add relayout_format for nchw to nchw4 and ic <=4
GitOrigin-RevId: 07f2ee6c5b
5 years ago
Megvii Engine Team
3a53872f73
fix(dnn/native): also fix native logic
GitOrigin-RevId: a80f090271
5 years ago
Megvii Engine Team
43b42a651c
fix(dnn/cuda): fix indexing logic in psroi_pooling
a variable relating to indexing was not computed correctly
GitOrigin-RevId: 548c8f3f14
5 years ago
Megvii Engine Team
be205727bc
fix(mge): fix some warnings
GitOrigin-RevId: 38b285f991
5 years ago
Megvii Engine Team
786afef461
feat(build): install CMake config module and pkg-config descriptor
Also, upgrade to CMake 3.13.
The commit also contains significant refactors, as otherwise it is not
possible to properly export target `megengine` to
MegEngine-targets.cmake:
1. Optionally use system provided Flatbuffers.
2. Optionally use system provided MKL-DNN (Tested with Debian).
3. Refactor megbrain and megdnn targets into object libraries.
4. Set different path in BUILD_INTERFACE and INSTALL_INTERFACE of
various target_include_directories.
5. Specify PUBLIC/PRIVATE on various target_link_libraries.
GitOrigin-RevId: df118a879e
5 years ago
Megvii Engine Team
4d35397bdf
fix(dnn/fallback): fix conv1x1/im2col usable and fuse-conv-bias get fp32xfp32-->qint8 error
GitOrigin-RevId: 5a3bfedd8a
5 years ago
Megvii Engine Team
6b2760dd72
feat(dnn/fallback): add float32 nchw44 fuse packb 3x3 s2
GitOrigin-RevId: 3b664bb4f5
5 years ago
Megvii Engine Team
2b4b4d66d9
feat(dnn/fallback): add aarch64 mk4 dot 3x3 s1 fuse packb
GitOrigin-RevId: 3e69878d8d
5 years ago
Megvii Engine Team
a1677d7aa9
feat(dnn/arm_common): add fp32 gevm
GitOrigin-RevId: 4d348bbb34
5 years ago
Megvii Engine Team
5d950063cf
feat(dnn): refactor dot gemv for both aarch64 and aarch32
GitOrigin-RevId: 2b98867e45
5 years ago
Megvii Engine Team
53c288a304
fix(dnn/cuda): fix topk grid oversize
GitOrigin-RevId: d3c811a034
5 years ago
Megvii Engine Team
124767b4f8
fix(dnn/fallback): fix mk4_dot test after remove mk4_dot_8x6x4 matmul
GitOrigin-RevId: e3a12cf9b3
5 years ago
Megvii Engine Team
34659c2ea4
fix(mgb/dnn): remove armv7 matmul mk4dot block 8x6
GitOrigin-RevId: 4c746ef228
5 years ago
Megvii Engine Team
48ac1e1abd
feat(dnn/fallback): delete nopack onlypacka noneed datatype,and add
im2co and conv1x1 mk4_dot support
GitOrigin-RevId: 096b16a3ab
5 years ago
Megvii Engine Team
3117bfb738
fix(dnn/arm): nchw44 direct int8 support 8832
GitOrigin-RevId: 696fa05d94
5 years ago
Megvii Engine Team
9f352b1c45
feat(megbrain/dnn): add indexing remap int32 for naive and cuda
GitOrigin-RevId: 5f66d51de4
5 years ago
Megvii Engine Team
5dbf218d19
feat(dnn/x86): add sse 8816 matmul
GitOrigin-RevId: ed8d9ee5db
5 years ago
Megvii Engine Team
25b6a13148
feat(dnn/x86): add x86 avx2 8x8x16 matmul
GitOrigin-RevId: d2172c50b2
5 years ago
Megvii Engine Team
273f891b55
fix(mgb/gopt): fix run-time winograd-transform and nchwxx error
GitOrigin-RevId: aca796f17d
5 years ago
Megvii Engine Team
02abc36ea6
fix(mbg/arm_common): fix nchw44-dot misc issue
GitOrigin-RevId: f870ad964c
5 years ago
Megvii Engine Team
9ed3882a94
fix(opr/dnn): fix winograd fast run mismatch
GitOrigin-RevId: d308085b9f
5 years ago
Megvii Engine Team
18be23f328
fix(mbg/gopt): fix nchwxx gopt with no fuse conv_bias and winograd
fast-run
GitOrigin-RevId: 49ccbdf2d4
5 years ago
Megvii Engine Team
65ec4f7c26
fix(ci): fix test timeout
GitOrigin-RevId: 875fc613cf
5 years ago
Megvii Engine Team
ea6bfe6cd9
fix(dnn/cuda-stub): simplify and use proper search paths
Removed the `access()` call before `dlopen()`.
It was copy-pasted from the opencl-stub, does not make sense here, and
prevents `dlopen()` from loading `libcuda.so` from non-default path.
Updated the name of the library providing CUDA Driver API on different
platforms, these are harvested from the following file in a CUDA
install:
samples/6_Advanced/matrixMulDynlinkJIT/cuda_drvapi_dynlink.c
GitOrigin-RevId: ed43cab8c8
5 years ago
Megvii Engine Team
32c86211ee
fix(dnn/cuda): enable cuda algos for nchw quantized
GitOrigin-RevId: 4d1e167b86
5 years ago
Megvii Engine Team
7b0dbe6af8
fix(dnn/arm): fix stride 1 support for int8 nchw_nchw44
GitOrigin-RevId: 9d718eb7a4
5 years ago
Megvii Engine Team
198f3eb5f6
fix(dnn/arm): fix fp32 nchw44 direct workspace bug
GitOrigin-RevId: 6ee433b02c
5 years ago
Megvii Engine Team
9e876203b5
feat(dnn): add int8 direct conv dot nchw44
GitOrigin-RevId: 31830ba7a4
5 years ago
Megvii Engine Team
09ceaaaecf
fix(dnn/arm): stride1 support for nchw_nchw44 fp32 conv
GitOrigin-RevId: 744c5db3dc
5 years ago
Megvii Engine Team
f56f187f6e
fix(mbg/gopt): fix nchw44-dot channel wise trans to nchw44
GitOrigin-RevId: aa2059a796
5 years ago
Megvii Engine Team
4f8e60801c
feat(dnn): fix Werror by adding macro
GitOrigin-RevId: 1f5fe4d46a
5 years ago
Megvii Engine Team
3966bb08b3
feat(dnn/test): split cpu.convolution
GitOrigin-RevId: fa28d3d760
5 years ago
Megvii Engine Team
8f87a3e988
feat(dnn/arm_common): add int8 nchw44 winograd f23_4x4 f23_8x8 compute float32/int16 output int8
GitOrigin-RevId: d99ef7efcd
5 years ago
Megvii Engine Team
8ffed043be
fix(dnn/x86): fix matrix_mul quantized performance on vnni
GitOrigin-RevId: 4af6b8be60
5 years ago
Megvii Engine Team
1d860f4d6f
fix(dnn/x86): fix dnnl int8 algo on vnni
GitOrigin-RevId: 2384e09558
5 years ago
Megvii Engine Team
871e6a516f
feat(dnn/x86): opt x86 quantized heuristic
GitOrigin-RevId: 72abe9efcc
5 years ago
Megvii Engine Team
6c29548d20
fix(dnn/arm): fix nchw_nchw44 dot stride1 support
GitOrigin-RevId: c8d3d55b25
5 years ago
Megvii Engine Team
02cbb13bbc
fix(dnn/arm): fix nchw44 fp32 direct algo oh block and unused stride2 algo
GitOrigin-RevId: 8012678fae
5 years ago
Megvii Engine Team
30b3d3aa3e
fix(dnn/gopt): add convolution nchw44-dot format gopt
GitOrigin-RevId: e8e1e96379
5 years ago
Megvii Engine Team
48d1ac1433
fix(dnn/arm): fix consistence between create_conv1x1_strategy and can_create_conv1x1_strategy
GitOrigin-RevId: 2d32998aca
5 years ago
Megvii Engine Team
a1f8ecc74f
fix(dnn/naive): add convolution nchw44-dot format
GitOrigin-RevId: 87a7c9c575
5 years ago
Megvii Engine Team
73d8416273
feat(dnn/aarch64): add matmul with dotprod for mk4
GitOrigin-RevId: feb391d635
5 years ago
Megvii Engine Team
c1397792a7
feat(dnn): add winograd-fp32-nchw44 support
GitOrigin-RevId: a6e2e735f1
5 years ago
Megvii Engine Team
3c32ad6d6d
feat(dnn/x86): imp avx2 int8 stride2 chanwise conv
GitOrigin-RevId: 288792de42
5 years ago
Megvii Engine Team
8937452153
fix(dnn/arm_common): add nchw44 float channel wise s1/s2
GitOrigin-RevId: 73e6aa1e57
5 years ago
Megvii Engine Team
9f997ac5ce
fix(dnn/x86): enable i8i8i16 gemv used in conv
GitOrigin-RevId: d946e22243
5 years ago