Megvii Engine Team
4d35397bdf
fix(dnn/fallback): fix conv1x1/im2col usable and fuse-conv-bias get fp32xfp32-->qint8 error
GitOrigin-RevId: 5a3bfedd8a
5 years ago
Megvii Engine Team
6b2760dd72
feat(dnn/fallback): add float32 nchw44 fuse packb 3x3 s2
GitOrigin-RevId: 3b664bb4f5
5 years ago
Megvii Engine Team
2b4b4d66d9
feat(dnn/fallback): add aarch64 mk4 dot 3x3 s1 fuse packb
GitOrigin-RevId: 3e69878d8d
5 years ago
Megvii Engine Team
a1677d7aa9
feat(dnn/arm_common): add fp32 gevm
GitOrigin-RevId: 4d348bbb34
5 years ago
Megvii Engine Team
5d950063cf
feat(dnn): refactor dot gemv for both aarch64 and aarch32
GitOrigin-RevId: 2b98867e45
5 years ago
Megvii Engine Team
53c288a304
fix(dnn/cuda): fix topk grid oversize
GitOrigin-RevId: d3c811a034
5 years ago
Megvii Engine Team
124767b4f8
fix(dnn/fallback): fix mk4_dot test after removing mk4_dot_8x6x4 matmul
GitOrigin-RevId: e3a12cf9b3
5 years ago
Megvii Engine Team
34659c2ea4
fix(mgb/dnn): remove armv7 matmul mk4dot block 8x6
GitOrigin-RevId: 4c746ef228
5 years ago
Megvii Engine Team
48ac1e1abd
feat(dnn/fallback): delete nopack onlypacka noneed datatype, and add
im2col and conv1x1 mk4_dot support
GitOrigin-RevId: 096b16a3ab
5 years ago
Megvii Engine Team
3117bfb738
fix(dnn/arm): nchw44 direct int8 support 8832
GitOrigin-RevId: 696fa05d94
5 years ago
Megvii Engine Team
9f352b1c45
feat(megbrain/dnn): add indexing remap int32 for naive and cuda
GitOrigin-RevId: 5f66d51de4
5 years ago
Megvii Engine Team
5dbf218d19
feat(dnn/x86): add sse 8816 matmul
GitOrigin-RevId: ed8d9ee5db
5 years ago
Megvii Engine Team
25b6a13148
feat(dnn/x86): add x86 avx2 8x8x16 matmul
GitOrigin-RevId: d2172c50b2
5 years ago
Megvii Engine Team
273f891b55
fix(mgb/gopt): fix run-time winograd-transform and nchwxx error
GitOrigin-RevId: aca796f17d
5 years ago
Megvii Engine Team
02abc36ea6
fix(mgb/arm_common): fix nchw44-dot misc issue
GitOrigin-RevId: f870ad964c
5 years ago
Megvii Engine Team
9ed3882a94
fix(opr/dnn): fix winograd fast run mismatch
GitOrigin-RevId: d308085b9f
5 years ago
Megvii Engine Team
18be23f328
fix(mgb/gopt): fix nchwxx gopt with no fuse conv_bias and winograd
fast-run
GitOrigin-RevId: 49ccbdf2d4
5 years ago
Megvii Engine Team
65ec4f7c26
fix(ci): fix test timeout
GitOrigin-RevId: 875fc613cf
5 years ago
Megvii Engine Team
ea6bfe6cd9
fix(dnn/cuda-stub): simplify and use proper search paths
Removed the `access()` call before `dlopen()`.
It was copy-pasted from the opencl-stub, does not make sense here, and
prevents `dlopen()` from loading `libcuda.so` from non-default path.
Updated the name of the library providing CUDA Driver API on different
platforms; these are harvested from the following file in a CUDA
install:
samples/6_Advanced/matrixMulDynlinkJIT/cuda_drvapi_dynlink.c
GitOrigin-RevId: ed43cab8c8
5 years ago
Megvii Engine Team
32c86211ee
fix(dnn/cuda): enable cuda algos for nchw quantized
GitOrigin-RevId: 4d1e167b86
5 years ago
Megvii Engine Team
7b0dbe6af8
fix(dnn/arm): fix stride 1 support for int8 nchw_nchw44
GitOrigin-RevId: 9d718eb7a4
5 years ago
Megvii Engine Team
198f3eb5f6
fix(dnn/arm): fix fp32 nchw44 direct workspace bug
GitOrigin-RevId: 6ee433b02c
5 years ago
Megvii Engine Team
9e876203b5
feat(dnn): add int8 direct conv dot nchw44
GitOrigin-RevId: 31830ba7a4
5 years ago
Megvii Engine Team
09ceaaaecf
fix(dnn/arm): stride1 support for nchw_nchw44 fp32 conv
GitOrigin-RevId: 744c5db3dc
5 years ago
Megvii Engine Team
f56f187f6e
fix(mgb/gopt): fix nchw44-dot channel wise trans to nchw44
GitOrigin-RevId: aa2059a796
5 years ago
Megvii Engine Team
4f8e60801c
feat(dnn): fix Werror by adding macro
GitOrigin-RevId: 1f5fe4d46a
5 years ago
Megvii Engine Team
3966bb08b3
feat(dnn/test): split cpu.convolution
GitOrigin-RevId: fa28d3d760
5 years ago
Megvii Engine Team
8f87a3e988
feat(dnn/arm_common): add int8 nchw44 winograd f23_4x4 f23_8x8 compute float32/int16 output int8
GitOrigin-RevId: d99ef7efcd
5 years ago
Megvii Engine Team
8ffed043be
fix(dnn/x86): fix matrix_mul quantized performance on vnni
GitOrigin-RevId: 4af6b8be60
5 years ago
Megvii Engine Team
1d860f4d6f
fix(dnn/x86): fix dnnl int8 algo on vnni
GitOrigin-RevId: 2384e09558
5 years ago
Megvii Engine Team
871e6a516f
feat(dnn/x86): opt x86 quantized heuristic
GitOrigin-RevId: 72abe9efcc
5 years ago
Megvii Engine Team
6c29548d20
fix(dnn/arm): fix nchw_nchw44 dot stride1 support
GitOrigin-RevId: c8d3d55b25
5 years ago
Megvii Engine Team
02cbb13bbc
fix(dnn/arm): fix nchw44 fp32 direct algo oh block and unused stride2 algo
GitOrigin-RevId: 8012678fae
5 years ago
Megvii Engine Team
30b3d3aa3e
fix(dnn/gopt): add convolution nchw44-dot format gopt
GitOrigin-RevId: e8e1e96379
5 years ago
Megvii Engine Team
48d1ac1433
fix(dnn/arm): fix consistency between create_conv1x1_strategy and can_create_conv1x1_strategy
GitOrigin-RevId: 2d32998aca
5 years ago
Megvii Engine Team
a1f8ecc74f
fix(dnn/naive): add convolution nchw44-dot format
GitOrigin-RevId: 87a7c9c575
5 years ago
Megvii Engine Team
73d8416273
feat(dnn/aarch64): add matmul with dotprod for mk4
GitOrigin-RevId: feb391d635
5 years ago
Megvii Engine Team
c1397792a7
feat(dnn): add winograd-fp32-nchw44 support
GitOrigin-RevId: a6e2e735f1
5 years ago
Megvii Engine Team
3c32ad6d6d
feat(dnn/x86): imp avx2 int8 stride2 chanwise conv
GitOrigin-RevId: 288792de42
5 years ago
Megvii Engine Team
8937452153
fix(dnn/arm_common): add nchw44 float channel wise s1/s2
GitOrigin-RevId: 73e6aa1e57
5 years ago
Megvii Engine Team
9f997ac5ce
fix(dnn/x86): enable i8i8i16 gemv used in conv
GitOrigin-RevId: d946e22243
5 years ago
Megvii Engine Team
36e3bb6ea7
feat(mgb/dnn): add armv7 mk4_dot matmul
GitOrigin-RevId: d4206f8e21
5 years ago
Megvii Engine Team
580a275332
feat(dnn/arm): add nchw44 fp32 direct stride 1
GitOrigin-RevId: 65f54a4f7e
5 years ago
Megvii Engine Team
ad3c931553
feat(dnn/arm): add arm nchw44 fp32 pooling
GitOrigin-RevId: 6a26dad0a1
5 years ago
Megvii Engine Team
27ef788f84
feat(dnn/armv7): add armv7 mk4 matmul
GitOrigin-RevId: 8ef24bf53b
5 years ago
Megvii Engine Team
9320bf92af
feat(mgb/dnn): add matmul mk4 dot naive test
GitOrigin-RevId: 2f16d4f89b
5 years ago
Megvii Engine Team
a6bc250d1c
feat(dnn/common): add matmul impl for naive with matrix format mk4_dot
GitOrigin-RevId: 7c6fbdfa97
5 years ago
Megvii Engine Team
270b74886a
feat(dnn/fallback): support mk4 fp32 im2col
GitOrigin-RevId: 178d723172
5 years ago
Megvii Engine Team
a4879fc67a
feat(cmake/cross_build/host_build/windows): imp windows
host build and cross build
now cmake status:
a: host build
1: windows build -- ok
2: linux build -- ok
3: macos build -- ok
b: cross build
1: windows cross build arm-android -- ok
2: windows cross build arm-linux -- ok
3: linux cross build arm-android -- ok
4: linux cross build arm-linux -- ok
5: macos cross build arm-android -- ok
6: macos cross build arm-linux -- ok
7: macos cross build ios -- ok
GitOrigin-RevId: f7f376fe8c
5 years ago
Megvii Engine Team
cdefe90ecc
feat(dnn/fallback): support mk4 fp32 conv1x1
GitOrigin-RevId: 301ef0137f
5 years ago