Megvii Engine Team
|
1404437a90
|
fix(mgb): fix the compatibility issue of cuda stub with older version drivers
GitOrigin-RevId: 628afbf3cf
|
2 years ago |
Megvii Engine Team
|
a6a2646c10
|
feat(arm): add AlgoFP32Winograd F43, and add filter size into name of winograd-related algorithms
GitOrigin-RevId: 909503a90d
|
2 years ago |
Megvii Engine Team
|
b8821edb3d
|
perf(dnn/aarch64): optimize aarch64 sigmoid with asm
GitOrigin-RevId: 7d54d67669
|
2 years ago |
Megvii Engine Team
|
2b99bfec4e
|
feat(arm): supports weight pre-processing for winograd benchmark tests
GitOrigin-RevId: 1797f3b91c
|
2 years ago |
Megvii Engine Team
|
421bcfd3d8
|
style(mgb/tools): add format for tools, dnn and ci
GitOrigin-RevId: 5684e5ea43
|
3 years ago |
Megvii Engine Team
|
116781ba9c
|
fix(mgb): fix megtee build errors
GitOrigin-RevId: b351dd3994
|
2 years ago |
Megvii Engine Team
|
54b5db1729
|
feat(x86/rvv): add AGENT_NCHW_NCHW44 algo
GitOrigin-RevId: 8cf6c3fac0
|
2 years ago |
Megvii Engine Team
|
eaa180181a
|
feat(x86/rvv): opt gi intrinsic helper
for rvv, detail: https://github.com/riscv-collab/riscv-gnu-toolchain/issues/1106
GitOrigin-RevId: 27615584c0
|
2 years ago |
Megvii Engine Team
|
399db31aab
|
fix(dnn): fix build
GitOrigin-RevId: d91077248a
|
2 years ago |
Megvii Engine Team
|
f31e52d521
|
feat(mgb): warpperspective support multi src input
GitOrigin-RevId: 0887656864
|
2 years ago |
Megvii Engine Team
|
669816e291
|
feat(dnn): warpperspective support multi src input
GitOrigin-RevId: 8a4789852e
|
2 years ago |
Megvii Engine Team
|
1b94380794
|
fix(dnn): fix reduce sum/mean error when b is large
GitOrigin-RevId: d1bae619b1
|
2 years ago |
Megvii Engine Team
|
c7a9909839
|
feat(cuda): add int4 ptx 256x64 mma kernel
GitOrigin-RevId: 8f7475b0f6
|
2 years ago |
Megvii Engine Team
|
cf3ca1e9a2
|
feat(cuda): add int4 ptx 128x256 mma kernel
GitOrigin-RevId: 1ae7c9f034
|
2 years ago |
Megvii Engine Team
|
1f8e930e28
|
feat(cuda): add int4 ptx 128x128 mma kernel
GitOrigin-RevId: 5a8b9c3f8e
|
2 years ago |
Megvii Engine Team
|
1a2ed8c47b
|
feat(cuda): add convbias ptx algo testcase
GitOrigin-RevId: 9ad6d4561f
|
2 years ago |
Megvii Engine Team
|
64551105f9
|
feat(cuda): add convbias ptx algo
GitOrigin-RevId: 08e9f66641
|
2 years ago |
Megvii Engine Team
|
8395a459b5
|
fix(dnn/fallback): fix naive shift multidefination error and optimize GiCvtFromInt32V4ToUint8
GitOrigin-RevId: 6660c35214
|
2 years ago |
Megvii Engine Team
|
23a3d13350
|
fix(dnn/softmax): create redcue and elemwise opr when get workspace size
GitOrigin-RevId: 476a39bdd3
|
2 years ago |
Megvii Engine Team
|
b3a7d149a0
|
feat(dnn/fallback): add some new gi api
GitOrigin-RevId: 4aede0ac6a
|
2 years ago |
Megvii Engine Team
|
fac67e7c2b
|
feat(gopt): support nchw44 global pooling with fuse_grain
GitOrigin-RevId: 4c43a149f8
|
2 years ago |
Megvii Engine Team
|
43bd949af0
|
fix(dnn): fix cudnn include
GitOrigin-RevId: f6f9731c3e
|
2 years ago |
Megvii Engine Team
|
8abc3ab8fc
|
fix(imperative): fix convolution in rocm
GitOrigin-RevId: 9e97099fd5
|
2 years ago |
Megvii Engine Team
|
5f86368219
|
Revert "feat(dnn): add elemwise modes"
This reverts commit cb713ddb24 .
GitOrigin-RevId: 02adf025e6
|
2 years ago |
Megvii Engine Team
|
d2a1905ad5
|
Revert "feat(mgb): add cumprod opr"
This reverts commit 3436c3bdaa .
GitOrigin-RevId: 95ab3d1aa7
|
2 years ago |
Megvii Engine Team
|
49e14f87b5
|
feat(mgb): add cumprod opr
GitOrigin-RevId: 3436c3bdaa
|
3 years ago |
Megvii Engine Team
|
87aedc2991
|
feat(dnn): add elemwise modes
GitOrigin-RevId: cb713ddb24
|
3 years ago |
Megvii Engine Team
|
25e89d68b0
|
feat(gi/rvv): remove winograd rvv do not use FIXLEN workaround
GitOrigin-RevId: fce5103088
|
2 years ago |
Megvii Engine Team
|
b3f46734e7
|
feat(megdnn/softmax): add softmax operator in fallback
GitOrigin-RevId: 97bc32f561
|
3 years ago |
Megvii Engine Team
|
c49d3070ba
|
refactor(imperative/ops): extends DnnOprCaller with template
GitOrigin-RevId: 402cba209a
|
2 years ago |
Megvii Engine Team
|
f5597d9a10
|
fix(mgb): make error infomation of input channel mismatch more readable
GitOrigin-RevId: 6f95260070
|
2 years ago |
Megvii Engine Team
|
38bd599911
|
fix(mgb): make error infomation of invalid MatMul more readable
GitOrigin-RevId: 96b922dd20
|
2 years ago |
Megvii Engine Team
|
e0d505e6bd
|
fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86
GitOrigin-RevId: 91d7ac1927
|
2 years ago |
Megvii Engine Team
|
58ba080d5f
|
feat(x86/rvv): make gi conv algo adapt to vv and vf model
GitOrigin-RevId: f29593be4d
|
2 years ago |
Megvii Engine Team
|
bd50e457ee
|
feat(x86/rvv): make MATRIX_MUL_GI_F32_4x12 and FP32_GEMV_MK4_GI
adapt to vv and vf model
GitOrigin-RevId: 691434c598
|
2 years ago |
Megvii Engine Team
|
5c3b4e9584
|
feat(x86/rvv): opt AlgoFP32WinogradF63_4x4_NCHW44
GitOrigin-RevId: 0cd0089982
|
2 years ago |
Megvii Engine Team
|
fa59a7b061
|
feat(x86/rvv): opt AlgoF32DirectNCHWNCHW44
and opt GiMaximumFloat32/GiMinimumFloat32 on x86
GitOrigin-RevId: 825021e867
|
2 years ago |
Megvii Engine Team
|
0d82e9b72b
|
feat(x86/rvv): opt FB_GI_F32_MK4_4x8
GitOrigin-RevId: 9e17de18b4
|
2 years ago |
Megvii Engine Team
|
a54d9cb9cd
|
feat(x86/rvv): opt FB_GI_F32_MK4_PACK_4x12 algo
GitOrigin-RevId: a80805c119
|
2 years ago |
Megvii Engine Team
|
247e2f59a4
|
feat(mgb/dnn): add modes that the output type is bool in elemwise
GitOrigin-RevId: fd0134fca2
|
3 years ago |
Megvii Engine Team
|
16ba05a81b
|
fix(dnn): fix dnn nchwxx elemwise performance
GitOrigin-RevId: 5a715d7b2a
|
2 years ago |
Megvii Engine Team
|
7b17c1180e
|
refactor(dnn): make cudnn_frontend work
GitOrigin-RevId: f089f93494
|
3 years ago |
Megvii Engine Team
|
35e9cc9845
|
feat(dnn/cuda): add cudnn frontend api
GitOrigin-RevId: 9b18a57893
|
3 years ago |
Megvii Engine Team
|
ab8f6398d9
|
fix(test): make test install
GitOrigin-RevId: e38d6c5e9f
|
3 years ago |
Megvii Engine Team
|
99cfefbfe0
|
fix(test): fix test copybara
GitOrigin-RevId: 19b7bdf377
|
3 years ago |
Megvii Engine Team
|
0d7ace15c8
|
fix(mgb/dnn): suport fp16 for resize nhwc
GitOrigin-RevId: bb04d2a801
|
3 years ago |
Megvii Engine Team
|
f12b75c04b
|
perf(dnn/fallback): optimize some corner case in reduce
GitOrigin-RevId: 1185594301
|
3 years ago |
Megvii Engine Team
|
7f02407281
|
perf(dnn): speed up pad kernel
GitOrigin-RevId: 33db700687
|
3 years ago |
Megvii Engine Team
|
2886245bb1
|
perf(imperative/src): improve pad host performance
GitOrigin-RevId: 05223deca7
|
3 years ago |
Megvii Engine Team
|
b55942a94d
|
feat(dnn/naive/norm,-dnn/cuda/norm,-dnn/test/norm): add norm dnn opr,
fwd only
GitOrigin-RevId: 989474168d
|
3 years ago |