Megvii Engine Team
|
4e9b1c4eee
|
feat(dnn): add rrconv wgrad, support int32 and uint8 region mask
GitOrigin-RevId: 0da9b3bca8
|
2 years ago |
Megvii Engine Team
|
421bcfd3d8
|
style(mgb/tools): add format for tools, dnn and ci
GitOrigin-RevId: 5684e5ea43
|
3 years ago |
Megvii Engine Team
|
e0d505e6bd
|
fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86
GitOrigin-RevId: 91d7ac1927
|
2 years ago |
Megvii Engine Team
|
81065cf00e
|
build(mgb/cutlass): merge partial headers
GitOrigin-RevId: 1bc2af604b
|
3 years ago |
Megvii Engine Team
|
47fe766310
|
feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr
GitOrigin-RevId: 932e7689e8
|
3 years ago |
Megvii Engine Team
|
888f4e46ae
|
feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels
GitOrigin-RevId: fcb7974d62
|
3 years ago |
Megvii Engine Team
|
08d8635ff5
|
feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl
GitOrigin-RevId: feb09ebb58
|
3 years ago |
Megvii Engine Team
|
4c13bc7e1b
|
feat(dnn/cuda): add nhwc int8 deconv
GitOrigin-RevId: ad361a0f81
|
3 years ago |
Megvii Engine Team
|
11f022ff7c
|
feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt
GitOrigin-RevId: 229e1eb4be
|
3 years ago |
Megvii Engine Team
|
ff0e6be7b9
|
fix(dnn/cuda): fix cutlass tensorop kernels
do not compile cutlass tensorop kernels, when using cuda version less than 10.2
GitOrigin-RevId: d4c37d5f41
|
3 years ago |
Megvii Engine Team
|
336761253d
|
feat(dnn/cuda): add tensorcore matmul for fp16 data type
GitOrigin-RevId: 025c591f75
|
3 years ago |
Megvii Engine Team
|
2c4ee99227
|
fix(dnn): short cutlass filename in windows
GitOrigin-RevId: 83a43fdf87
|
3 years ago |
Megvii Engine Team
|
9b4b910dc1
|
feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers
GitOrigin-RevId: 2a70335441
|
3 years ago |
Megvii Engine Team
|
b18feaab33
|
feat(dnn/cuda): use cutlass remove shared load imma conv kernel
GitOrigin-RevId: 0b5574f526
|
4 years ago |
Megvii Engine Team
|
4eda338876
|
feat(dnn/cuda): generate cutlass kimpls using cmake and bazel
GitOrigin-RevId: da3bcfb85a
|
4 years ago |