Megvii Engine Team
|
287cab49c2
|
fix(mgb/sereg): fix rng operator compatibility
GitOrigin-RevId: 66d1694035
|
3 years ago |
Megvii Engine Team
|
2aba0378b9
|
refactor(mgb/dnn): fix group conv is_available
GitOrigin-RevId: b279909168
|
3 years ago |
Megvii Engine Team
|
4a92346b7a
|
refactor(mgb): refactor group conv3d
GitOrigin-RevId: 15360a3a41
|
3 years ago |
Megvii Engine Team
|
6ce212d2e0
|
refactor(mgb): refactor group conv
GitOrigin-RevId: 7afd312690
|
4 years ago |
Megvii Engine Team
|
f76a2cc2c6
|
feat(mge/opr): add silu and gelu
GitOrigin-RevId: 75aa42947e
|
3 years ago |
Megvii Engine Team
|
f8b0f2cb91
|
build(dnn/cutlass): fix build for cutlass
GitOrigin-RevId: 9aa095fe84
|
3 years ago |
Megvii Engine Team
|
869a03271b
|
perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance
GitOrigin-RevId: d1b95a6f01
|
3 years ago |
Megvii Engine Team
|
239916a997
|
fix(mgb/gopt): fix testcase for enable nchw64 pass
GitOrigin-RevId: 2ae8d1608d
|
4 years ago |
Megvii Engine Team
|
4eda338876
|
feat(dnn/cuda): generate cutlass kimpls using cmake and bazel
GitOrigin-RevId: da3bcfb85a
|
4 years ago |
Megvii Engine Team
|
8d248a6a9a
|
fix(dnn/cuda): fix testcase for fallback nchw qs8 conv
GitOrigin-RevId: 646440db59
|
4 years ago |
Megvii Engine Team
|
894a2407c2
|
feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc
GitOrigin-RevId: e11f3e5408
|
4 years ago |
Megvii Engine Team
|
43c59204df
|
refactor(dnn/cuda): refactor relayout format kernels
GitOrigin-RevId: ab86e66533
|
4 years ago |
Megvii Engine Team
|
f41a808694
|
feat(dnn/cuda): add nhwc int4 conv support
GitOrigin-RevId: 5236b235d0
|
4 years ago |
Megvii Engine Team
|
5a14a89224
|
refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv
GitOrigin-RevId: 11d78ab227
|
4 years ago |
Megvii Engine Team
|
b33217d8f0
|
refactor(dnn/cuda): refactor cutlass kernel generator for deconv operation
GitOrigin-RevId: 88e962a912
|
4 years ago |
Megvii Engine Team
|
4abf7bd36f
|
refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels
GitOrigin-RevId: 7882f9c68c
|
4 years ago |
Megvii Engine Team
|
b4687ce8da
|
feat(dnn/cuda): add convolution with i8 input and u4 output
GitOrigin-RevId: 8be439abf1
|
4 years ago |
Megvii Engine Team
|
00083d13b6
|
fix(dnn/cuda): fix recursive algo search for fallback_nchw_qs8
GitOrigin-RevId: 6be2991224
|
4 years ago |
Megvii Engine Team
|
66f70578c2
|
feat(dnn/cuda): add convolution with i8 input and i4 output
GitOrigin-RevId: 10512645d5
|
4 years ago |
Megvii Engine Team
|
7d3df995cb
|
feat(gopt/inference): allow Float32 output dtype in EnableNCHW4Pass
GitOrigin-RevId: 81100dbaf7
|
4 years ago |
Megvii Engine Team
|
633016a962
|
fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst
GitOrigin-RevId: 06f90f5cf3
|
4 years ago |
Megvii Engine Team
|
4e4497b903
|
refactor(mgb/dnn): x86 pooling rebase algochooser
GitOrigin-RevId: 96cdc57180
|
3 years ago |
Megvii Engine Team
|
a33c3b73bd
|
refactor(mgb/dnn): arm pooling rebase algochooser
GitOrigin-RevId: 21d17e647a
|
3 years ago |
Megvii Engine Team
|
ea70d99b4d
|
fix(mge/convbias): make fallback convbias support nhwcd4 layout
GitOrigin-RevId: 1c306f867d
|
4 years ago |
Megvii Engine Team
|
43098fb8f1
|
feat(mge): add SlidingWindowTranspose opr
BREAKING CHANGE:
GitOrigin-RevId: 54d726d2fe
|
4 years ago |
Megvii Engine Team
|
b078dda90b
|
feat(mge/random): add some random op and remove random/distrbution.py
GitOrigin-RevId: 4c05ebc266
|
4 years ago |
Megvii Engine Team
|
83e4c9d7ab
|
fix(opencl): open opencl topk test when opencl beyond 2.0
GitOrigin-RevId: f2ad6b4af2
|
4 years ago |
Megvii Engine Team
|
f30c0e06a6
|
feat(mgb/opr): add lsq opr
GitOrigin-RevId: 45494a2b57
|
4 years ago |
Megvii Engine Team
|
25932352e9
|
refactor(mgb/dnn): rocm pooling rebase algochooser
GitOrigin-RevId: 95be929841
|
4 years ago |
Megvii Engine Team
|
1cfdbc565c
|
feat(dnn): add deterministic max pooling
GitOrigin-RevId: 9ab4c7a748
|
4 years ago |
Megvii Engine Team
|
20ab82d00c
|
fix(tee): fix tee crash
GitOrigin-RevId: 379f970c87
|
4 years ago |
Megvii Engine Team
|
a5060a2bfe
|
feat(mgb/opr): add check_has_inf kernel and opr
GitOrigin-RevId: 0d042dbfce
|
4 years ago |
Megvii Engine Team
|
3597a6dbd7
|
feat(dnn/arm): nchw_nchw44 conv support 1x1s1
GitOrigin-RevId: 8c8f7d7c76
|
4 years ago |
Megvii Engine Team
|
d915c5a3fd
|
refactor(mgb): make convolution3D handle noncontiguous tensors
GitOrigin-RevId: 3d3c31b021
|
4 years ago |
Megvii Engine Team
|
d04cd67faf
|
refactor(mgb): make conv-backward-filter handle noncontiguous tensors
GitOrigin-RevId: 44c586f912
|
4 years ago |
Megvii Engine Team
|
44376f702a
|
refactor(mgb): make conv-backward-data handle noncontiguous tensors
GitOrigin-RevId: 0a8f66f9d3
|
4 years ago |
Megvii Engine Team
|
7b2a76d1ee
|
refactor(mgb): make conv handle noncontiguous tensors
GitOrigin-RevId: 86282709b3
|
4 years ago |
Megvii Engine Team
|
ca2828ddcb
|
fix(dnn/x86): fix x86 int8 matmul ldc bug
GitOrigin-RevId: 2502f99000
|
4 years ago |
Megvii Engine Team
|
40085acbae
|
fix(mgb): remove unnecessary cudnn8 warning
GitOrigin-RevId: 04cf1bfca9
|
4 years ago |
Megvii Engine Team
|
62bd6c823b
|
feat(cmake/debug): misc for build
* add asan build option
* fix cpuinfo build opt level
* fix host release build with out debug info
* opt "fix lite bazel/cmake symbols MR"
* other misc build opt
GitOrigin-RevId: 6ca286e195
|
4 years ago |
Megvii Engine Team
|
b87af9f77f
|
feat(dnn/cuda): topk support fp16
GitOrigin-RevId: c6610d4cf0
|
4 years ago |
Megvii Engine Team
|
2eea00097c
|
feat(mgb): add fast run batch size graph option
GitOrigin-RevId: 94e333ec80
|
4 years ago |
Megvii Engine Team
|
47dcdf3e17
|
fix(mgb/core): fix dtype and resize modifiers for tensor
GitOrigin-RevId: a9d95a4cd8
|
4 years ago |
Megvii Engine Team
|
71cc814eaf
|
feat(ci): add aarch64 linux ci
GitOrigin-RevId: 2c0d3a8cc2
|
4 years ago |
Megvii Engine Team
|
24a3878130
|
feat(dnn/cuda): add nchw conv u4xs4 support
GitOrigin-RevId: 5edba47bd9
|
4 years ago |
Megvii Engine Team
|
606540bef4
|
feat(dnn/cuda): add nhwc 4bit warp perspective
GitOrigin-RevId: fbec4a4a1f
|
4 years ago |
Megvii Engine Team
|
1e6019436c
|
feat(dnn/cuda): add nhwc int4 pooling
GitOrigin-RevId: 9cf14cde4e
|
4 years ago |
Megvii Engine Team
|
e661ae904f
|
feat(dnn/cuda): add base class for cutlass uint4 and int4 algos
GitOrigin-RevId: a4d42f032c
|
4 years ago |
Megvii Engine Team
|
319436dd14
|
feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias
GitOrigin-RevId: cf4536855a
|
4 years ago |
Megvii Engine Team
|
d28eba4ea5
|
feat(dnn/cuda): add cutlass impls for int4 conv bias
GitOrigin-RevId: 878bb8c955
|
4 years ago |