megvii-mge
|
c42ce93705
|
feat(mge/third_party): update cutlass version
|
3 years ago |
温娟
|
9902ccfcb0
|
chore(release): bump version
|
3 years ago |
Megvii Engine Team
|
8e5410e41f
|
feat(cuda): add fp16 compute 16 kernel
GitOrigin-RevId: e03435be02
|
3 years ago |
Megvii Engine Team
|
472e2f9655
|
refactor(cuda): depthwish large kernel
GitOrigin-RevId: dade8710b4
|
3 years ago |
Megvii Engine Team
|
e698ec20c2
|
feat(cuda): float16 depthwise large kernel conv compute fp32
GitOrigin-RevId: 3050d48f26
|
3 years ago |
Megvii Engine Team
|
48406382ce
|
feat(cuda): support float16 depthwise large kernel conv
GitOrigin-RevId: fdc1b15fbc
|
3 years ago |
Megvii Engine Team
|
7042f76b34
|
perf(cuda): speedup conv backward data with small feature map and large filter size
GitOrigin-RevId: 85592bca6b
|
3 years ago |
Megvii Engine Team
|
87a2aeebb1
|
perf(cuda): speedup chanwise conv with small feature map and large filter size
GitOrigin-RevId: e65b2ce856
|
3 years ago |
Megvii Engine Team
|
2293385e93
|
feat(mge): add conv padding mode
GitOrigin-RevId: 147ced856e
|
3 years ago |
Megvii Engine Team
|
afe9c4b50d
|
feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr
GitOrigin-RevId: 932e7689e8
|
3 years ago |
Megvii Engine Team
|
e8a169292f
|
feat(dnn/cuda): add heuristic rule for implicit batched gemm large kernel dwconv2d kernels
GitOrigin-RevId: 2d2c213bfd
|
3 years ago |
Megvii Engine Team
|
38067472d2
|
fix(dnn/cuda): fix ci
GitOrigin-RevId: 8267e5f9dd
|
3 years ago |
Megvii Engine Team
|
1da58ae17a
|
feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels
GitOrigin-RevId: fcb7974d62
|
3 years ago |
Megvii Engine Team
|
96050073a2
|
feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl
GitOrigin-RevId: feb09ebb58
|
3 years ago |
温娟
|
19fe2e94e7
|
chore(release): bump version
|
3 years ago |
Megvii Engine Team
|
1add4517ad
|
test(trace): test subtensor on unknown shape
GitOrigin-RevId: 1b5cfa4e0a
|
3 years ago |
Megvii Engine Team
|
54eef55871
|
fix(trace): assume result is not scalar when shape is valid
GitOrigin-RevId: beee2d0f28
|
3 years ago |
Megvii Engine Team
|
84d99d1cc4
|
fix(traced_module): fix Module compatible issue and traced module getattr check
GitOrigin-RevId: 62eb3bfb10
|
3 years ago |
Megvii Engine Team
|
275b63114d
|
fix(imperative): fix use collections error from python3.10
GitOrigin-RevId: 5dd019b336
|
3 years ago |
Megvii Engine Team
|
95ac055538
|
feat(dnn,mgb,imperative): add diag opr implement
GitOrigin-RevId: 43016ffa2b
|
3 years ago |
Megvii Engine Team
|
39d77fb55a
|
feat(arm): add arm rnn_cell/lstm_cell/lstm optimized kernel
GitOrigin-RevId: b9bb7352bc
|
3 years ago |
Megvii Engine Team
|
3ddc32d3e3
|
feat(android/whl): support android whl
GitOrigin-RevId: 05df16b494
|
3 years ago |
Megvii Engine Team
|
f509b1be9b
|
fix(build): split elemwise_multi_type cpp
GitOrigin-RevId: 13267e9db6
|
3 years ago |
Megvii Engine Team
|
3252016e05
|
Merge pull request #401 from LosReturn:patch-1
GitOrigin-RevId: 440af8bd3d
|
3 years ago |
Megvii Engine Team
|
f7e034b506
|
feat(lite): add global layout transform python interface for lite
GitOrigin-RevId: f159f49208
|
3 years ago |
Megvii Engine Team
|
e70c07a223
|
feat(lite): add global layout transform c/c++ interface for lite
GitOrigin-RevId: 36a4b26b42
|
3 years ago |
Megvii Engine Team
|
86ee4638bf
|
Merge pull request #402 from AA1HSHH:docstring-reshape
GitOrigin-RevId: 1ec572eb7c
|
3 years ago |
Megvii Engine Team
|
3251f50114
|
fix(mgb/cuda-stub): add libcuda-wrap_11.4.h to fit the CUDA11.4 toolchain
GitOrigin-RevId: efa38f00d1
|
3 years ago |
Megvii Engine Team
|
2c2df83051
|
fix(cmake): enable custom op when building develop to avoid the pytest fail
GitOrigin-RevId: fa05ead899
|
3 years ago |
Megvii Engine Team
|
ee0b95e935
|
feat(dnn/elemwise/arm_common): support part of arm ternary elemwise multithread
BCAST111C_VEC_BCAST111C and BCAST101_VEC_BCAST101
GitOrigin-RevId: 0e26553c90
|
3 years ago |
Megvii Engine Team
|
7ea104d788
|
Revert "fix(mge): replace _full_sync by sync"
This reverts commit e36ef45464
GitOrigin-RevId: 2d913c8ac9
|
3 years ago |
Megvii Engine Team
|
cbbca5fb10
|
feat(mge): add softmax op use cudnn api
GitOrigin-RevId: 7734ebf8c4
|
3 years ago |
Megvii Engine Team
|
1d2510b6d7
|
fix(module): fix module dumped in old version without _short_name attr
GitOrigin-RevId: a1c815f613
|
3 years ago |
Megvii Engine Team
|
cf5e9488bb
|
fix(traced_module): fix module trace transformation
GitOrigin-RevId: ce11fe5e09
|
3 years ago |
Megvii Engine Team
|
97c90d9137
|
feat(traced_module): add _exclude_from_trace
GitOrigin-RevId: 615b769a02
|
3 years ago |
Megvii Engine Team
|
30e565e5b8
|
fix(traced_module): fix error message
GitOrigin-RevId: 3046225e30
|
3 years ago |
Megvii Engine Team
|
de8ffe0c12
|
refactor(imperative): unify interpreter option setting
GitOrigin-RevId: 53510445cc
|
3 years ago |
Megvii Engine Team
|
8b60bdfa10
|
fix(mge): replace _full_sync by sync
GitOrigin-RevId: e36ef45464
|
3 years ago |
Megvii Engine Team
|
20b42a8c3b
|
fix(dnn): add naive lstm kernel
GitOrigin-RevId: f08ef810cf
|
3 years ago |
Megvii Engine Team
|
2faa6ea5a9
|
Merge pull request #213 from kxz18:rnn
GitOrigin-RevId: 9e9215c115
|
3 years ago |
Megvii Engine Team
|
f5b8fec4ca
|
fix(imperative): remove big tensor from host side
GitOrigin-RevId: 2047982d73
|
3 years ago |
Megvii Engine Team
|
68cde8734e
|
fix(mge/imperative): support broadcast with None
GitOrigin-RevId: dd330a2a1d
|
3 years ago |
Megvii Engine Team
|
0bdd0b1467
|
refactor(dispatch): switch to new dispatch system
GitOrigin-RevId: 32dd49a23a
|
3 years ago |
Megvii Engine Team
|
d3689c3f3c
|
feat(imperative/python): add transformation manager
GitOrigin-RevId: a3c1732ffd
|
3 years ago |
Megvii Engine Team
|
9ce1f0f5d1
|
refactor(dispatch): implement grad
GitOrigin-RevId: d8367f9587
|
3 years ago |
Megvii Engine Team
|
c609c031f1
|
refactor(dispatch): implement symbol
GitOrigin-RevId: c7bd86f5c1
|
3 years ago |
Megvii Engine Team
|
e32929dfd2
|
refactor(dispatch): implement scalar
GitOrigin-RevId: b244c2ca1a
|
3 years ago |
Megvii Engine Team
|
59084fa857
|
refactor(dispatch): implement lazy_eval
GitOrigin-RevId: 4e3f3a1c44
|
3 years ago |
Megvii Engine Team
|
d2b67c2a88
|
refactor(dispatch): implement trace
GitOrigin-RevId: f8d3005732
|
3 years ago |
Megvii Engine Team
|
39ac606b9c
|
refactor(dispatch): implement eval
GitOrigin-RevId: 32563e0a27
|
3 years ago |