Megvii Engine Team
|
758549b936
|
feat(megengine): support tx2
GitOrigin-RevId: d1175a1f4a
|
3 years ago |
Megvii Engine Team
|
b6ad457269
|
feat(cuda): support int1 simplewq conv
GitOrigin-RevId: 9c37c41bc7
|
3 years ago |
Megvii Engine Team
|
331567af5d
|
fix(opencl/ci): misc opt and fix:
1: fix megbrain test failure on mali 2.1 devices
2: reduce ci time (by about 20min)
GitOrigin-RevId: 4dcdcd48a6
|
3 years ago |
Megvii Engine Team
|
ff6a3bb819
|
fix(fallback): delete the repeat opcaller in fallback and arm_common
GitOrigin-RevId: 87046b8197
|
3 years ago |
Megvii Engine Team
|
547945e854
|
feat(fallback): support general intrinsic in elemwise in fallback
GitOrigin-RevId: 96ff2e88cc
|
3 years ago |
Megvii Engine Team
|
fd6f8e58b0
|
feat(mgb/dtype): add dtype qint1
GitOrigin-RevId: abe9fb68b1
|
3 years ago |
Megvii Engine Team
|
8c415f4ed7
|
feat(dnn): cuda nhwc nearest resize supports channel counts other than 1 or 3
GitOrigin-RevId: 764504c341
|
3 years ago |
Megvii Engine Team
|
87de704a46
|
feat(gopt): fuse conv h_swish
GitOrigin-RevId: a3d12991fb
|
3 years ago |
Megvii Engine Team
|
04193e3bd1
|
feat(dnn): add nearest mode for remap and resize
GitOrigin-RevId: 31e7b72a78
|
3 years ago |
Megvii Engine Team
|
e34a642b31
|
feat(fallback): reduce support general intrinsic
GitOrigin-RevId: f250aa7b2a
|
3 years ago |
Megvii Engine Team
|
d7b0994a3e
|
feat(cuda): add fp16 compute 16 kernel
GitOrigin-RevId: e03435be02
|
3 years ago |
Megvii Engine Team
|
8a2e92bd6c
|
refactor(cuda): depthwise large kernel
GitOrigin-RevId: dade8710b4
|
3 years ago |
Megvii Engine Team
|
6b8a69d5b6
|
feat(cuda): float16 depthwise large kernel conv compute fp32
GitOrigin-RevId: 3050d48f26
|
3 years ago |
Megvii Engine Team
|
bc385b5374
|
feat(cuda): support float16 depthwise large kernel conv
GitOrigin-RevId: fdc1b15fbc
|
3 years ago |
Megvii Engine Team
|
7d2063e35a
|
perf(cuda): speedup conv backward data with small feature map and large filter size
GitOrigin-RevId: 85592bca6b
|
3 years ago |
Megvii Engine Team
|
72403e8929
|
perf(cuda): speedup chanwise conv with small feature map and large filter size
GitOrigin-RevId: e65b2ce856
|
3 years ago |
Megvii Engine Team
|
ab6d12caff
|
feat(mge): add conv padding mode
GitOrigin-RevId: 147ced856e
|
3 years ago |
Megvii Engine Team
|
47fe766310
|
feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr
GitOrigin-RevId: 932e7689e8
|
3 years ago |
Megvii Engine Team
|
6cefabe734
|
fix(dnn/cuda): fix ci
GitOrigin-RevId: 8267e5f9dd
|
3 years ago |
Megvii Engine Team
|
888f4e46ae
|
feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels
GitOrigin-RevId: fcb7974d62
|
3 years ago |
Megvii Engine Team
|
08d8635ff5
|
feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl
GitOrigin-RevId: feb09ebb58
|
3 years ago |
Megvii Engine Team
|
260923e11c
|
perf(aarch64): optimize aarch64 uint16 relayout with block_w==3
GitOrigin-RevId: fe6aaaac0c
|
3 years ago |
Megvii Engine Team
|
95ac055538
|
feat(dnn,mgb,imperative): add diag opr implement
GitOrigin-RevId: 43016ffa2b
|
3 years ago |
Megvii Engine Team
|
39d77fb55a
|
feat(arm): add arm rnn_cell/lstm_cell/lstm optimized kernel
GitOrigin-RevId: b9bb7352bc
|
3 years ago |
Megvii Engine Team
|
ee0b95e935
|
feat(dnn/elemwise/arm_common): support part of arm ternary elemwise multithread
BCAST111C_VEC_BCAST111C and BCAST101_VEC_BCAST101
GitOrigin-RevId: 0e26553c90
|
3 years ago |
Megvii Engine Team
|
cbbca5fb10
|
feat(mge): add softmax op use cudnn api
GitOrigin-RevId: 7734ebf8c4
|
3 years ago |
Megvii Engine Team
|
20b42a8c3b
|
fix(dnn): add naive lstm kernel
GitOrigin-RevId: f08ef810cf
|
3 years ago |
Megvii Engine Team
|
2faa6ea5a9
|
Merge pull request #213 from kxz18:rnn
GitOrigin-RevId: 9e9215c115
|
3 years ago |
Megvii Engine Team
|
82be0aaced
|
test(dnn): fix compute capability requirement for NCHWX test
GitOrigin-RevId: d2f8022be1
|
3 years ago |
Megvii Engine Team
|
1999307015
|
feat(mgb/opr): add dropout kernel
GitOrigin-RevId: d248bd2005
|
3 years ago |
Megvii Engine Team
|
a93741815b
|
feat(mgb/opr): add layernorm forward and backward kernel
GitOrigin-RevId: 0cd484e753
|
3 years ago |
Megvii Engine Team
|
c53cad2049
|
feat(cmake): format all cmake file
GitOrigin-RevId: 0a4ecab99b
|
3 years ago |
Megvii Engine Team
|
c90e0b54be
|
perf(arm): optimize arm uint16 relayout with n=4
GitOrigin-RevId: 5779c6b9c1
|
3 years ago |
Megvii Engine Team
|
f6d9909460
|
feat(dnn): add elemwise multi type support i16xf32 and u8xf32
GitOrigin-RevId: 2fe469bb4e
|
3 years ago |
kxz@thumt102-1
|
8f48da7ffe
|
feat(mgb/opr): add cell level rnn/lstm and sequence level rnn/lstm
|
3 years ago |
Megvii Engine Team
|
6bb5409976
|
feat(dnn/src): add images2neibs kernel of opencl and related test
GitOrigin-RevId: 82242b7437
|
3 years ago |
Megvii Engine Team
|
c96dbd29b8
|
fix(dnn/arm_common): support more monotonous case in arm typecvt for performance
GitOrigin-RevId: 9e28a64d93
|
3 years ago |
Megvii Engine Team
|
02d5f46d90
|
fix(mgb/x86): fix convbias crash on X86
GitOrigin-RevId: cc7283c6a2
|
3 years ago |
Megvii Engine Team
|
2696e4efaa
|
feat(dnn): add float16 for remap backward
GitOrigin-RevId: 0263030051
|
3 years ago |
Megvii Engine Team
|
11d75fecb5
|
feat(dnn/check_non_finite): add batch check_non_finite
GitOrigin-RevId: e108133282
|
3 years ago |
Megvii Engine Team
|
2318ea3f15
|
fix(dnn): fix naive average pooling overflow bug for int8 type
GitOrigin-RevId: b60a7b6cf8
|
3 years ago |
Megvii Engine Team
|
ba2f0c2e48
|
fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases
GitOrigin-RevId: b29b009de0
|
3 years ago |
Megvii Engine Team
|
b59e8ccf24
|
fix(mgb): fix cambricon bangc copybara
GitOrigin-RevId: 22b57364e8
|
3 years ago |
Megvii Engine Team
|
3116e128c5
|
fix(ci/integration_test): fix benchmark torch version
GitOrigin-RevId: bd964ed505
|
3 years ago |
Megvii Engine Team
|
c85631aa77
|
feat(dnn): use ref ptr interface for all backends
GitOrigin-RevId: f65feae5cc
|
3 years ago |
Megvii Engine Team
|
89186edc5d
|
fix(dnn): correct reduce/argmxx/fakequant calculation with nan
GitOrigin-RevId: 7e78bdae91
|
3 years ago |
Megvii Engine Team
|
68cdabd288
|
feat(opr): indexing_multi_axis_vec support nd index
GitOrigin-RevId: 07b1248bdc
|
3 years ago |
Megvii Engine Team
|
a1cba6cc27
|
fix(dnn): fix convbias crash on X86
GitOrigin-RevId: 3b33c83d03
|
3 years ago |
Megvii Engine Team
|
9b4cd92ba3
|
fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256
GitOrigin-RevId: 20c0b90575
|
3 years ago |
Megvii Engine Team
|
c48d58daa8
|
feat(dnn/arm_common): add N1HW like elemwise broadcast mode
GitOrigin-RevId: 2895135801
|
3 years ago |