Megvii Engine Team
8e5410e41f
feat(cuda): add fp16 compute 16 kernel
GitOrigin-RevId: e03435be02
3 years ago
Megvii Engine Team
472e2f9655
refactor(cuda): depthwish large kernel
GitOrigin-RevId: dade8710b4
3 years ago
Megvii Engine Team
e698ec20c2
feat(cuda): float16 depthwise large kernel conv compute fp32
GitOrigin-RevId: 3050d48f26
3 years ago
Megvii Engine Team
48406382ce
feat(cuda): support float16 depthwise large kernel conv
GitOrigin-RevId: fdc1b15fbc
3 years ago
Megvii Engine Team
7042f76b34
perf(cuda): speedup conv backward data with small feature map and large filter size
GitOrigin-RevId: 85592bca6b
3 years ago
Megvii Engine Team
87a2aeebb1
perf(cuda): speedup chanwise conv with small feature map and large filter size
GitOrigin-RevId: e65b2ce856
3 years ago
Megvii Engine Team
2293385e93
feat(mge): add conv padding mode
GitOrigin-RevId: 147ced856e
3 years ago
Megvii Engine Team
afe9c4b50d
feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr
GitOrigin-RevId: 932e7689e8
3 years ago
Megvii Engine Team
38067472d2
fix(dnn/cuda): fix ci
GitOrigin-RevId: 8267e5f9dd
3 years ago
Megvii Engine Team
1da58ae17a
feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels
GitOrigin-RevId: fcb7974d62
3 years ago
Megvii Engine Team
96050073a2
feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl
GitOrigin-RevId: feb09ebb58
3 years ago
Megvii Engine Team
95ac055538
feat(dnn,mgb,imperative): add diag opr implement
GitOrigin-RevId: 43016ffa2b
3 years ago
Megvii Engine Team
39d77fb55a
feat(arm): add arm rnn_cell/lstm_cell/lstm optimized kernel
GitOrigin-RevId: b9bb7352bc
3 years ago
Megvii Engine Team
ee0b95e935
feat(dnn/elemwise/arm_common): support part of arm ternary elemwise multithread
BCAST111C_VEC_BCAST111C and BCAST101_VEC_BCAST101
GitOrigin-RevId: 0e26553c90
3 years ago
Megvii Engine Team
cbbca5fb10
feat(mge): add softmax op use cudnn api
GitOrigin-RevId: 7734ebf8c4
3 years ago
Megvii Engine Team
20b42a8c3b
fix(dnn): add naive lstm kernel
GitOrigin-RevId: f08ef810cf
3 years ago
Megvii Engine Team
2faa6ea5a9
Merge pull request #213 from kxz18:rnn
GitOrigin-RevId: 9e9215c115
3 years ago
Megvii Engine Team
82be0aaced
test(dnn): fix compute capability requirement for NCHWX test
GitOrigin-RevId: d2f8022be1
3 years ago
Megvii Engine Team
1999307015
feat(mgb/opr): add dropout kernel
GitOrigin-RevId: d248bd2005
3 years ago
Megvii Engine Team
a93741815b
feat(mgb/opr): add layernorm forward and backward kernel
GitOrigin-RevId: 0cd484e753
3 years ago
Megvii Engine Team
c53cad2049
feat(cmake): format all cmake file
GitOrigin-RevId: 0a4ecab99b
3 years ago
Megvii Engine Team
c90e0b54be
perf(arm): optimize arm uint16 relayout with n=4
GitOrigin-RevId: 5779c6b9c1
3 years ago
Megvii Engine Team
f6d9909460
feat(dnn): add elemwise multi type support i16xf32 and u8xf32
GitOrigin-RevId: 2fe469bb4e
3 years ago
kxz@thumt102-1
8f48da7ffe
feat(mgb/opr): add cell level rnn/lstm and sequence level rnn/lstm
3 years ago
Megvii Engine Team
6bb5409976
feat(dnn/src): add images2neibs kernel of opencl and related test
GitOrigin-RevId: 82242b7437
3 years ago
Megvii Engine Team
c96dbd29b8
fix(dnn/arm_common): support more monotonous case in arm typecvt for performance
GitOrigin-RevId: 9e28a64d93
3 years ago
Megvii Engine Team
02d5f46d90
fix(mgb/x86): fix convbias crash on X86
GitOrigin-RevId: cc7283c6a2
3 years ago
Megvii Engine Team
2696e4efaa
feat(dnn): add float16 for remap backward
GitOrigin-RevId: 0263030051
3 years ago
Megvii Engine Team
11d75fecb5
feat(dnn/check_non_finite): add batch check_non_finite
GitOrigin-RevId: e108133282
3 years ago
Megvii Engine Team
2318ea3f15
fix(dnn): fix naive average pooling overflow bug for int8 type
GitOrigin-RevId: b60a7b6cf8
3 years ago
Megvii Engine Team
ba2f0c2e48
fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases
GitOrigin-RevId: b29b009de0
3 years ago
Megvii Engine Team
b59e8ccf24
fix(mgb): fix cambricon bangc copybara
GitOrigin-RevId: 22b57364e8
3 years ago
Megvii Engine Team
3116e128c5
fix(ci/integration_test): fix benchmark torch version
GitOrigin-RevId: bd964ed505
3 years ago
Megvii Engine Team
c85631aa77
feat(dnn): use ref ptr interface for all backends
GitOrigin-RevId: f65feae5cc
3 years ago
Megvii Engine Team
89186edc5d
fix(dnn): correct reduce/argmxx/fakequant calculation with nan
GitOrigin-RevId: 7e78bdae91
3 years ago
Megvii Engine Team
68cdabd288
feat(opr): indexing_multi_axis_vec support nd index
GitOrigin-RevId: 07b1248bdc
3 years ago
Megvii Engine Team
a1cba6cc27
fix(dnn): fix convbias crash on X86
GitOrigin-RevId: 3b33c83d03
3 years ago
Megvii Engine Team
9b4cd92ba3
fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256
GitOrigin-RevId: 20c0b90575
3 years ago
Megvii Engine Team
c48d58daa8
feat(dnn/arm_common): add N1HW like elemwise broadcast mode
GitOrigin-RevId: 2895135801
3 years ago
Megvii Engine Team
26634db7a8
fix(dnn): support relayout for non-contigous layout
GitOrigin-RevId: 44a0adddba
3 years ago
Megvii Engine Team
056fd6bc59
feat(dnn/arm64): support stride_m in arm64 relayout
GitOrigin-RevId: c74193a23d
3 years ago
liuke
b0ba6d3201
Merge pull request #207 from togetherwhenyouwant:feat-x86-matmul-6x16x2
GitOrigin-RevId: 148ae44ba0
3 years ago
Megvii Engine Team
10af44abba
fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout
the conv_bias algo *_IMPLICIT_GEMM in cudnn less than 8.0.0 is disabled due to the incorrect result for int8x4->f32 configs
GitOrigin-RevId: 7cc52d0a85
3 years ago
Megvii Engine Team
5885b137fa
feat(dnn/arm): support layout like NHWC channel like broadcast on arm
GitOrigin-RevId: fb4300004c
3 years ago
Megvii Engine Team
369c2ccc5a
style(all): reformat c++ code
GitOrigin-RevId: 3ffd1b211f
3 years ago
zjl
d2184af3b2
feat(dnn/src/x86/matmul): add matmul_6x16 for x86
3 years ago
Megvii Engine Team
177dec94c5
feat(mgb/opr): add bgr2gray mode for cvtcolor opr
GitOrigin-RevId: d50415b236
3 years ago
Megvii Engine Team
f5cb21ed3a
fix(mgb/opr): add non finite check
GitOrigin-RevId: a9fcd0a350
3 years ago
Megvii Engine Team
bde5cf3564
feat(dnn): add resize linear for arm
GitOrigin-RevId: 14ac5bda3f
3 years ago
Megvii Engine Team
3344b580a9
feat(dnn): add elemwise for nchw88+fp16
GitOrigin-RevId: 63587975f8
3 years ago