Megvii Engine Team
cbbca5fb10
feat(mge): add softmax op use cudnn api
GitOrigin-RevId: 7734ebf8c4
3 years ago
Megvii Engine Team
82be0aaced
test(dnn): fix compute capability requirement for NCHWX test
GitOrigin-RevId: d2f8022be1
3 years ago
Megvii Engine Team
1999307015
feat(mgb/opr): add dropout kernel
GitOrigin-RevId: d248bd2005
3 years ago
Megvii Engine Team
a93741815b
feat(mgb/opr): add layernorm forward and backward kernel
GitOrigin-RevId: 0cd484e753
3 years ago
Megvii Engine Team
2696e4efaa
feat(dnn): add float16 for remap backward
GitOrigin-RevId: 0263030051
3 years ago
Megvii Engine Team
11d75fecb5
feat(dnn/check_non_finite): add batch check_non_finite
GitOrigin-RevId: e108133282
3 years ago
Megvii Engine Team
ba2f0c2e48
fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases
GitOrigin-RevId: b29b009de0
3 years ago
Megvii Engine Team
c85631aa77
feat(dnn): use ref ptr interface for all backends
GitOrigin-RevId: f65feae5cc
3 years ago
Megvii Engine Team
89186edc5d
fix(dnn): correct reduce/argmxx/fakequant calculation with nan
GitOrigin-RevId: 7e78bdae91
3 years ago
Megvii Engine Team
68cdabd288
feat(opr): indexing_multi_axis_vec support nd index
GitOrigin-RevId: 07b1248bdc
3 years ago
Megvii Engine Team
9b4cd92ba3
fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256
GitOrigin-RevId: 20c0b90575
3 years ago
Megvii Engine Team
10af44abba
fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout
the conv_bias algo *_IMPLICIT_GEMM in cudnn less than 8.0.0 is disabled due to the incorrect result for int8x4->f32 configs
GitOrigin-RevId: 7cc52d0a85
3 years ago
Megvii Engine Team
5885b137fa
feat(dnn/arm): support layout like NHWC channel like broadcast on arm
GitOrigin-RevId: fb4300004c
3 years ago
Megvii Engine Team
369c2ccc5a
style(all): reformat c++ code
GitOrigin-RevId: 3ffd1b211f
3 years ago
Megvii Engine Team
f5cb21ed3a
fix(mgb/opr): add non finite check
GitOrigin-RevId: a9fcd0a350
3 years ago
Megvii Engine Team
bde5cf3564
feat(dnn): add resize linear for arm
GitOrigin-RevId: 14ac5bda3f
3 years ago
Megvii Engine Team
3d3666b6e0
test(dnn/bn): add compatible configs for NHWC BN
GitOrigin-RevId: ac757ca307
3 years ago
Megvii Engine Team
3977b7aa0b
feat(mgb/shuffle): add shuffle opr
GitOrigin-RevId: 80490a6f84
3 years ago
Megvii Engine Team
17371e79b9
fix(dnn/reduce): fix reduce_mean o16c32 is incorrect for large tensor
GitOrigin-RevId: ebf03d814a
3 years ago
Megvii Engine Team
8b40f57738
feat(mgb/dnn): add conv1x1 algo for matrix mul
GitOrigin-RevId: 585b2c045a
3 years ago
Megvii Engine Team
d69b59035d
feat(dnn): add an get_all_algorithms_safe interface
GitOrigin-RevId: e3734e4531
3 years ago
Megvii Engine Team
8b94f49328
fix(dnn/cuda): fix elemwise and relayout int4 bug when last shape is 1
GitOrigin-RevId: e7d64c4987
3 years ago
Megvii Engine Team
722aecd437
feat(mgb): support fp16 nhwc backward
GitOrigin-RevId: 954ac6405a
3 years ago
Megvii Engine Team
0708bc780c
fix(dnn/cuda): disallow implicit dtype conversion in cublaslt matmul algos
disable tensor op matmul kernels when input and output tensors are in f32 data type to avoid potential accuracy loss
GitOrigin-RevId: 36859cba5a
3 years ago
Megvii Engine Team
4c13bc7e1b
feat(dnn/cuda): add nhwc int8 deconv
GitOrigin-RevId: ad361a0f81
3 years ago
Megvii Engine Team
11f022ff7c
feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt
GitOrigin-RevId: 229e1eb4be
3 years ago
Megvii Engine Team
67575d582c
feat(mge/opr): add interpolate bilinear mode
GitOrigin-RevId: f7023a3fd3
3 years ago
Megvii Engine Team
0558b2123d
feat(mge/opr): add interpolate nearest mode
GitOrigin-RevId: d384b87f50
3 years ago
Megvii Engine Team
c25125e3d2
perf(dnn/cuda): sass int8 epilogue remove shared load
GitOrigin-RevId: 2b49f5069b
3 years ago
Megvii Engine Team
ff0e6be7b9
fix(dnn/cuda): fix cutlass tensorop kernels
do not compile cutlass tensorop kernels, when using cuda version less than 10.2
GitOrigin-RevId: d4c37d5f41
3 years ago
Megvii Engine Team
336761253d
feat(dnn/cuda): add tensorcore matmul for fp16 data type
GitOrigin-RevId: 025c591f75
3 years ago
Megvii Engine Team
eab6afab47
feat(mgb): add padding opr for megbrain
GitOrigin-RevId: 490e0c5d5a
4 years ago
Megvii Engine Team
b18feaab33
feat(dnn/cuda): use cutlass remove shared load imma conv kernel
GitOrigin-RevId: 0b5574f526
4 years ago
Megvii Engine Team
1af350c6d2
feat(dnn): add fill kernel
GitOrigin-RevId: d2cee3a7a0
3 years ago
Megvii Engine Team
3eb0505f9b
feat(imperative): add support for quantized conv transpose2d
GitOrigin-RevId: ffd6431299
3 years ago
Megvii Engine Team
3b452d8c16
feat(mgb): cuda conv support nhwc format and fp16 dtype
GitOrigin-RevId: b8ddcd108a
3 years ago
Megvii Engine Team
2aba0378b9
refactor(mgb/dnn): fix group conv is_available
GitOrigin-RevId: b279909168
3 years ago
Megvii Engine Team
4a92346b7a
refactor(mgb): refactor group conv3d
GitOrigin-RevId: 15360a3a41
3 years ago
Megvii Engine Team
6ce212d2e0
refactor(mgb): refactor group conv
GitOrigin-RevId: 7afd312690
4 years ago
Megvii Engine Team
869a03271b
perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance
GitOrigin-RevId: d1b95a6f01
3 years ago
Megvii Engine Team
8d248a6a9a
fix(dnn/cuda): fix testcase for fallback nchw qs8 conv
GitOrigin-RevId: 646440db59
4 years ago
Megvii Engine Team
894a2407c2
feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc
GitOrigin-RevId: e11f3e5408
4 years ago
Megvii Engine Team
f41a808694
feat(dnn/cuda): add nhwc int4 conv support
GitOrigin-RevId: 5236b235d0
4 years ago
Megvii Engine Team
633016a962
fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst
GitOrigin-RevId: 06f90f5cf3
4 years ago
Megvii Engine Team
43098fb8f1
feat(mge): add SlidingWindowTranspose opr
BREAKING CHANGE:
GitOrigin-RevId: 54d726d2fe
4 years ago
Megvii Engine Team
b078dda90b
feat(mge/random): add some random op and remove random/distrbution.py
GitOrigin-RevId: 4c05ebc266
4 years ago
Megvii Engine Team
f30c0e06a6
feat(mgb/opr): add lsq opr
GitOrigin-RevId: 45494a2b57
4 years ago
Megvii Engine Team
1cfdbc565c
feat(dnn): add deterministic max pooling
GitOrigin-RevId: 9ab4c7a748
4 years ago
Megvii Engine Team
a5060a2bfe
feat(mgb/opr): add check_has_inf kernel and opr
GitOrigin-RevId: 0d042dbfce
4 years ago
Megvii Engine Team
d915c5a3fd
refactor(mgb): make convolution3D handle noncontiguous tensors
GitOrigin-RevId: 3d3c31b021
4 years ago