Megvii Engine Team
1999307015
feat(mgb/opr): add dropout kernel
GitOrigin-RevId: d248bd2005
3 years ago
Megvii Engine Team
a93741815b
feat(mgb/opr): add layernorm forward and backward kernel
GitOrigin-RevId: 0cd484e753
3 years ago
Megvii Engine Team
a404cd7d06
fix(mgb/src): add tensorRT version check
GitOrigin-RevId: 7abfd30cab
3 years ago
Megvii Engine Team
2881934cb8
feat(dnn/check_non_finite): addmul scale to check_non_finite opr
GitOrigin-RevId: c35a219e52
3 years ago
Megvii Engine Team
0d16952470
fix(mgb/cuda): fix conv error when the input tensor is too large
GitOrigin-RevId: 1b1d693795
3 years ago
Megvii Engine Team
2696e4efaa
feat(dnn): add float16 for remap backward
GitOrigin-RevId: 0263030051
3 years ago
Megvii Engine Team
1f0cc891b0
feat(dnn): enable eye to support bool
GitOrigin-RevId: 76d874d5b7
3 years ago
Megvii Engine Team
11d75fecb5
feat(dnn/check_non_finite): add batch check_non_finite
GitOrigin-RevId: e108133282
3 years ago
Megvii Engine Team
2d54ad185b
feat(lite): add global layout transform interface for load and run
GitOrigin-RevId: 65c2430ec2
3 years ago
Megvii Engine Team
ba2f0c2e48
fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases
GitOrigin-RevId: b29b009de0
3 years ago
Megvii Engine Team
c85631aa77
feat(dnn): use ref ptr interface for all backends
GitOrigin-RevId: f65feae5cc
3 years ago
Megvii Engine Team
89186edc5d
fix(dnn): correct reduce/argmxx/fakequant calculation with nan
GitOrigin-RevId: 7e78bdae91
3 years ago
Megvii Engine Team
68cdabd288
feat(opr): indexing_multi_axis_vec support nd index
GitOrigin-RevId: 07b1248bdc
3 years ago
Megvii Engine Team
9b4cd92ba3
fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256
GitOrigin-RevId: 20c0b90575
3 years ago
Megvii Engine Team
849f0ece9d
fix(dnn): drop batched matmul cublas algo when batch is 1
GitOrigin-RevId: 71126a27b0
3 years ago
Megvii Engine Team
5af52746f7
fix(mgb): fix bug caused by conv filter size is too big
GitOrigin-RevId: 7fe3a9fcf1
3 years ago
Megvii Engine Team
10af44abba
fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout
the conv_bias algo *_IMPLICIT_GEMM in cudnn less than 8.0.0 is disabled due to the incorrect result for int8x4->f32 configs
GitOrigin-RevId: 7cc52d0a85
3 years ago
Megvii Engine Team
369c2ccc5a
style(all): reformat c++ code
GitOrigin-RevId: 3ffd1b211f
3 years ago
Megvii Engine Team
bfb30dcb81
chore(format): fix compile bugs after code format
GitOrigin-RevId: 11a4b06f6f
3 years ago
Megvii Engine Team
eeccf2bc0d
ci(check): add clang-format in check stage
GitOrigin-RevId: 25c24d7856
3 years ago
Megvii Engine Team
177dec94c5
feat(mgb/opr): add bgr2gray mode for cvtcolor opr
GitOrigin-RevId: d50415b236
3 years ago
Megvii Engine Team
f5cb21ed3a
fix(mgb/opr): add non finite check
GitOrigin-RevId: a9fcd0a350
3 years ago
Megvii Engine Team
fca195351c
feat(gopt): add nhwc fuse conv typecvt optpass
GitOrigin-RevId: adc2301203
3 years ago
Megvii Engine Team
2fc7358517
Revert "feat(dnn/apicache): add generic apicache"
This reverts commit 40b8ac2ab6
.
GitOrigin-RevId: 0f9b2df4c8
3 years ago
Megvii Engine Team
de363c04af
Revert "perf(cuda/conv): cache serval cudnn api"
This reverts commit 188c62cdd6
.
GitOrigin-RevId: 92a82b8cd9
3 years ago
Megvii Engine Team
729ee64988
Revert "fix(api_cache): lock api cache for thread safety"
This reverts commit 8a244677c3
.
GitOrigin-RevId: 582488adeb
3 years ago
Megvii Engine Team
64c922c4bb
Revert "fix(api_cache): fix serialization for conv_desc"
This reverts commit 95dbc9c685
.
GitOrigin-RevId: ca8c67b6b3
3 years ago
Megvii Engine Team
b3e54eade1
feat(dnn/bn): use new cudnn BN kernel to support NHWC
GitOrigin-RevId: 9d80f2009d
3 years ago
Megvii Engine Team
3977b7aa0b
feat(mgb/shuffle): add shuffle opr
GitOrigin-RevId: 80490a6f84
3 years ago
Megvii Engine Team
eca6e1d931
fix(ci): fixes for ci
GitOrigin-RevId: b0a432bd2e
3 years ago
Megvii Engine Team
8b40f57738
feat(mgb/dnn): add conv1x1 algo for matrix mul
GitOrigin-RevId: 585b2c045a
3 years ago
Megvii Engine Team
d69b59035d
feat(dnn): add an get_all_algorithms_safe interface
GitOrigin-RevId: e3734e4531
3 years ago
Megvii Engine Team
8b94f49328
fix(dnn/cuda): fix elemwise and relayout int4 bug when last shape is 1
GitOrigin-RevId: e7d64c4987
3 years ago
Megvii Engine Team
694aa1bd92
feat(dnn): add heuristic cache
GitOrigin-RevId: 35e942b5e3
3 years ago
Megvii Engine Team
722aecd437
feat(mgb): support fp16 nhwc backward
GitOrigin-RevId: 954ac6405a
3 years ago
Megvii Engine Team
0708bc780c
fix(dnn/cuda): disallow implicit dtype conversion in cublaslt matmul algos
disable tensor op matmul kernels when input and output tensors are in f32 data type to avoid potential accuracy loss
GitOrigin-RevId: 36859cba5a
3 years ago
Megvii Engine Team
7b855dc64a
fix(dnn/cuda): fix compilation for windows bazel
GitOrigin-RevId: 2023dea19c
3 years ago
Megvii Engine Team
4c13bc7e1b
feat(dnn/cuda): add nhwc int8 deconv
GitOrigin-RevId: ad361a0f81
3 years ago
Megvii Engine Team
11f022ff7c
feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt
GitOrigin-RevId: 229e1eb4be
3 years ago
Megvii Engine Team
a0231a7920
fix(dnn/cuda): fix algo matmul for conv bwd filter
fix fastrun workspace size not available exception and device OOM error caused by the incorrect workspace size calculation of algo matmul of conv bwd filter
GitOrigin-RevId: de96b4fe11
3 years ago
Megvii Engine Team
67575d582c
feat(mge/opr): add interpolate bilinear mode
GitOrigin-RevId: f7023a3fd3
3 years ago
Megvii Engine Team
0558b2123d
feat(mge/opr): add interpolate nearest mode
GitOrigin-RevId: d384b87f50
3 years ago
Megvii Engine Team
ff0e6be7b9
fix(dnn/cuda): fix cutlass tensorop kernels
do not compile cutlass tensorop kernels, when using cuda version less than 10.2
GitOrigin-RevId: d4c37d5f41
3 years ago
Megvii Engine Team
336761253d
feat(dnn/cuda): add tensorcore matmul for fp16 data type
GitOrigin-RevId: 025c591f75
3 years ago
Megvii Engine Team
cc07b96f82
perf(dnn/relayout): disable copy_last_contiguous when contiguous_size is
small
GitOrigin-RevId: 7bd5e793a4
3 years ago
Megvii Engine Team
d195fdec71
refactor(mgb): refactor has-usable-algo function for global optimizer
GitOrigin-RevId: 6610516650
3 years ago
Megvii Engine Team
604bb2a569
feat(mgb/dnn): add int atomic add for megdnn
GitOrigin-RevId: 00d5d752d3
3 years ago
Megvii Engine Team
eab6afab47
feat(mgb): add padding opr for megbrain
GitOrigin-RevId: 490e0c5d5a
4 years ago
Megvii Engine Team
9b4b910dc1
feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers
GitOrigin-RevId: 2a70335441
3 years ago
Megvii Engine Team
b18feaab33
feat(dnn/cuda): use cutlass remove shared load imma conv kernel
GitOrigin-RevId: 0b5574f526
4 years ago