Megvii Engine Team
0d16952470
fix(mgb/cuda): fix conv error when the input tensor is too large
GitOrigin-RevId: 1b1d693795
3 years ago
Megvii Engine Team
02d5f46d90
fix(mgb/x86): fix convbias crash on X86
GitOrigin-RevId: cc7283c6a2
3 years ago
Megvii Engine Team
accb2d8d47
fix(mgb/serialize): fix flatbuffer compatibility issues
GitOrigin-RevId: e4771d6bc4
3 years ago
Megvii Engine Team
5e07e1e0f9
fix(dnn/fallback): let cpu be able to execute int4 model
GitOrigin-RevId: 1a6b78f3b6
3 years ago
Megvii Engine Team
2696e4efaa
feat(dnn): add float16 for remap backward
GitOrigin-RevId: 0263030051
3 years ago
Megvii Engine Team
1f0cc891b0
feat(dnn): enable eye to support bool
GitOrigin-RevId: 76d874d5b7
3 years ago
Megvii Engine Team
11d75fecb5
feat(dnn/check_non_finite): add batch check_non_finite
GitOrigin-RevId: e108133282
3 years ago
Megvii Engine Team
2318ea3f15
fix(dnn): fix naive average pooling overflow bug for int8 type
GitOrigin-RevId: b60a7b6cf8
3 years ago
Megvii Engine Team
2d54ad185b
feat(lite): add global layout transform interface for load and run
GitOrigin-RevId: 65c2430ec2
3 years ago
Megvii Engine Team
ba2f0c2e48
fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases
GitOrigin-RevId: b29b009de0
3 years ago
Megvii Engine Team
30976c239f
fix(mgb/gopt): fix global layout transform
add a dedicated opr_format modify function for concat operators that updates the concat axis when the input's layout has changed
GitOrigin-RevId: 4094208057
3 years ago
Megvii Engine Team
ca7cec7a5d
fix(mgb/gopt): minor fixes for global layout transform
Merge Target::ARM and Target::X86 into Target::CPU to make global layout transform easier to use
GitOrigin-RevId: cc9363fa38
3 years ago
Megvii Engine Team
fe93013a6e
feat(mgb/gopt): global layout transform support nchw_nchwxx hybrid mode
GitOrigin-RevId: 6d5b55d7fc
3 years ago
Megvii Engine Team
3d45d35241
feat(mgb/gopt): profiler support checking algo availability
GitOrigin-RevId: 39cad612cc
3 years ago
Megvii Engine Team
b59e8ccf24
fix(mgb): fix cambricon bangc copybara
GitOrigin-RevId: 22b57364e8
3 years ago
Megvii Engine Team
3116e128c5
fix(ci/integration_test): fix benchmark torch version
GitOrigin-RevId: bd964ed505
3 years ago
Megvii Engine Team
c85631aa77
feat(dnn): use ref ptr interface for all backends
GitOrigin-RevId: f65feae5cc
3 years ago
Megvii Engine Team
d90cb7763c
feat(src/core): record support change ptr basic
GitOrigin-RevId: b12f1c4a66
3 years ago
Megvii Engine Team
89186edc5d
fix(dnn): correct reduce/argmxx/fakequant calculation with nan
GitOrigin-RevId: 7e78bdae91
3 years ago
Megvii Engine Team
68cdabd288
feat(opr): indexing_multi_axis_vec support nd index
GitOrigin-RevId: 07b1248bdc
3 years ago
Megvii Engine Team
a1cba6cc27
fix(dnn): fix convbias crash on X86
GitOrigin-RevId: 3b33c83d03
3 years ago
Megvii Engine Team
9b4cd92ba3
fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256
GitOrigin-RevId: 20c0b90575
3 years ago
Megvii Engine Team
23c1fda7e6
perf(arm_common): optimize sigmoid
GitOrigin-RevId: 7cb248a15b
3 years ago
Megvii Engine Team
25ec2530ba
feat(whl/api/lar): enable megengine dll on Windows
1: reduce python whl package size
2: unify api link logic on all OS
3: add option: MGE_WINDOWS_BUILD_WITH_STATIC_CRT
--- default OFF
--- if the CRT (VC runtime) is linked statically with megengine.dll,
some CRT APIs will crash, for example flush; so if you
build with static megengine and do not want to install the CRT,
you can set MGE_WINDOWS_BUILD_WITH_STATIC_CRT to TRUE
--- how to install CRT:
https://docs.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-160
install VC_redist.x64.exe
4: rename megengine_export to megengine_shared (only export needed symbols),
because of runtime symbol conflicts with pytorch
GitOrigin-RevId: 93d8d80f29
3 years ago
Megvii Engine Team
c48d58daa8
feat(dnn/arm_common): add N1HW like elemwise broadcast mode
GitOrigin-RevId: 2895135801
3 years ago
Megvii Engine Team
26634db7a8
fix(dnn): support relayout for non-contiguous layout
GitOrigin-RevId: 44a0adddba
3 years ago
Megvii Engine Team
056fd6bc59
feat(dnn/arm64): support stride_m in arm64 relayout
GitOrigin-RevId: c74193a23d
3 years ago
Megvii Engine Team
c50858ee13
fix(dnn): specialize pow to make it consistent
GitOrigin-RevId: cff3bbbadd
3 years ago
Megvii Engine Team
849f0ece9d
fix(dnn): drop batched matmul cublas algo when batch is 1
GitOrigin-RevId: 71126a27b0
3 years ago
Megvii Engine Team
b5bf56e0ee
style(dnn): add bypass of clang-format for dnn foreach_opr macro
GitOrigin-RevId: 2f4bfcff42
3 years ago
Megvii Engine Team
5af52746f7
fix(mgb): fix bug caused when conv filter size is too big
GitOrigin-RevId: 7fe3a9fcf1
3 years ago
liuke
b0ba6d3201
Merge pull request #207 from togetherwhenyouwant:feat-x86-matmul-6x16x2
GitOrigin-RevId: 148ae44ba0
3 years ago
Megvii Engine Team
10af44abba
fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout
the conv_bias algo *_IMPLICIT_GEMM is disabled for cudnn versions below 8.0.0 because it gives incorrect results for int8x4->f32 configs
GitOrigin-RevId: 7cc52d0a85
3 years ago
Megvii Engine Team
5885b137fa
feat(dnn/arm): support NHWC-like channel broadcast on arm
GitOrigin-RevId: fb4300004c
3 years ago
Megvii Engine Team
369c2ccc5a
style(all): reformat c++ code
GitOrigin-RevId: 3ffd1b211f
3 years ago
Megvii Engine Team
bfb30dcb81
chore(format): fix compile bugs after code format
GitOrigin-RevId: 11a4b06f6f
3 years ago
Megvii Engine Team
eeccf2bc0d
ci(check): add clang-format in check stage
GitOrigin-RevId: 25c24d7856
3 years ago
zjl
d2184af3b2
feat(dnn/src/x86/matmul): add matmul_6x16 for x86
3 years ago
Megvii Engine Team
177dec94c5
feat(mgb/opr): add bgr2gray mode for cvtcolor opr
GitOrigin-RevId: d50415b236
3 years ago
Megvii Engine Team
f5cb21ed3a
fix(mgb/opr): add non finite check
GitOrigin-RevId: a9fcd0a350
3 years ago
Megvii Engine Team
bde5cf3564
feat(dnn): add resize linear for arm
GitOrigin-RevId: 14ac5bda3f
3 years ago
Megvii Engine Team
8cb201868e
fix(mgb): fix fastrun cache serialization method
GitOrigin-RevId: cd8bed3cc9
3 years ago
Megvii Engine Team
563239d38f
feat(dnn): add arm_common nchw44 cwconv3x3s1p1 and cwconv5x5s1p2
GitOrigin-RevId: 9ea411d0e1
3 years ago
Megvii Engine Team
3344b580a9
feat(dnn): add elemwise for nchw88+fp16
GitOrigin-RevId: 63587975f8
3 years ago
Megvii Engine Team
682c74df27
feat(dnn): add direct nchw88 fp16 conv
GitOrigin-RevId: 44719e8b64
3 years ago
Megvii Engine Team
fca195351c
feat(gopt): add nhwc fuse conv typecvt optpass
GitOrigin-RevId: adc2301203
3 years ago
Megvii Engine Team
2fc7358517
Revert "feat(dnn/apicache): add generic apicache"
This reverts commit 40b8ac2ab6.
GitOrigin-RevId: 0f9b2df4c8
3 years ago
Megvii Engine Team
de363c04af
Revert "perf(cuda/conv): cache several cudnn api"
This reverts commit 188c62cdd6.
GitOrigin-RevId: 92a82b8cd9
3 years ago
Megvii Engine Team
729ee64988
Revert "fix(api_cache): lock api cache for thread safety"
This reverts commit 8a244677c3.
GitOrigin-RevId: 582488adeb
3 years ago
Megvii Engine Team
64c922c4bb
Revert "fix(api_cache): fix serialization for conv_desc"
This reverts commit 95dbc9c685.
GitOrigin-RevId: ca8c67b6b3
3 years ago