Megvii Engine Team
fca6c76a0e
fix(lite): fix input invalid bug in lar for fitting mode
GitOrigin-RevId: 45d81c9a96
2 years ago
Megvii Engine Team
cc21855074
feat(lite): load_and_run support optimize for inference
GitOrigin-RevId: d9abb8de9e
2 years ago
Megvii Engine Team
fac67e7c2b
feat(gopt): support nchw44 global pooling with fuse_grain
GitOrigin-RevId: 4c43a149f8
2 years ago
Megvii Engine Team
d2a1905ad5
Revert "feat(mgb): add cumprod opr"
This reverts commit 3436c3bdaa.
GitOrigin-RevId: 95ab3d1aa7
2 years ago
Megvii Engine Team
49e14f87b5
feat(mgb): add cumprod opr
GitOrigin-RevId: 3436c3bdaa
3 years ago
Megvii Engine Team
02bfb8f8b9
feat(lite): add and fix some features for load and run fitting mode
GitOrigin-RevId: bbddc9bb79
3 years ago
Megvii Engine Team
c2e9860feb
chore(license): remove all license in file header
GitOrigin-RevId: a0e31247a6
3 years ago
Megvii Engine Team
4cce2480d5
fix(dnn/opencl): fix some bug for dnn opencl conv bias and relayout format
GitOrigin-RevId: b5bb07d90d
3 years ago
Megvii Engine Team
484e1f1173
fix(build): fix riscv64 gcc build with > O0
GitOrigin-RevId: 9ad3480492
3 years ago
Megvii Engine Team
5ebc9d50b7
fix(pylite): fix lite global layout transform and fast run conflict error
GitOrigin-RevId: 910c8da19f
3 years ago
Megvii Engine Team
12a3ef8d01
refactor(fastrun): decouple fastrun from computing graph
GitOrigin-RevId: 27abd22295
3 years ago
Megvii Engine Team
87de704a46
feat(gopt): fuse conv h_swish
GitOrigin-RevId: a3d12991fb
3 years ago
Megvii Engine Team
3726f5cc92
feat(gopt): merge consecutive relayout and dimshuffle to one relayout to optimize CD4 performance
GitOrigin-RevId: a058776be3
3 years ago
Megvii Engine Team
1fead9b6b0
feat(gopt): merge consecutive dimshuffle and relayout to one relayout to optimize CD4 performance
GitOrigin-RevId: 16f22baa80
3 years ago
Megvii Engine Team
26d1e4f7ed
feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible
GitOrigin-RevId: 6580dedca7
3 years ago
Megvii Engine Team
5f4501e0f3
fix(gopt): fix conv bias fuse 2 noline
GitOrigin-RevId: a6ab9f4e5e
3 years ago
Megvii Engine Team
6011f51001
style(all): fix clang-format for MGB_DEFINE inside another macro
GitOrigin-RevId: 8c2b6a2aed
3 years ago
Megvii Engine Team
93310c0e4b
fix(mgb/gopt): fix cpu global layout transform fastrun error
GitOrigin-RevId: ea254297e5
3 years ago
Megvii Engine Team
e715423f20
feat(src/gopt): add optpass on arm for fusing typecvt and elemwise to elemwise multi type
GitOrigin-RevId: e6bcbbf91b
3 years ago
Megvii Engine Team
0ad5eeaedd
feat(mgb/gopt): global layout transform support opencl
GitOrigin-RevId: 132605c7d9
3 years ago
Megvii Engine Team
b8ccc6a211
fix(mgb): fix loss of execution policy after opr shallow copy
GitOrigin-RevId: 4738136e4a
3 years ago
Megvii Engine Team
6bb5409976
feat(dnn/src): add images2neibs kernel of opencl and related test
GitOrigin-RevId: 82242b7437
3 years ago
Megvii Engine Team
30976c239f
fix(mgb/gopt): fix global layout transform
add a special opr_format modify function for concat operators to modify concat axis when input's layout has been changed
GitOrigin-RevId: 4094208057
3 years ago
Megvii Engine Team
b6c0e8d093
fix(mgb/gopt): update profiler cache of global layout transform test
GitOrigin-RevId: fe8a712599
3 years ago
Megvii Engine Team
f2db7b0ded
feat(mgb/gopt): global layout transform support cuda fp16
GitOrigin-RevId: 1449c54215
3 years ago
Megvii Engine Team
ca7cec7a5d
fix(mgb/gopt): minor fixes for global layout transform
Merge Target::ARM and Target::X86 into Target::CPU to make global layout transform easier to use
GitOrigin-RevId: cc9363fa38
3 years ago
Megvii Engine Team
93152dfa14
fix(mgb/gopt): fix global layout transform
deduplicate the states of the DP problem, because different layout configs (NCHW44 & NCHW44_HYBRID) produce tensors with the same layout
GitOrigin-RevId: 7f77efd21b
3 years ago
Megvii Engine Team
fe93013a6e
feat(mgb/gopt): global layout transform support nchw_nchwxx hybrid mode
GitOrigin-RevId: 6d5b55d7fc
3 years ago
Megvii Engine Team
3d45d35241
feat(mgb/gopt): profiler support checking algo availability
GitOrigin-RevId: 39cad612cc
3 years ago
Megvii Engine Team
5f15f75984
test(mgb/gopt): add a testcase for SubGraphExtractor with multiple outputs
GitOrigin-RevId: 7785bdc8c0
3 years ago
Megvii Engine Team
a6230ba95a
feat(mgb/gopt): global layout transform support arm
GitOrigin-RevId: db50b33c11
3 years ago
Megvii Engine Team
0be6ca8899
fix(src/core): fix record change ptr bug on comp node copy
GitOrigin-RevId: 0f68966211
3 years ago
Megvii Engine Team
d90cb7763c
feat(src/core): record support change ptr basic
GitOrigin-RevId: b12f1c4a66
3 years ago
Megvii Engine Team
34773ba37b
fix(mgb/gopt): tensorcore pass replace BatchConvBias inputs to nchw4
GitOrigin-RevId: 3ff3c422fb
3 years ago
Megvii Engine Team
8b7649345a
feat(lite): lite support output var no copy option
GitOrigin-RevId: 5b9488cb93
3 years ago
Megvii Engine Team
7642f66df7
feat(core): support output var no copy
GitOrigin-RevId: fd68a1735b
3 years ago
Megvii Engine Team
25ec2530ba
feat(whl/api/lar): enable megengine dll on Windows
1: reduce python whl package size
2: unify api link logic on all OS
3: add option: MGE_WINDOWS_BUILD_WITH_STATIC_CRT
--- default OFF
--- if the CRT (VC runtime) is built as STATIC with megengine.dll, some CRT APIs will crash, for example flush; so if you build with static megengine and do not want to install the CRT, you can set MGE_WINDOWS_BUILD_WITH_STATIC_CRT to TRUE
--- how to install CRT: install VC_redist.x64.exe from
https://docs.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-160
4: rename megengine_export to megengine_shared (only export needed symbols), caused by runtime symbol conflicts with pytorch
GitOrigin-RevId: 93d8d80f29
3 years ago
Megvii Engine Team
d898838e18
fix(src/gopt/test): remove device info in the cache_data
GitOrigin-RevId: 8e9758971d
3 years ago
Megvii Engine Team
a4ac5e7e8f
fix(mgb/gopt): fix profiler cache when build without opr type info name
GitOrigin-RevId: 0eba678f2b
3 years ago
Megvii Engine Team
012de7695d
feat(mgb/gopt): add profiler cache
To improve the performance of the profiling procedure and make the layout transform testcases stable, the profiling results in the CI environment are cached in files.
GitOrigin-RevId: ba2743f35f
3 years ago
Megvii Engine Team
b33ec46e54
fix(mgb/gopt): fix layout gtrans when graph partition has opr with different format
GitOrigin-RevId: 326fdebb0c
3 years ago
Megvii Engine Team
369c2ccc5a
style(all): reformat c++ code
GitOrigin-RevId: 3ffd1b211f
3 years ago
Megvii Engine Team
88c1eedbd7
feat(mgb/gopt): enable reduce for nchw44
GitOrigin-RevId: fce59d0762
3 years ago
Megvii Engine Team
fca195351c
feat(gopt): add nhwc fuse conv typecvt optpass
GitOrigin-RevId: adc2301203
3 years ago
Megvii Engine Team
c0ccd0ea7e
feat(mge/bn): add NHWC support for bn
GitOrigin-RevId: 0a5bb6f72d
3 years ago
Megvii Engine Team
d7cc4628f6
perf(gopt): opt concat for OpenCL
GitOrigin-RevId: 9bb226d4b1
3 years ago
Megvii Engine Team
eca6e1d931
fix(ci): fixes for ci
GitOrigin-RevId: b0a432bd2e
3 years ago
Megvii Engine Team
19d7412a0a
refactor(mgb/gopt): reorganize code of global layout transform
GitOrigin-RevId: 4973820e02
3 years ago
Megvii Engine Team
8ef12bdf22
feat(mgb/gopt): add user interface for global layout transform
GitOrigin-RevId: b71d6c60ef
3 years ago
Megvii Engine Team
a3cd3fc74f
test(mgb/gopt): add testcase for global layout transform
GitOrigin-RevId: f9669e1ba0
3 years ago