Megvii Engine Team
052a600f03
feat(mge/module): add python wrapper for unfold
GitOrigin-RevId: 562103186f
4 years ago
Megvii Engine Team
4fa9096d20
feat(megbrain): add correlation opr
GitOrigin-RevId: 6d44598891
4 years ago
Megvii Engine Team
b3aae4bb16
feat(dnn/cuda): add correlation kernel
GitOrigin-RevId: 25e58b61e6
4 years ago
Megvii Engine Team
af62add02d
fix(api_cache): fix serialization for conv_desc
GitOrigin-RevId: 95dbc9c685
4 years ago
Megvii Engine Team
d0a68c431a
fix(api_cache): lock api cache for thread safety
GitOrigin-RevId: 8a244677c3
4 years ago
Megvii Engine Team
e8061e6628
perf(cuda/conv): cache serval cudnn api
GitOrigin-RevId: 188c62cdd6
4 years ago
Megvii Engine Team
fe8488bf0a
feat(dnn/apicache): add generic apicache
GitOrigin-RevId: 40b8ac2ab6
4 years ago
Megvii Engine Team
e70c9a3839
fix(cmake): fix cmake depends
reason: some target call python3 to generate some headers,
if PYTHON_EXECUTABLE changed, headers will be invalid, which
will lead to rebuild most of obj, because this headers will
be include in most cpp/c, what`s more, this target do not depends
python3 version(python API) the output is same when use python3.5
or python3.x so change use PYTHON_EXECUTABLE to
PYTHON3_EXECUTABLE_WITHOUT_VERSION, when
PYTHON_EXECUTABLE/PYTHON_LIBRARY/PYTHON_INCLUDE_DIR changed,
can reuse the build obj
GitOrigin-RevId: c2ffe3ead3
4 years ago
Megvii Engine Team
e5d8e4279b
feat(mge): add a tool which can analyze the file generated by compare_binary_iodump.py
GitOrigin-RevId: 9acab0a49f
4 years ago
Megvii Engine Team
747b53c018
feat(dnn): add conv_bwd_data and conv_bwd_filter accuracy shake check
GitOrigin-RevId: 4069e083d2
4 years ago
Megvii Engine Team
ecb11202f5
fix(build): fix naive build
GitOrigin-RevId: 0050ff5d9c
4 years ago
Megvii Engine Team
8109a05a5e
fix(mgb/dnn): fix cub potential issues
Wrap cub with CUB_NS_PREFIX and remove dependency on Thrust to avoid potential linking issues
GitOrigin-RevId: 53893b0a39
4 years ago
Megvii Engine Team
3f5238fb38
feat(mgb/dnn): add accuracy shake checker
GitOrigin-RevId: 0bb52078a1
4 years ago
Megvii Engine Team
be6fb6b7c1
feat(mgb/dnn): add accuracy_depend_on_batch attribute
GitOrigin-RevId: 2be1db43d0
4 years ago
Megvii Engine Team
1cadf9d8d7
fix(mgb): add usable-depend-on-shape attr
GitOrigin-RevId: 3a14fa6b6f
4 years ago
Megvii Engine Team
0083f4c4f3
build(rocm): support rocm-3.9
GitOrigin-RevId: 85f8911736
4 years ago
Megvii Engine Team
928a57f83c
build(rocm): partially support hcc compilation
GitOrigin-RevId: ca9f1f8e8e
4 years ago
Megvii Engine Team
100a502764
fix(dnn): replace kernel launch syntax with macro for hcc
GitOrigin-RevId: f9e69d4825
4 years ago
Megvii Engine Team
07ab8cb6b6
feat(dnn): add param_pack for rocm
GitOrigin-RevId: 2180504c71
4 years ago
Megvii Engine Team
4b2b623b8b
fix(dnn/cuda): fix cutlass matmul splitk limit
GitOrigin-RevId: fc9a7c638c
4 years ago
Megvii Engine Team
ef9aa80074
fix(mgb/dnn): fix cuda naive matmul algo
GitOrigin-RevId: 79c9bba73b
4 years ago
Megvii Engine Team
2d18074a70
fix(mgb): fix spell error
GitOrigin-RevId: acae00e0a5
4 years ago
Megvii Engine Team
ff755451d2
refactor(mgb): move algo's name from info to desc and delete some algo's unnecessary param() method
GitOrigin-RevId: 144ff547d1
4 years ago
Megvii Engine Team
756c1eb7f2
fix(mgb/dnn): add cuda float naive matmul algo
GitOrigin-RevId: db7f7fc057
4 years ago
Megvii Engine Team
8494a1529e
chore(scripts): clarify and fix default value of bit combined enum
GitOrigin-RevId: 3716bf9bb5
4 years ago
Megvii Engine Team
04b1a45af4
fix(dnn): fix cudnn crash when finalize called after cudnn dtor
GitOrigin-RevId: b0ad639921
4 years ago
Megvii Engine Team
c338e876ec
refactor(mgb/dnn): add negative attribute for algo
GitOrigin-RevId: 88b1ce94a5
4 years ago
Megvii Engine Team
ec1a99acc2
refactor(mgb/dnn): replace reproducible with attribute
GitOrigin-RevId: d49015714c
4 years ago
Megvii Engine Team
0d165399e6
fix(mgb): fix fastrun for imperative
GitOrigin-RevId: db54984b92
4 years ago
Megvii Engine Team
94401ce44a
chore(dotprod): dotprod is enabled by default on the android platform
GitOrigin-RevId: d412108732
4 years ago
Megvii Engine Team
85b41a90df
feat(dnn): add checksum opr and test
GitOrigin-RevId: e784a76e0b
4 years ago
Megvii Engine Team
a49f4a66b7
feat(dnn): add indexing_one_hot and indexing_set_one_hot opr
GitOrigin-RevId: c5406c71ff
4 years ago
Megvii Engine Team
2fd3fa8834
feat(cmake): update for enflame cmake compile
GitOrigin-RevId: 3c3c6b3462
4 years ago
Megvii Engine Team
9f2af2099c
feat(mgb): add enflame comp node
GitOrigin-RevId: 478c8538aa
4 years ago
Megvii Engine Team
33da8de12b
build(dnn/cuda): split compilation for cutlass wrapper
GitOrigin-RevId: 6365d5fdbc
4 years ago
Megvii Engine Team
68f2e59763
fix(mgb(ci)): fix tx1 ci testcase
GitOrigin-RevId: 7ac0c4b68d
4 years ago
Megvii Engine Team
420672beca
fix(mgb/dnn): fix x86 matmul midout decl
GitOrigin-RevId: fe1fc977e1
4 years ago
Megvii Engine Team
b717606989
fix(dnn/cuda): add block size limit for culass gemm algo
GitOrigin-RevId: c0940e4535
4 years ago
Megvii Engine Team
55974e8cf9
feat(log): opt log
* opt log at release mode
* add MGE_OVERRIDE_LOG_LEVEL for runtime debug
//! env to config LogLevel
//! DEBUG = 0, INFO = 1, WARN = 2, ERROR = 3, NO_LOG = 4
//! for example , export MGE_OVERRIDE_LOG_LEVEL=0, means set LogLevel to DEBUG
GitOrigin-RevId: 16cd674c56
4 years ago
Megvii Engine Team
1c01128ff5
fix SASS cu111
GitOrigin-RevId: cb075ad154
4 years ago
Megvii Engine Team
58c8746e30
fix(opr): fix fast-run error in cuda
GitOrigin-RevId: 28dd187df9
4 years ago
Megvii Engine Team
ba2ad46e54
feat(gopt): add deconv nchw4 int8 opt pass, add deconv nchw int8
GitOrigin-RevId: c0530a949e
4 years ago
Megvii Engine Team
5d350fc843
feat(dnn/cuda): add deconv int8 and fix cutlass conv wrapper base on modify cutlass 2.4
GitOrigin-RevId: 49e0565e8a
4 years ago
Megvii Engine Team
a3ea1f153c
feat(mgb/opr): add fast profile and combined Execution strategy
GitOrigin-RevId: 843dc3a790
4 years ago
Megvii Engine Team
c82d88751a
fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility
GitOrigin-RevId: 771968f9ac
4 years ago
Megvii Engine Team
652ec9f251
fix(mgb/dnn): fix backward computation of tqt
GitOrigin-RevId: 850d11a5ce
4 years ago
Megvii Engine Team
f2b42bf09e
chore(dotprod): add arm dotprod attribute for easy use
GitOrigin-RevId: 78c3e72218
4 years ago
Megvii Engine Team
c33a717314
feat(dnn): repalce is_reproducible with algo attribute in opencl, cpu, rocm and cuda
GitOrigin-RevId: 86dead0a11
4 years ago
Megvii Engine Team
97beae2fd8
fix(megdnn): fix megdnn benchmark testcase
GitOrigin-RevId: 726971474a
4 years ago
Megvii Engine Team
2de2222e46
feat(dnn/cuda): add cutlass batched gemv kernel for matmul operator
GitOrigin-RevId: 51702c4e79
4 years ago