Megvii Engine Team
4e0c9ad3c6
feat(mgb/external): extern-c-opr dumper and loader for MACE
GitOrigin-RevId: bfe5420b1d
5 years ago
Megvii Engine Team
ca52a93e9f
fix(mge/quant): fix init value of histogram observer
GitOrigin-RevId: 9c8caa5f8f
5 years ago
Megvii Engine Team
9f352b1c45
feat(megbrain/dnn): add indexing remap int32 for naive and cuda
GitOrigin-RevId: 5f66d51de4
5 years ago
Megvii Engine Team
5dbf218d19
feat(dnn/x86): add sse 8x8x16 matmul
GitOrigin-RevId: ed8d9ee5db
5 years ago
Megvii Engine Team
25b6a13148
feat(dnn/x86): add x86 avx2 8x8x16 matmul
GitOrigin-RevId: d2172c50b2
5 years ago
Megvii Engine Team
273f891b55
fix(mgb/gopt): fix run-time winograd-transform and nchwxx error
GitOrigin-RevId: aca796f17d
5 years ago
Megvii Engine Team
02abc36ea6
fix(mgb/arm_common): fix nchw44-dot misc issue
GitOrigin-RevId: f870ad964c
5 years ago
Megvii Engine Team
9ed3882a94
fix(opr/dnn): fix winograd fast run mismatch
GitOrigin-RevId: d308085b9f
5 years ago
Megvii Engine Team
18be23f328
fix(mgb/gopt): fix nchwxx gopt with no fuse conv_bias and winograd fast-run
GitOrigin-RevId: 49ccbdf2d4
5 years ago
Megvii Engine Team
38f7cbd9aa
fix(mge/module): fix redundant recursion in `train()`
GitOrigin-RevId: 6b3566930b
5 years ago
Megvii Engine Team
5c2323529d
test(mge/quantization): add `quantize_disabled` related test
GitOrigin-RevId: f62ba600c5
5 years ago
Megvii Engine Team
ab91302515
feat(mge/quantization): add `quantize_disabled` attribute in Module
GitOrigin-RevId: f108f03c5a
5 years ago
Megvii Engine Team
f4ead78852
feat(mgb): static allocation with given padding
GitOrigin-RevId: fdf2de8ad6
5 years ago
Megvii Engine Team
575a6dca9f
ci(docker): change object files permission
GitOrigin-RevId: e388f4b57e
5 years ago
Megvii Engine Team
08ac685e3a
feat(mge/functional): add logsumexp
GitOrigin-RevId: e7ef50e9ec
5 years ago
Megvii Engine Team
65ec4f7c26
fix(ci): fix test timeout
GitOrigin-RevId: 875fc613cf
5 years ago
Megvii Engine Team
ea6bfe6cd9
fix(dnn/cuda-stub): simplify and use proper search paths
Removed the `access()` call before `dlopen()`.
It was copy-pasted from the opencl-stub, does not make sense here, and
prevents `dlopen()` from loading `libcuda.so` from a non-default path.
Updated the names of the libraries providing the CUDA Driver API on
different platforms; these are taken from the following file in a CUDA
install:
samples/6_Advanced/matrixMulDynlinkJIT/cuda_drvapi_dynlink.c
GitOrigin-RevId: ed43cab8c8
5 years ago
Megvii Engine Team
01092feb9b
feat(mgb): add PackAllReducePass
GitOrigin-RevId: 59c1b45393
5 years ago
Megvii Engine Team
c7e6c658fd
refactor(mge/distribute): use is_root (and rank) instead of rank and root in collective comm
GitOrigin-RevId: dccdb71553
5 years ago
Megvii Engine Team
ff308e3b62
feat(mgb/comp_node): generate uid for cuda comp node
GitOrigin-RevId: 34fa5a2fb6
5 years ago
Megvii Engine Team
32c86211ee
fix(dnn/cuda): enable cuda algos for nchw quantized
GitOrigin-RevId: 4d1e167b86
5 years ago
Megvii Engine Team
b8d8886e35
refactor(mge/tensor): combine Dict and TensorDict
GitOrigin-RevId: 6b6c03c04b
5 years ago
Megvii Engine Team
7751a0676e
docs(mge/tensor): add advanced index related docs
GitOrigin-RevId: 31735ddac4
5 years ago
Megvii Engine Team
7b0dbe6af8
fix(dnn/arm): fix stride 1 support for int8 nchw_nchw44
GitOrigin-RevId: 9d718eb7a4
5 years ago
Megvii Engine Team
198f3eb5f6
fix(dnn/arm): fix fp32 nchw44 direct workspace bug
GitOrigin-RevId: 6ee433b02c
5 years ago
Megvii Engine Team
49fdddef8d
fix(gopt): fix reorder arith chain pass
GitOrigin-RevId: d3257ac43a
5 years ago
Megvii Engine Team
6742a58b7e
fix(quant): do not use cond_take in observer
GitOrigin-RevId: a954814bcb
5 years ago
Megvii Engine Team
9e876203b5
feat(dnn): add int8 direct conv dot nchw44
GitOrigin-RevId: 31830ba7a4
5 years ago
Megvii Engine Team
09ceaaaecf
fix(dnn/arm): stride1 support for nchw_nchw44 fp32 conv
GitOrigin-RevId: 744c5db3dc
5 years ago
Megvii Engine Team
50db9b84c2
fix(gopt): fix paramfuse if the endpoint is const
GitOrigin-RevId: f666f6d700
5 years ago
Megvii Engine Team
35bc0e1f60
fix(mge/function): do not deeply copy saved tensor in Function
GitOrigin-RevId: 3c89d1ceaa
5 years ago
Megvii Engine Team
47377c7be5
fix(core): fix memory defragmenter
GitOrigin-RevId: e883be8b5c
5 years ago
Megvii Engine Team
f56f187f6e
fix(mgb/gopt): fix nchw44-dot channel-wise trans to nchw44
GitOrigin-RevId: aa2059a796
5 years ago
Megvii Engine Team
af29fcb2e3
feat(mgb/plugin): add param json func for indexing oprs
GitOrigin-RevId: b5becbbc02
5 years ago
Megvii Engine Team
62753c4d30
fix(mge/sdk): fix comp_node bug in dump_with_testcase_mge
GitOrigin-RevId: 26a8dc50b8
5 years ago
Megvii Engine Team
f1c86606cb
fix(dnn/cuda): fix FuseConvBiasWithZ pass for HSwish activation
GitOrigin-RevId: b290469cb1
5 years ago
Megvii Engine Team
adfa468899
fix(mge/functional): fix scatter doctest failure on GPU platform
GitOrigin-RevId: b5f92c39dd
5 years ago
Megvii Engine Team
4f8e60801c
feat(dnn): fix Werror by adding macro
GitOrigin-RevId: 1f5fe4d46a
5 years ago
Megvii Engine Team
d7bb62cfa1
refactor(mgb): move mm_handler from python module into opr-mm
GitOrigin-RevId: f401ce8603
5 years ago
Megvii Engine Team
84068a6bb1
fix(mge/data): fix typos in voc and objects365
GitOrigin-RevId: 491e607b7e
5 years ago
Megvii Engine Team
3966bb08b3
feat(dnn/test): split cpu.convolution
GitOrigin-RevId: fa28d3d760
5 years ago
Megvii Engine Team
8f87a3e988
feat(dnn/arm_common): add int8 nchw44 winograd f23_4x4 f23_8x8 compute float32/int16 output int8
GitOrigin-RevId: d99ef7efcd
5 years ago
Megvii Engine Team
8ffed043be
fix(dnn/x86): fix matrix_mul quantized performance on vnni
GitOrigin-RevId: 4af6b8be60
5 years ago
Megvii Engine Team
1d860f4d6f
fix(dnn/x86): fix dnnl int8 algo on vnni
GitOrigin-RevId: 2384e09558
5 years ago
Megvii Engine Team
871e6a516f
feat(dnn/x86): opt x86 quantized heuristic
GitOrigin-RevId: 72abe9efcc
5 years ago
Megvii Engine Team
6c29548d20
fix(dnn/arm): fix nchw_nchw44 dot stride1 support
GitOrigin-RevId: c8d3d55b25
5 years ago
Megvii Engine Team
02cbb13bbc
fix(dnn/arm): fix nchw44 fp32 direct algo oh block and unused stride2 algo
GitOrigin-RevId: 8012678fae
5 years ago
Megvii Engine Team
d2f5874a52
fix(mge/module): fix non-str key error of dict in module
GitOrigin-RevId: f82cd48230
5 years ago
Megvii Engine Team
30b3d3aa3e
fix(dnn/gopt): add convolution nchw44-dot format gopt
GitOrigin-RevId: e8e1e96379
5 years ago
Megvii Engine Team
48d1ac1433
fix(dnn/arm): fix consistency between create_conv1x1_strategy and can_create_conv1x1_strategy
GitOrigin-RevId: 2d32998aca
5 years ago