Megvii Engine Team
|
2272abe18d
|
fix(mgb/fallback): disable nchw44 in conv1x1 and im2col in x86
GitOrigin-RevId: 603d2eb94a
|
4 years ago |
Megvii Engine Team
|
230ab45a1e
|
fix(mgb/naive): fix naive convolution no dispatch kernel in handle
GitOrigin-RevId: 4038fe23a4
|
4 years ago |
Megvii Engine Team
|
22853fa20c
|
feat(mge/quantization): add `mapping` parameter for custom modules
GitOrigin-RevId: a4de4261d0
|
4 years ago |
Megvii Engine Team
|
6e70fa7a11
|
feat(dnn/arm): add fp32 asm gemm for a53 a55 and i8i8i16 gemm for a72 a53
GitOrigin-RevId: a049c33f2b
|
4 years ago |
Megvii Engine Team
|
dbaf84b0ef
|
feat(imperative): add cond_take opr
GitOrigin-RevId: 5272e6fa71
|
4 years ago |
Megvii Engine Team
|
df356635b7
|
fix(mgb/fallback): delete im2col duplicate code and fix nchw44 usable
GitOrigin-RevId: 1aa250e9e7
|
4 years ago |
Megvii Engine Team
|
4a2270834f
|
fix(mgb/fallback): fix conv1x1 and conv1x1_gemv nchw44 usable
GitOrigin-RevId: 90aa75d51e
|
4 years ago |
Megvii Engine Team
|
b778d22523
|
feat(mgb/fallback): add conv1x1_gemv, conv1x1 and im2col 8x8x16/8x8x32 support bias
GitOrigin-RevId: 3d97fedc8f
|
4 years ago |
Megvii Engine Team
|
c357db0134
|
feat(mgb/arm_common): add 8x8x16 nchw44 max pooling
GitOrigin-RevId: ed460adb7a
|
4 years ago |
Megvii Engine Team
|
7f5f375fda
|
feat(dnn/arm): add armv7 nchw_nchw44 3x3s2 asm kernel
GitOrigin-RevId: 50ce91e41d
|
4 years ago |
Megvii Engine Team
|
b7d5fa7e64
|
fix(sdk/load_and_run): fix misuse std::string::substr
GitOrigin-RevId: f45a84696a
|
4 years ago |
Megvii Engine Team
|
1bce857cb8
|
fix(mgb/opr-mm): use comp_node of config as default in CollectiveComm
GitOrigin-RevId: 6b43c9fc93
|
4 years ago |
Megvii Engine Team
|
27205461ae
|
feat(mgb/opr-mm): add register info cache for multi-machine oprs
GitOrigin-RevId: d5ae3c5a7c
|
4 years ago |
Megvii Engine Team
|
a7ff580e54
|
feat(mge/utils): add net stats to calculate parameters and flops
GitOrigin-RevId: a77f89e24b
|
4 years ago |
Megvii Engine Team
|
96ec586d28
|
fix(dnn): fix bool cvt
GitOrigin-RevId: 2f883dcbe0
|
4 years ago |
Megvii Engine Team
|
f26cd398e3
|
build(third_party): Update megray version
|
4 years ago |
Megvii Engine Team
|
f829f836b9
|
test(mgb/index): add empty index desc tests
GitOrigin-RevId: 1a71ad3ede
|
4 years ago |
Megvii Engine Team
|
e73f2799d0
|
fix(mgb/index): enable index desc empty
GitOrigin-RevId: 4f0ab7c6e7
|
4 years ago |
Megvii Engine Team
|
b43f6a2602
|
fix(mge/quantization): handle empty Observer in QATModule
GitOrigin-RevId: e8a62297bc
|
4 years ago |
Megvii Engine Team
|
13e8f00a37
|
feat(mge/module): add forward hook support
GitOrigin-RevId: c0db58df13
|
4 years ago |
Megvii Engine Team
|
ab9fa48ee7
|
feat(mge/quantization): make `q_dict` a kwarg rather than an arg
GitOrigin-RevId: 38e3b2bfaf
|
4 years ago |
Megvii Engine Team
|
f8810f733a
|
feat(mge/imperative): prepare to make whl
GitOrigin-RevId: f9f22fb6cb
|
4 years ago |
Megvii Engine Team
|
ff60fdb82d
|
feat(dnn): add bool type cvt on gpu
GitOrigin-RevId: ab0fecf368
|
4 years ago |
Megvii Engine Team
|
e8571cca51
|
fix(mgb/cuda): fix cuda host alloc set device
GitOrigin-RevId: f4756e8981
|
4 years ago |
Megvii Engine Team
|
f7b5eced23
|
refactor(mgb/opr-mm): set False as default value of local_grad
GitOrigin-RevId: 2f9603b087
|
4 years ago |
Megvii Engine Team
|
7a8183f4e0
|
fix(mge/quantization): fix enable observer bug
GitOrigin-RevId: 493cf9dea9
|
4 years ago |
Megvii Engine Team
|
555ecea9bc
|
feat(mge/quantization): add bias fakequant support
GitOrigin-RevId: a5e953b3fa
|
4 years ago |
Megvii Engine Team
|
9440842e27
|
fix(mge/core): fix Tensor deepcopy issue
GitOrigin-RevId: 6bea7970b8
|
4 years ago |
Megvii Engine Team
|
d4b86b844e
|
feat(mge/dtype): add int2 lowbit support and example
GitOrigin-RevId: 67c14ac959
|
4 years ago |
Megvii Engine Team
|
3931099ea7
|
fix(dnn/test): fix nchw_nchw44 i8i8i16 benchmark
GitOrigin-RevId: 6a68030fbf
|
4 years ago |
Megvii Engine Team
|
bcf5691ddf
|
feat(dnn/arm): add nchw_nchw44 i8i8i16 2x2 3x3 5x5 7x7 s1 s2 conv
GitOrigin-RevId: 8ef1541665
|
4 years ago |
Megvii Engine Team
|
c7b6ef35c1
|
feat(dnn/cuda): add warp perspective backward mat idx
GitOrigin-RevId: b4b494bb69
|
5 years ago |
Megvii Engine Team
|
a773d07678
|
feat(dnn/arm_common): add nchw44 8x8x16 channel wise conv
stride1 2x2 3x3 5x5 stride2 2x2 3x3 5x5
GitOrigin-RevId: 43d76311c2
|
4 years ago |
Megvii Engine Team
|
09b5f3d434
|
fix(mgb/core): fix multi thread pool deactive and multi thread conflict
GitOrigin-RevId: 36787a08a5
|
4 years ago |
Megvii Engine Team
|
ef239f835f
|
feat(windows/python_whl): make windows HAPPY for build megbrain python package
GitOrigin-RevId: 92b2c07bf9
|
4 years ago |
Megvii Engine Team
|
bf6cbc1df7
|
build(third_party): fix git apply issue
|
4 years ago |
Megvii Engine Team
|
5eb491c5af
|
Merge pull request #74 from ChaiMind:master
GitOrigin-RevId: 7e7b78b125
|
4 years ago |
Megvii Engine Team
|
b72f1e8258
|
chore(build): cleanup BUILD files
GitOrigin-RevId: cb9ddcea3c
|
4 years ago |
Megvii Engine Team
|
e258812f12
|
feat(dnn): add bool dtype
GitOrigin-RevId: 98c8a092b4
|
4 years ago |
Megvii Engine Team
|
734c498d27
|
perf(mgb/core): improve DevMemAlloc when it has single stream
GitOrigin-RevId: 61874faa6d
|
4 years ago |
Megvii Engine Team
|
39bd66fc63
|
fix(mgb): fix TensorRT missing cudaSetDevice
GitOrigin-RevId: 40eb119e48
|
4 years ago |
Megvii Engine Team
|
ab9dfbcefc
|
test(mgb): fix tensorrt tests missing cudaSetDevice
GitOrigin-RevId: faeb6ae070
|
4 years ago |
Megvii Engine Team
|
b43fb1a97c
|
perf(mgb): add CUDA host memory allocator
test(mgb): add SimpleCachingAlloc test
GitOrigin-RevId: 17f381e4ac
|
4 years ago |
Megvii Engine Team
|
2afceb4187
|
fix(mgb/atlas): use dyn output alloc if enable dynamic batchsize
GitOrigin-RevId: 45a6c6ad51
|
4 years ago |
Megvii Engine Team
|
6bcc6faec8
|
feat(mge/imperative/opr): modify batch_norm to support frozen BN
fix(mge/imperative): cmake uses MGE_BUILD_IMPERATIVE_RT flag
GitOrigin-RevId: 8ea21af9da
|
4 years ago |
Megvii Engine Team
|
7ca3d579db
|
feat(dnn): make mk4 and mk8 matmul for winograd both on aarch64 and armv7 supports n=1
GitOrigin-RevId: 0f64b9f70f
|
4 years ago |
Megvii Engine Team
|
54d18115b6
|
fix(imperative): fix grad of BatchNorm
GitOrigin-RevId: 1e8d8afaf2
|
4 years ago |
Megvii Engine Team
|
80c4705317
|
perf(mgb): use midout in megbrain to reduce binary size
GitOrigin-RevId: ddc8af79af
|
4 years ago |
Megvii Engine Team
|
35c712767d
|
fix(mge/quant): fix TQT epoch scale change bug
GitOrigin-RevId: 6e39de9cec
|
4 years ago |
Megvii Engine Team
|
e6e41242c7
|
fix(mge/quant): fix zero grad warn in TQT train
GitOrigin-RevId: a6545ee366
|
4 years ago |