Xu Xinran
5985392297
chore(release): bump version
3 years ago
Megvii Engine Team
10bcf75767
feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88
GitOrigin-RevId: 613a18dd91
3 years ago
Megvii Engine Team
ddba5c9674
fix(core): fix nr_threads is zero
GitOrigin-RevId: 0ccbe3c69b
3 years ago
Megvii Engine Team
67f117882b
perf(arm_common): add elemwise unary multithread support
GitOrigin-RevId: 8eac123f67
3 years ago
Megvii Engine Team
3afa3893d7
perf(arm_common): optimize arm common pooling 9x9 and 13x13
GitOrigin-RevId: 33d5a62478
3 years ago
Megvii Engine Team
d16c5caf6b
fix(mge/dump): fix dump device error with const
GitOrigin-RevId: 9dd8321fd7
3 years ago
Megvii Engine Team
2c4ff5431b
fix(mgb): fix cudnn ConvolutionBackwardData
GitOrigin-RevId: 1fffc06eaa
3 years ago
Megvii Engine Team
7138e4fd02
feat(docs): add docs for megengine.functional.full
GitOrigin-RevId: 8428a0c9a9
3 years ago
Megvii Engine Team
0b4a767965
feat(mge/distributed): enable uint8 for collective communication
GitOrigin-RevId: 3305c0cf14
3 years ago
Megvii Engine Team
a22b2cf473
ci(copybara): add config files and fix format script
GitOrigin-RevId: 9bd9a7d66c
3 years ago
Megvii Engine Team
287cab49c2
fix(mgb/sereg): fix rng operator compatibility
GitOrigin-RevId: 66d1694035
3 years ago
Megvii Engine Team
e3fc783642
fix(mgb/opr): fix nvof shape error
fix(mgb/opr): fix format error
fix(mgb/opr): update nvof opr to 1.3 fix (compared to 2.0)
fix(mgb/opr): add try catch to solve test error
GitOrigin-RevId: b688745b36
3 years ago
Megvii Engine Team
3f3a256e0f
fix(mge/functional): fix conv* dtype promotion
GitOrigin-RevId: 3f03790cfc
3 years ago
Megvii Engine Team
536506c3f4
feat(functional): let interpolate support more modes
GitOrigin-RevId: 9693a1ac63
3 years ago
Megvii Engine Team
d811dc5478
docs(mge/distributed): add document for distributed.backend
GitOrigin-RevId: 6cdcf7af77
3 years ago
Megvii Engine Team
9526ee521b
docs(distributed.functional): add return type for all_reduce_min
GitOrigin-RevId: 9f734902fe
3 years ago
Megvii Engine Team
2aba0378b9
refactor(mgb/dnn): fix group conv is_available
GitOrigin-RevId: b279909168
3 years ago
Megvii Engine Team
4a92346b7a
refactor(mgb): refactor group conv3d
GitOrigin-RevId: 15360a3a41
3 years ago
Megvii Engine Team
6ce212d2e0
refactor(mgb): refactor group conv
GitOrigin-RevId: 7afd312690
4 years ago
XindaH
febd0b1798
ci(fix): fail when git user name or email is empty
3 years ago
Megvii Engine Team
eb2dd018d9
build(fp16): fix fp16 build
GitOrigin-RevId: a20ebfd110
3 years ago
Megvii Engine Team
f76a2cc2c6
feat(mge/opr): add silu and gelu
GitOrigin-RevId: 75aa42947e
3 years ago
Megvii Engine Team
f2ac4c345b
docs(distributed.functional.all_reduce_sum): googlestring and examples
GitOrigin-RevId: a456dfde24
3 years ago
Megvii Engine Team
186bacfb71
fix(mge): recover bn freeze fastpath execution
GitOrigin-RevId: 13e58c9fba
3 years ago
Megvii Engine Team
5f558042b2
fix(imperative/ops): use tblgen to generate FastpathCopy
GitOrigin-RevId: b02157dacb
3 years ago
Megvii Engine Team
bfc4e7a966
docs(mge): fix amp docstring problems
GitOrigin-RevId: e5540eb940
3 years ago
Megvii Engine Team
0b764cf2d2
docs(mge/functional): add docs for megengine.functional.full_like
GitOrigin-RevId: 391447e977
3 years ago
Megvii Engine Team
f141159088
refactor(mge): loose the error bound of fastrun
GitOrigin-RevId: 9bf9b9d4ca
3 years ago
Megvii Engine Team
1f0436967c
refactor(mge/distributed): using nccl as default in distributed training
GitOrigin-RevId: 81268e84bc
3 years ago
Megvii Engine Team
b17a02d44a
feat(mge/distributed): deprecate get_device_count_by_fork
GitOrigin-RevId: 6147c3ae90
3 years ago
Megvii Engine Team
f8b0f2cb91
build(dnn/cutlass): fix build for cutlass
GitOrigin-RevId: 9aa095fe84
3 years ago
konghuanjun
0fb4e9a9ca
fix(ci): git set user and email
4 years ago
huangxinda
6af4a32e17
feat(mge/third_party): update MegRay version
3 years ago
huangxinda
093f7ae774
feat(mge/third_party): update cutlass version
3 years ago
Megvii Engine Team
c2daea3cba
chore(release): bump version
GitOrigin-RevId: a016ea9d56
3 years ago
Megvii Engine Team
207a346351
chore(mge): run get_device_count("gpu") in subprocess
GitOrigin-RevId: 0f0dc001cf
4 years ago
Megvii Engine Team
869a03271b
perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance
GitOrigin-RevId: d1b95a6f01
3 years ago
liuke
0baf6b0d63
Merge pull request #175 from tpoisonooo:fix-spell-error
GitOrigin-RevId: fe649676ac
3 years ago
Megvii Engine Team
239916a997
fix(mgb/gopt): fix testcase for enable nchw64 pass
GitOrigin-RevId: 2ae8d1608d
4 years ago
Megvii Engine Team
2ab5c53f1d
feat(mgb/gopt): support nhwc conv in tensor reformat pass
GitOrigin-RevId: 43e78d758a
4 years ago
Megvii Engine Team
009c90a2fe
feat(mgb/gopt): modify padding policy for 4bit conv bias oprs
GitOrigin-RevId: 188a2c3728
4 years ago
Megvii Engine Team
4eda338876
feat(dnn/cuda): generate cutlass kimpls using cmake and bazel
GitOrigin-RevId: da3bcfb85a
4 years ago
Megvii Engine Team
8d248a6a9a
fix(dnn/cuda): fix testcase for fallback nchw qs8 conv
GitOrigin-RevId: 646440db59
4 years ago
Megvii Engine Team
894a2407c2
feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc
GitOrigin-RevId: e11f3e5408
4 years ago
Megvii Engine Team
43c59204df
refactor(dnn/cuda): refactor relayout format kernels
GitOrigin-RevId: ab86e66533
4 years ago
Megvii Engine Team
f41a808694
feat(dnn/cuda): add nhwc int4 conv support
GitOrigin-RevId: 5236b235d0
4 years ago
Megvii Engine Team
5a14a89224
refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv
GitOrigin-RevId: 11d78ab227
4 years ago
Megvii Engine Team
b33217d8f0
refactor(dnn/cuda): refactor cutlass kernel generator for deconv operation
GitOrigin-RevId: 88e962a912
4 years ago
Megvii Engine Team
4abf7bd36f
refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels
GitOrigin-RevId: 7882f9c68c
4 years ago
Megvii Engine Team
b4687ce8da
feat(dnn/cuda): add convolution with i8 input and u4 output
GitOrigin-RevId: 8be439abf1
4 years ago