Megvii Engine Team
|
5f558042b2
|
fix(imperative/ops): use tblgen to generate FastpathCopy
GitOrigin-RevId: b02157dacb
|
3 years ago |
Megvii Engine Team
|
bfc4e7a966
|
docs(mge): fix amp docstring problems
GitOrigin-RevId: e5540eb940
|
3 years ago |
Megvii Engine Team
|
0b764cf2d2
|
docs(mge/functional): add docs for megengine.functional.full_like
GitOrigin-RevId: 391447e977
|
3 years ago |
Megvii Engine Team
|
f141159088
|
refactor(mge): loose the error bound of fastrun
GitOrigin-RevId: 9bf9b9d4ca
|
3 years ago |
Megvii Engine Team
|
1f0436967c
|
refactor(mge/distributed): using nccl as default in distributed training
GitOrigin-RevId: 81268e84bc
|
3 years ago |
Megvii Engine Team
|
b17a02d44a
|
feat(mge/distributed): deprecate get_device_count_by_fork
GitOrigin-RevId: 6147c3ae90
|
3 years ago |
Megvii Engine Team
|
f8b0f2cb91
|
build(dnn/cutlass): fix build for cutlass
GitOrigin-RevId: 9aa095fe84
|
3 years ago |
konghuanjun
|
0fb4e9a9ca
|
fix(ci): git set user and email
|
4 years ago |
huangxinda
|
6af4a32e17
|
feat(mge/third_party): update MegRay version
|
3 years ago |
huangxinda
|
093f7ae774
|
feat(mge/third_party): update cutlass version
|
3 years ago |
Megvii Engine Team
|
c2daea3cba
|
chore(release): bump version
GitOrigin-RevId: a016ea9d56
|
3 years ago |
Megvii Engine Team
|
207a346351
|
chore(mge): run get_device_count("gpu") in subprocess
GitOrigin-RevId: 0f0dc001cf
|
4 years ago |
Megvii Engine Team
|
869a03271b
|
perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance
GitOrigin-RevId: d1b95a6f01
|
3 years ago |
liuke
|
0baf6b0d63
|
Merge pull request #175 from tpoisonooo:fix-spell-error
GitOrigin-RevId: fe649676ac
|
3 years ago |
Megvii Engine Team
|
239916a997
|
fix(mgb/gopt): fix testcase for enable nchw64 pass
GitOrigin-RevId: 2ae8d1608d
|
4 years ago |
Megvii Engine Team
|
2ab5c53f1d
|
feat(mgb/gopt): support nhwc conv in tensor reformat pass
GitOrigin-RevId: 43e78d758a
|
4 years ago |
Megvii Engine Team
|
009c90a2fe
|
feat(mgb/gopt): modify padding policy for 4bit conv bias oprs
GitOrigin-RevId: 188a2c3728
|
4 years ago |
Megvii Engine Team
|
4eda338876
|
feat(dnn/cuda): generate cutlass kimpls using cmake and bazel
GitOrigin-RevId: da3bcfb85a
|
4 years ago |
Megvii Engine Team
|
8d248a6a9a
|
fix(dnn/cuda): fix testcase for fallback nchw qs8 conv
GitOrigin-RevId: 646440db59
|
4 years ago |
Megvii Engine Team
|
894a2407c2
|
feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc
GitOrigin-RevId: e11f3e5408
|
4 years ago |
Megvii Engine Team
|
43c59204df
|
refactor(dnn/cuda): refactor relayout format kernels
GitOrigin-RevId: ab86e66533
|
4 years ago |
Megvii Engine Team
|
f41a808694
|
feat(dnn/cuda): add nhwc int4 conv support
GitOrigin-RevId: 5236b235d0
|
4 years ago |
Megvii Engine Team
|
5a14a89224
|
refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv
GitOrigin-RevId: 11d78ab227
|
4 years ago |
Megvii Engine Team
|
b33217d8f0
|
refactor(dnn/cuda): refactor cutlass kernel generator for deconv operation
GitOrigin-RevId: 88e962a912
|
4 years ago |
Megvii Engine Team
|
4abf7bd36f
|
refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels
GitOrigin-RevId: 7882f9c68c
|
4 years ago |
Megvii Engine Team
|
b4687ce8da
|
feat(dnn/cuda): add convolution with i8 input and u4 output
GitOrigin-RevId: 8be439abf1
|
4 years ago |
Megvii Engine Team
|
00083d13b6
|
fix(dnn/cuda): fix recursive algo search for fallback_nchw_qs8
GitOrigin-RevId: 6be2991224
|
4 years ago |
Megvii Engine Team
|
bba04f02e5
|
feat(mgb/gopt): add fusion support for conv, astype(s4) and reformat
GitOrigin-RevId: 6329ca2c5f
|
4 years ago |
Megvii Engine Team
|
66f70578c2
|
feat(dnn/cuda): add convolution with i8 input and i4 output
GitOrigin-RevId: 10512645d5
|
4 years ago |
Megvii Engine Team
|
6d686ff26f
|
feat(gopt/inference): allow Float32 output dtype in EnableNCHW64Pass
GitOrigin-RevId: 1891efb76f
|
4 years ago |
Megvii Engine Team
|
7d3df995cb
|
feat(gopt/inference): allow Float32 output dtype in EnableNCHW4Pass
GitOrigin-RevId: 81100dbaf7
|
4 years ago |
Megvii Engine Team
|
633016a962
|
fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst
GitOrigin-RevId: 06f90f5cf3
|
4 years ago |
Megvii Engine Team
|
e6caa9ff89
|
feat(opr): add bn backward for inference mode
GitOrigin-RevId: bb643cb62f
|
4 years ago |
Xinda Huang
|
c90fa087ea
|
test(mge): delete test_external.py
|
3 years ago |
Megvii Engine Team
|
b2944559a8
|
fix(imperative/module): remove ``__getattribute__`` method in module
GitOrigin-RevId: 5ac525f010
|
4 years ago |
Megvii Engine Team
|
77ead9377b
|
fix(src/serialization): fix compatibility error of oss model
GitOrigin-RevId: 43e0fa4fe1
|
3 years ago |
Megvii Engine Team
|
070c811732
|
fix(imperative): remove convert_inputs
GitOrigin-RevId: a3c43db746
|
3 years ago |
Megvii Engine Team
|
f40df60242
|
docs(mge): refactor docs to remove warnings
GitOrigin-RevId: efefc2a4a2
|
3 years ago |
Megvii Engine Team
|
1040b77843
|
fix(mge/functional): fix F.topk(kth_only=True)
GitOrigin-RevId: ddecd1d14b
|
4 years ago |
Megvii Engine Team
|
551cc701c6
|
docs(distributed.functional): add return type for all_reduce_max (jira #MGE-2706)
GitOrigin-RevId: a29f1f1880
|
3 years ago |
Megvii Engine Team
|
72ff7aeccb
|
feat(docs): add docs for megengine.functional.ones_like(jira #MGE-2702)
GitOrigin-RevId: e808fc9e4e
|
3 years ago |
Megvii Engine Team
|
7c9569e4e5
|
fix(mge/random): fix random seed
GitOrigin-RevId: 121f459b1b
|
3 years ago |
Megvii Engine Team
|
07de15713c
|
fix(mgb): remove static mem record from tee
GitOrigin-RevId: ac61b2a5eb
|
4 years ago |
Megvii Engine Team
|
d7b6bfd56c
|
test(mge/fakequant): use fixed input for lsq test to temperarily avoid precision error
GitOrigin-RevId: e91c71874e
|
3 years ago |
Megvii Engine Team
|
5cef74a77e
|
feat(mge/amp): add GradScaler support
GitOrigin-RevId: 0ab4910360
|
4 years ago |
Megvii Engine Team
|
1bf18252c4
|
feat(mge/amp): add mix precision autocast support
GitOrigin-RevId: 6fbffc4845
|
4 years ago |
Megvii Engine Team
|
f12355f727
|
fix(imperative/grad): fix hardcode dtype in subtensor_grad_rule
GitOrigin-RevId: 50da4af26d
|
4 years ago |
Megvii Engine Team
|
4e4497b903
|
refactor(mgb/dnn): x86 pooling rebase algochooser
GitOrigin-RevId: 96cdc57180
|
3 years ago |
Megvii Engine Team
|
a33c3b73bd
|
refactor(mgb/dnn): arm pooling rebase algochooser
GitOrigin-RevId: 21d17e647a
|
3 years ago |
Megvii Engine Team
|
8dea6b3c68
|
build(dnn): compat for more windows env
GitOrigin-RevId: 5ec4be2888
|
3 years ago |