Megvii Engine Team
a83098890b
test(mge/utils): cover all test data
GitOrigin-RevId: e676476b9d
3 years ago
Megvii Engine Team
dd1fecdf29
feat(mge/opr): add cumsum
GitOrigin-RevId: 740f00a8e5
3 years ago
Megvii Engine Team
a0c7e047f5
fix(extern_c_opr): let megbrain throw a exception when load extern c opr fails
GitOrigin-RevId: 16f5ceb8e3
3 years ago
Megvii Engine Team
c68e669530
feat(bazel/windows/xp/sp2/inference): implement inference on windows xp
(os vesion >= sp2) build with bazel
* bazel build support(define __DEPLOY_ON_XP_SP2__ when deploy on xp sp2):
(dbg)./bazel build //brain/megbrain:load_and_run --cpu='x86_windows_xp'
--compiler='clang_cl' -c dbg --copt "-D__DEPLOY_ON_XP_SP2__=1"
(opt)./bazel build //brain/megbrain:load_and_run --cpu='x86_windows_xp'
--compiler='clang_cl' -c opt --copt "-D__DEPLOY_ON_XP_SP2__=1"
* internal behavior:
will define MGB_HAVE_THREAD=0 when enable __DEPLOY_ON_XP_SP2__
* refer to
https://docs.microsoft.com/en-us/cpp/build/configuring-programs-for-windows-xp?view=msvc-160
xp sp2(x86) do not support vc runtime fully, casused by KERNEL32.dll do not
implement some base apis for c++ std function, for example,
std::mutex/std::thread/std::condition_variable as a workround, we will
disable some MegEngine features on xp sp2 env, for exampe, multi-thread etc!
* about DNN_MUTEX/MGB_MUTEX, if your code will build in inference
code (even CPU backends), please replace std::mutex to DNN_MUTEX/MGB_MUTEX,
* about multi-thread, if you code need multi-thread support, please
enable it when MGB_HAVE_THREAD=1
* about test build env status
1: Visual Studio 2019(MSVC version <= 14.26.28801)---- pass
2: Visual Studio 2019(MSVC version > 14.26.28801) ---- failed
caused by this 'new' version will put VCR depends on win7
KERNEL32.DLL, this may be fixed at Visual Studio 2019 later version
but we do not test at this MR merge point
3: Visual Studio 2017 ---------- pass
4: Visual Studio 2014 ---------- pass
GitOrigin-RevId: 65ac48b95e
3 years ago
Megvii Engine Team
125406e318
feat(cmake/windows/cuda): upgrade windows llvm to
lastest 12.0.1, for fit cutlass upgrade
GitOrigin-RevId: 23b9d779df
3 years ago
huangxinda
b8ea63927e
ci(mge): merge try-import automatically
3 years ago
Megvii Engine Team
3b452d8c16
feat(mgb): cuda conv support nhwc format and fp16 dtype
GitOrigin-RevId: b8ddcd108a
3 years ago
Megvii Engine Team
10bcf75767
feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88
GitOrigin-RevId: 613a18dd91
3 years ago
Megvii Engine Team
ddba5c9674
fix(core): fix nr_threads is zero
GitOrigin-RevId: 0ccbe3c69b
3 years ago
Megvii Engine Team
67f117882b
perf(arm_common): add elemwise unary multithread support
GitOrigin-RevId: 8eac123f67
3 years ago
Megvii Engine Team
3afa3893d7
perf(arm_common): optimize arm common pooling 9x9 and 13x13
GitOrigin-RevId: 33d5a62478
3 years ago
Megvii Engine Team
d16c5caf6b
fix(mge/dump): fix dump device error with const
GitOrigin-RevId: 9dd8321fd7
3 years ago
Megvii Engine Team
2c4ff5431b
fix(mgb): fix cudnn ConvolutionBackwardData
GitOrigin-RevId: 1fffc06eaa
3 years ago
Megvii Engine Team
7138e4fd02
feat(docs): add docs for megengine.functional.full
GitOrigin-RevId: 8428a0c9a9
3 years ago
Megvii Engine Team
0b4a767965
feat(mge/distributed): enable uint8 for collective communication
GitOrigin-RevId: 3305c0cf14
3 years ago
Megvii Engine Team
a22b2cf473
ci(copybara): add config files and fix format script
GitOrigin-RevId: 9bd9a7d66c
3 years ago
Megvii Engine Team
287cab49c2
fix(mgb/sereg): fix rng operator compatibility
GitOrigin-RevId: 66d1694035
3 years ago
Megvii Engine Team
e3fc783642
fix(mgb/opr): fix nvof shape error
fix(mgb/opr): fix format error
fix(mgb/opr): update nvof opr to 1.3 fix (compared to 2.0)
fix(mgb/opr): add try catch to solve test error
GitOrigin-RevId: b688745b36
3 years ago
Megvii Engine Team
3f3a256e0f
fix(mge/functional): fix conv* dtype promotion
GitOrigin-RevId: 3f03790cfc
3 years ago
Megvii Engine Team
536506c3f4
feat(functional): let interpolate support more modes
GitOrigin-RevId: 9693a1ac63
3 years ago
Megvii Engine Team
d811dc5478
docs(mge/distributed): add document for distributed.backend
GitOrigin-RevId: 6cdcf7af77
3 years ago
Megvii Engine Team
9526ee521b
docs(distributed.functional): add return type for all_reduce_min
GitOrigin-RevId: 9f734902fe
3 years ago
Megvii Engine Team
2aba0378b9
refactor(mgb/dnn): fix group conv is_available
GitOrigin-RevId: b279909168
3 years ago
Megvii Engine Team
4a92346b7a
refactor(mgb): refactor group conv3d
GitOrigin-RevId: 15360a3a41
3 years ago
Megvii Engine Team
6ce212d2e0
refactor(mgb): refactor group conv
GitOrigin-RevId: 7afd312690
4 years ago
XindaH
febd0b1798
ci(fix): fail when git user name or email is empty
3 years ago
Megvii Engine Team
eb2dd018d9
build(fp16): fix fp16 build
GitOrigin-RevId: a20ebfd110
3 years ago
Megvii Engine Team
f76a2cc2c6
feat(mge/opr): add silu and gelu
GitOrigin-RevId: 75aa42947e
3 years ago
Megvii Engine Team
f2ac4c345b
docs(distributed.functional.all_reduce_sum): googlestring and examples
GitOrigin-RevId: a456dfde24
3 years ago
Megvii Engine Team
186bacfb71
fix(mge): recover bn freeze fastpath execution
GitOrigin-RevId: 13e58c9fba
3 years ago
Megvii Engine Team
5f558042b2
fix(imperative/ops): use tblgen to generate FastpathCopy
GitOrigin-RevId: b02157dacb
3 years ago
Megvii Engine Team
bfc4e7a966
docs(mge): fix amp docstring problems
GitOrigin-RevId: e5540eb940
3 years ago
Megvii Engine Team
0b764cf2d2
docs(mge/functional): add docs for megengine.functional.full_like
GitOrigin-RevId: 391447e977
3 years ago
Megvii Engine Team
f141159088
refactor(mge): loose the error bound of fastrun
GitOrigin-RevId: 9bf9b9d4ca
3 years ago
Megvii Engine Team
1f0436967c
refactor(mge/distributed): using nccl as default in distributed training
GitOrigin-RevId: 81268e84bc
3 years ago
Megvii Engine Team
b17a02d44a
feat(mge/distributed): deprecate get_device_count_by_fork
GitOrigin-RevId: 6147c3ae90
3 years ago
Megvii Engine Team
f8b0f2cb91
build(dnn/cutlass): fix build for cutlass
GitOrigin-RevId: 9aa095fe84
3 years ago
konghuanjun
0fb4e9a9ca
fix(ci): git set user and email
4 years ago
huangxinda
6af4a32e17
feat(mge/third_party): update MegRay version
3 years ago
huangxinda
093f7ae774
feat(mge/third_party): update cutlass version
3 years ago
Megvii Engine Team
c2daea3cba
chore(release): bump version
GitOrigin-RevId: a016ea9d56
3 years ago
Megvii Engine Team
207a346351
chore(mge): run get_device_count("gpu") in subprocess
GitOrigin-RevId: 0f0dc001cf
4 years ago
Megvii Engine Team
869a03271b
perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance
GitOrigin-RevId: d1b95a6f01
3 years ago
liuke
0baf6b0d63
Merge pull request #175 from tpoisonooo:fix-spell-error
GitOrigin-RevId: fe649676ac
3 years ago
Megvii Engine Team
239916a997
fix(mgb/gopt): fix testcase for enable nchw64 pass
GitOrigin-RevId: 2ae8d1608d
4 years ago
Megvii Engine Team
2ab5c53f1d
feat(mgb/gopt): support nhwc conv in tensor reformat pass
GitOrigin-RevId: 43e78d758a
4 years ago
Megvii Engine Team
009c90a2fe
feat(mgb/gopt): modify padding policy for 4bit conv bias oprs
GitOrigin-RevId: 188a2c3728
4 years ago
Megvii Engine Team
4eda338876
feat(dnn/cuda): generate cutlass kimpls using cmake and bazel
GitOrigin-RevId: da3bcfb85a
4 years ago
Megvii Engine Team
8d248a6a9a
fix(dnn/cuda): fix testcase for fallback nchw qs8 conv
GitOrigin-RevId: 646440db59
4 years ago
Megvii Engine Team
894a2407c2
feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc
GitOrigin-RevId: e11f3e5408
4 years ago