Megvii Engine Team
ba32360af4
feat(lite): add set opencl buffer kernel cache lite api
GitOrigin-RevId: 0605218238
3 years ago
Megvii Engine Team
711b5bf502
fix(dnn/arm_common): fix some load beyond memory
GitOrigin-RevId: acd6363945
3 years ago
Megvii Engine Team
3ebb8db01a
feat(third_party/cutlass): update to version 2.8
GitOrigin-RevId: 9de584b3b8
3 years ago
Megvii Engine Team
da91e650a5
refactor(ops/layer_norm): speed up the host speed of layer_norm
GitOrigin-RevId: 6f359b5b29
3 years ago
Megvii Engine Team
67cfce9f2f
fix(imperative/amp): add is_scalar check in elemwise and concat
GitOrigin-RevId: 61a612e92a
3 years ago
Megvii Engine Team
d313f92610
fix(imperative/amp): fix format transformation for symbol trans
GitOrigin-RevId: 96cc237c67
3 years ago
Megvii Engine Team
261a5bce23
feat(imperative/amp): add dimshuffle in set_format for nhwc
GitOrigin-RevId: 5ced9e1a31
3 years ago
Megvii Engine Team
c9e56f4987
feat(imperative/amp): add dimshuffle before creating nhwc tensor
GitOrigin-RevId: 4461f9a0d3
3 years ago
Megvii Engine Team
d57a071271
feat(imperative/amp): add fallback for op not supported for nhwc tensor
GitOrigin-RevId: 8411ce7bdc
3 years ago
Megvii Engine Team
38a9aa9faf
feat(imperative/amp): add auto dimshuffle for elemwise and concat
GitOrigin-RevId: 6e3df4e064
3 years ago
Megvii Engine Team
cd26376549
style(imperative/amp): reformat code
GitOrigin-RevId: 6e5a6e1eaf
3 years ago
Megvii Engine Team
3892aa0b6e
fix(imperative/amp): fix bn params for nhwc amp
GitOrigin-RevId: 57a3b9d418
3 years ago
Megvii Engine Team
6f0b582064
chore(imperative/amp): adapt dev
GitOrigin-RevId: 41eb0faadf
3 years ago
Megvii Engine Team
ee984e8608
fix(imperative/amp): fix distributed backward callback for nhwc amp
GitOrigin-RevId: 4d725b0ea4
3 years ago
Megvii Engine Team
15c6da6218
feat(imperative/amp): add nhwc support for adaptive pooling
GitOrigin-RevId: 7c5755308e
3 years ago
Megvii Engine Team
c28a875fac
fix(imperative/amp): adapt new transformation
GitOrigin-RevId: 6edd577a70
3 years ago
Megvii Engine Team
fd41302cc1
feat(imperative/amp): add set_format
GitOrigin-RevId: 91de6f49de
3 years ago
Megvii Engine Team
fc633ce4ff
fix(imperative/amp): fix custom grad in Subgraph
GitOrigin-RevId: 1c728d6ab9
3 years ago
Megvii Engine Team
673b295d75
feat(imperative/amp): remove conv_format and bn param_dim configs
GitOrigin-RevId: 848d34f63d
3 years ago
Megvii Engine Team
7e9aa742e6
feat(imperative/amp): enable auto_convert_format by default
GitOrigin-RevId: 71ae311fed
3 years ago
Megvii Engine Team
fc0f454685
fix(dnn/check_non_finite): adjust some details of CheckNonFinite
GitOrigin-RevId: 52ddd805b4
3 years ago
Megvii Engine Team
3bd40887b6
feat(mgb/opr): add NHWC support for AdaptivePooling
GitOrigin-RevId: b23e37ac23
3 years ago
Megvii Engine Team
e393d1cf65
feat(mge/amp): add convert_format module for NHWC training
GitOrigin-RevId: 1b41e1042c
3 years ago
Megvii Engine Team
533fb5bf49
feat(imperative): support formatted tensor and add special op rules
GitOrigin-RevId: 77ff909f23
3 years ago
Megvii Engine Team
4aa79c453b
perf(mge): override grad of matmul
GitOrigin-RevId: d9d97e70fe
3 years ago
Megvii Engine Team
98b5ee78c1
feat(mge/dnn): add lamb optimizer
GitOrigin-RevId: 5a27157456
3 years ago
Megvii Engine Team
a926878c01
feat(imperative): remove symbolvar of imperative
GitOrigin-RevId: 16da6d1491
3 years ago
Megvii Engine Team
14813d13c0
fix(whl): fix whl broken: patchelf on big (> 4G) file will
make elf section broken, as a workaround, do strip firstly,
then do patchelf.
GitOrigin-RevId: c7fb7e25a6
3 years ago
Megvii Engine Team
9e0583e13a
feat(dnn/arm_common): add arm_common chanwise dot 11x11
GitOrigin-RevId: 84e0815a59
3 years ago
Megvii Engine Team
115bcbce2b
feat(lite): add fitting mode for load and run
GitOrigin-RevId: 8f21fda9d3
3 years ago
Megvii Engine Team
02bfb8f8b9
feat(lite): add and fix some feature for load and run fitting mode
GitOrigin-RevId: bbddc9bb79
3 years ago
Megvii Engine Team
c62ddba238
feat(dnn/opencl): optimize heuristic rule
GitOrigin-RevId: 971c93d926
3 years ago
Xinran Xu
6e83940722
Merge pull request #461 from tpoisonooo/patch-1
docs(README.md): add doc web link
3 years ago
tpoisonooo
91a45d7caa
docs(README.md): add link
3 years ago
huangxinda
d404ed184d
feat(ci): update cpuinfo
3 years ago
Megvii Engine Team
c2500cdb7e
chore(license): apply change caused by bot forward rebase
GitOrigin-RevId: 2707bc03c9
3 years ago
Megvii Engine Team
5f0e7ffb64
feat(fallback): add FB_GI_F32_4x12 benchmark
GitOrigin-RevId: cfacf31b28
3 years ago
Megvii Engine Team
f249d387de
feat(fallback): imp gi matmul FB_GI_F32_4x12 algo
GitOrigin-RevId: 16255e7a72
3 years ago
Megvii Engine Team
03f78547f7
feat(dnn/arm_common): add 9x9s1s2 dot chanwise kernel
GitOrigin-RevId: a28a97fcb5
3 years ago
Megvii Engine Team
80e1f38bea
fix(gtest): fix ci error report stack-use-after-scope
how to reproduce the problem:
1: build with asan(revert this MR)
2: then taskset process to one cpu:
taskset 01 ./megbrain_test --gtest_filter=TestAsyncQueue.SynchronizerWaiterStarving
GitOrigin-RevId: eb6f7aa4d8
3 years ago
Megvii Engine Team
c2e9860feb
chore(license): remove all license in file header
GitOrigin-RevId: a0e31247a6
3 years ago
Megvii Engine Team
38b492727e
fix(opr): fix no update ptr in reduce operator when input change
GitOrigin-RevId: a443a79ac0
3 years ago
Megvii Engine Team
4cce2480d5
fix(dnn/opencl): fix some bug for dnn opencl conv bias and relayout format
GitOrigin-RevId: b5bb07d90d
3 years ago
Megvii Engine Team
ca0e616fb5
refactor(lite): refactor load_and_run profiling message
GitOrigin-RevId: 4676398627
3 years ago
Megvii Engine Team
1783b8977a
feat(profiler): integrate cupti backend
GitOrigin-RevId: dec8be1908
3 years ago
Megvii Engine Team
e98049d77e
feat(fallback): move arm_common resize f32 algo to fallback gi
GitOrigin-RevId: 3370cdc57a
3 years ago
megvii-mge
5b69af2045
Merge pull request #460 from kagome1007/updatereadme
fix(mge): update readme
3 years ago
“wenjuan”
824af20bd8
fix(mge): update readme
3 years ago
Megvii Engine Team
6814cf1cd7
fix(lite): fix lite test error
GitOrigin-RevId: ab608672ec
3 years ago
Megvii Engine Team
7c8f184723
fix(dnn/x86): fix x86 pooling exec
GitOrigin-RevId: cdaa752d7e
3 years ago