Megvii Engine Team
50faabf614
feat(serialization): support the registry for new serialization format
GitOrigin-RevId: 8eacd5e77c
3 years ago
Megvii Engine Team
a694fb3304
feat(serialization): implement the new serialization format
GitOrigin-RevId: 00f87f7ccd
3 years ago
Megvii Engine Team
ca4a5da013
feat(serialization): add new serialization format define
GitOrigin-RevId: 177bfdd56b
3 years ago
Megvii Engine Team
3bd40887b6
feat(mgb/opr): add NHWC support for AdaptivePooling
GitOrigin-RevId: b23e37ac23
3 years ago
Megvii Engine Team
533fb5bf49
feat(imperative): support formatted tensor and add special op rules
GitOrigin-RevId: 77ff909f23
3 years ago
Megvii Engine Team
98b5ee78c1
feat(mge/dnn): add lamb optimizer
GitOrigin-RevId: 5a27157456
3 years ago
Megvii Engine Team
02bfb8f8b9
feat(lite): add and fix some feature for load and run fitting mode
GitOrigin-RevId: bbddc9bb79
3 years ago
Megvii Engine Team
80e1f38bea
fix(gtest): fix ci error report stack-use-after-scope
how to reproduce the problem:
1: build with asan(revert this MR)
2: then taskset process to one cpu:
taskset 01 ./megbrain_test --gtest_filter=TestAsyncQueue.SynchronizerWaiterStarving
GitOrigin-RevId: eb6f7aa4d8
3 years ago
Megvii Engine Team
c2e9860feb
chore(license): remove all license in file header
GitOrigin-RevId: a0e31247a6
3 years ago
Megvii Engine Team
38b492727e
fix(opr): fix no update ptr in reduce operator when input change
GitOrigin-RevId: a443a79ac0
3 years ago
Megvii Engine Team
4cce2480d5
fix(dnn/opencl): fix some bug for dnn opencl conv bias and relayout format
GitOrigin-RevId: b5bb07d90d
3 years ago
Megvii Engine Team
1783b8977a
feat(profiler): integrate cupti backend
GitOrigin-RevId: dec8be1908
3 years ago
Megvii Engine Team
bde2efa3b5
feat(lite/load_and_run): support put and get model redis cache
GitOrigin-RevId: 55c82e28c1
3 years ago
Megvii Engine Team
48526abb79
fix(mgb): fix concat cd4 tensor check size invalid
GitOrigin-RevId: 065e0b4be0
3 years ago
Megvii Engine Team
c87d998e59
feat(mgb): add interface to support opencl IO zero copy when inference
GitOrigin-RevId: a1d7021892
3 years ago
Megvii Engine Team
a0e531180d
fix(src/comp_node): fix calling cuda driver api
GitOrigin-RevId: cc33af2ac4
3 years ago
Megvii Engine Team
ccea0e2386
fix(dnn/rdnn): add warmup before profile
GitOrigin-RevId: 7962525e90
3 years ago
Megvii Engine Team
8182af6eb6
fix(mgb): fix strategy of grad_op and opr_attr
GitOrigin-RevId: bb7ab8fa9d
3 years ago
Megvii Engine Team
e2f5156b69
refactor(megbrain): save fastrun result to algorithm cache
GitOrigin-RevId: 45301ebb4d
3 years ago
Megvii Engine Team
f902ba2433
docs(megbrain): add notes for fastrun
GitOrigin-RevId: b59f7f205d
3 years ago
Megvii Engine Team
7dc347697a
feat(dnn/cuda): add typecvt uint16
GitOrigin-RevId: d1368c414e
3 years ago
Megvii Engine Team
b92866d2c2
fix(build): fix build depends dirty file issue
GitOrigin-RevId: 435d8b5c50
3 years ago
Megvii Engine Team
27d4c4b36c
refactor(stats): use static inline variable declaration
GitOrigin-RevId: 7d86e5f257
3 years ago
Megvii Engine Team
787a22a9d6
perf(tensor): implement __new__ in cpp
GitOrigin-RevId: 4defd249c3
3 years ago
Megvii Engine Team
99df4a7996
fix(dtype): dtype scalar set_retain_dtype supports bool
GitOrigin-RevId: aafd378e1b
3 years ago
Megvii Engine Team
7bf5b0ee1e
test(imperative): check env values after each pytest
GitOrigin-RevId: 826788113a
3 years ago
Megvii Engine Team
b3f79966fd
fix(mgb): fix "TRT_ERROR: INVALID_ARGUMENT: Get binding data type failed."
GitOrigin-RevId: d9601cb15b
3 years ago
Megvii Engine Team
409c988163
fix(imperative): add matmul apply_on_varnode
GitOrigin-RevId: 2cf6bf237c
3 years ago
Megvii Engine Team
b9cbc10120
feat(lite): add pack model
GitOrigin-RevId: 1a150f2af3
3 years ago
Megvii Engine Team
7927e98fd6
perf(mge): speed up PixelShuffle
GitOrigin-RevId: 942e755745
3 years ago
Megvii Engine Team
1c2a323e78
feat(mge): add warning message when mismatched cuda sm is detected
GitOrigin-RevId: f78c79eb06
3 years ago
Megvii Engine Team
877bda4180
perf(mge): improve cross stream memory borrowing
GitOrigin-RevId: c68977c5dc
3 years ago
Megvii Engine Team
484e1f1173
fix(build): fix riscv64 gcc build with > O0
GitOrigin-RevId: 9ad3480492
3 years ago
Megvii Engine Team
14e9ad625d
fix(megdnn): emit define-but-not-referenced and extra-;-ignored warning on cuda9.0~cuda9.1
GitOrigin-RevId: f6db42e395
3 years ago
Megvii Engine Team
c2435d1561
perf(imperative): specialize adaptive pooling
GitOrigin-RevId: 01e1418458
3 years ago
Megvii Engine Team
c0b267fff6
refactor(cuda-stub): opt cuda-stub log
GitOrigin-RevId: 87dda08e1b
3 years ago
Megvii Engine Team
d9c4ef59fe
perf(imperative): using simple hash key in heuristic cache
GitOrigin-RevId: 6fddd612e7
3 years ago
Megvii Engine Team
3949d425fb
feat(core): always show MegEngine version and git commit id
GitOrigin-RevId: 4daa5be6d6
3 years ago
Megvii Engine Team
fd6f8e58b0
feat(mgb/dtype): add dtype qint1
GitOrigin-RevId: abe9fb68b1
3 years ago
Megvii Engine Team
5ebc9d50b7
fix(pylite): fix lite global layout transform and fast run conflict error
GitOrigin-RevId: 910c8da19f
3 years ago
Megvii Engine Team
2a900a69cb
perf(imperative): improve reduce op performance
GitOrigin-RevId: 26d982a7b8
3 years ago
Megvii Engine Team
273c0e8745
fix(autodiff): fix some bugs in relation to 2nd order grad
1. implement double backward for batchnorm
2. fix grad attach in nested grad manager
3. pad empty tensor for unsatisfied output_has_grad
4. support double backward for jit subgraph
5. support double backward for autodiff.Function
6. readd debug flag MGE_LOG_OP_DISPATCH
GitOrigin-RevId: cd31ddc620
3 years ago
Megvii Engine Team
d56570d929
fix(megbrain): add rdnn to copybara
GitOrigin-RevId: 7d8bf77053
3 years ago
Megvii Engine Team
12a3ef8d01
refactor(fastrun): decouple fastrun from computing graph
GitOrigin-RevId: 27abd22295
3 years ago
Megvii Engine Team
2b80806f21
perf(imperative/src): improve dot performance
GitOrigin-RevId: 35b5bd164f
3 years ago
Megvii Engine Team
1709b3940b
perf(mge/functional): speed up Broadcast and Reshape
GitOrigin-RevId: a72f5460b6
3 years ago
Megvii Engine Team
3e206d899b
perf(mge/functional): speed up Split
GitOrigin-RevId: 43550a0706
3 years ago
Megvii Engine Team
8446626193
perf(imperative/src): improve elemwise
GitOrigin-RevId: 78aa487277
3 years ago
Megvii Engine Team
e400b7ffe5
perf(imperative): enable memory forwarding for imperative
GitOrigin-RevId: 7c1993979c
3 years ago
Megvii Engine Team
0cb60d646d
feat(imperative): add output_descs for apply_on_physical_tensor
GitOrigin-RevId: 5b036c2c5a
3 years ago