Megvii Engine Team
|
e64536a31e
|
fix(imperative): fix the dtype promotion problem under amp
GitOrigin-RevId: 43e1035fc8
|
3 years ago |
Megvii Engine Team
|
2b80806f21
|
perf(imperative/src): improve dot performance
GitOrigin-RevId: 35b5bd164f
|
3 years ago |
Megvii Engine Team
|
2f3bc2db9d
|
perf(mge/utils): move astensor1d into C++
GitOrigin-RevId: e7c6659020
|
3 years ago |
Megvii Engine Team
|
fa62f6c06e
|
perf(mge/utils): move convert_input into C++
GitOrigin-RevId: 0d1cd36251
|
3 years ago |
Megvii Engine Team
|
d98be08030
|
perf(mge): move Const into C++
GitOrigin-RevId: 31a443cffd
|
3 years ago |
Megvii Engine Team
|
1709b3940b
|
perf(mge/functional): speed up Broadcast and Reshape
GitOrigin-RevId: a72f5460b6
|
3 years ago |
Megvii Engine Team
|
0f736a0ab4
|
perf(mge/functional): speed up Dimshuffle
GitOrigin-RevId: 8160c9522b
|
3 years ago |
Megvii Engine Team
|
3e5e08b0b4
|
perf(mge/functional): speed up RemoveAxis
GitOrigin-RevId: 9c5d27fe1d
|
3 years ago |
Megvii Engine Team
|
a4d473c99a
|
perf(mge/functional): speed up AddAxis
GitOrigin-RevId: 92a3e1bdd3
|
3 years ago |
Megvii Engine Team
|
3e206d899b
|
perf(mge/functional): speed up Split
GitOrigin-RevId: 43550a0706
|
3 years ago |
Megvii Engine Team
|
730ddc2d81
|
perf(interpreter): improve interpreter performance
GitOrigin-RevId: 88f51d15f8
|
3 years ago |
Megvii Engine Team
|
729242f9f8
|
refactor(imperative): move typecvt code of several ops to c++
GitOrigin-RevId: 4ffaa376c1
|
3 years ago |
Megvii Engine Team
|
3c3fc6f33c
|
refactor(imperative): move python code of elemwise/reduce/conv2d/bn to c++
GitOrigin-RevId: 01b5324392
|
3 years ago |
Megvii Engine Team
|
8446626193
|
perf(imperative/src): improve elemwise
GitOrigin-RevId: 78aa487277
|
3 years ago |
Megvii Engine Team
|
e400b7ffe5
|
perf(imperative): enable memory forwarding for imperative
GitOrigin-RevId: 7c1993979c
|
3 years ago |
Megvii Engine Team
|
84d1a440f0
|
fix(imperative): do not use output_desc in rng ops
GitOrigin-RevId: e6a399be17
|
3 years ago |
Megvii Engine Team
|
1ce78aa09b
|
fix(imperative): destruct dnn handles at last
GitOrigin-RevId: 7a67c68c55
|
3 years ago |
Megvii Engine Team
|
0cb60d646d
|
feat(imperative): add output_descs for apply_on_physical_tensor
GitOrigin-RevId: 5b036c2c5a
|
3 years ago |
Megvii Engine Team
|
c7ded2fe2f
|
refactor(imperative): remove unnecessary reserve in small vector
GitOrigin-RevId: 85c30bc828
|
3 years ago |
Megvii Engine Team
|
8c2b916ef5
|
refactor(imperative): remove some methods in proxy graph
GitOrigin-RevId: 1fb68a1da2
|
3 years ago |
Megvii Engine Team
|
2348a963f2
|
refactor(imperative): apply workspace limit hook to mini graph
GitOrigin-RevId: 27c51f3147
|
3 years ago |
Megvii Engine Team
|
fea46ea9a4
|
perf(imperative): add opr cache for apply_on_physical_tensor
GitOrigin-RevId: fc5d5fb34d
|
4 years ago |
Megvii Engine Team
|
ea4e6ab93a
|
fix(mgb/opr): fix shape cache of NvOF
GitOrigin-RevId: 456ba478e9
|
4 years ago |
Megvii Engine Team
|
3228fb75a5
|
fix(cuda): conv algo heuristic choose
GitOrigin-RevId: 95c5e7d627
|
3 years ago |
Megvii Engine Team
|
8c415f4ed7
|
feat(dnn): cuda nhwc nearest resize supports channel counts other than 1 or 3
GitOrigin-RevId: 764504c341
|
3 years ago |
Megvii Engine Team
|
0447574446
|
feat(opencl): add OpenCL cache compat level api
GitOrigin-RevId: e7561e6879
|
3 years ago |
Megvii Engine Team
|
6fb5a34360
|
build(flatbuffer/cx2): fix cx2 build and fix uclibc build flatbuffer
GitOrigin-RevId: af851e155f
|
3 years ago |
Megvii Engine Team
|
87de704a46
|
feat(gopt): fuse conv h_swish
GitOrigin-RevId: a3d12991fb
|
3 years ago |
Megvii Engine Team
|
4adba37867
|
feat(lite): add example script and some small change for lar
GitOrigin-RevId: a28ed2f27a
|
3 years ago |
Megvii Engine Team
|
87f00232f2
|
fix(mge/gm): fix missing dtype checking while attach tensors
GitOrigin-RevId: f0aaea99b9
|
3 years ago |
Megvii Engine Team
|
3726f5cc92
|
feat(gopt): merge consecutive relayout and dimshuffle into one relayout to optimize CD4 performance
GitOrigin-RevId: a058776be3
|
3 years ago |
Megvii Engine Team
|
1fead9b6b0
|
feat(gopt): merge consecutive dimshuffle and relayout into one relayout to optimize CD4 performance
GitOrigin-RevId: 16f22baa80
|
3 years ago |
Megvii Engine Team
|
26d1e4f7ed
|
feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible
GitOrigin-RevId: 6580dedca7
|
3 years ago |
Megvii Engine Team
|
ac26bdcef5
|
fix(cuda): fix direct conv speed and memory problem
GitOrigin-RevId: 6faeeff3b8
|
3 years ago |
Megvii Engine Team
|
f7994683bd
|
feat(cuda): add large kernel direct conv to heuristic algo chooser
GitOrigin-RevId: bc927b6df7
|
3 years ago |
Megvii Engine Team
|
6dc0c0b9cc
|
fix(dnn): fix the sync problem in some kernels
GitOrigin-RevId: df3f7dc51b
|
3 years ago |
Megvii Engine Team
|
04193e3bd1
|
feat(dnn): add nearest mode for remap and resize
GitOrigin-RevId: 31e7b72a78
|
3 years ago |
Megvii Engine Team
|
69b89388e8
|
docs(mge/functional): fix debug_param set_execution_strategy docstring
GitOrigin-RevId: 434929c998
|
3 years ago |
Megvii Engine Team
|
93c7e45188
|
feat(arm): delete the redundant implementation
GitOrigin-RevId: ff32a3dc8b
|
3 years ago |
Megvii Engine Team
|
e34a642b31
|
feat(fallback): reduce support general intrinsic
GitOrigin-RevId: f250aa7b2a
|
3 years ago |
Megvii Engine Team
|
10f23778a8
|
feat(fallback): add simd general intrinsic
GitOrigin-RevId: ad78ba689f
|
3 years ago |
Megvii Engine Team
|
286051ede1
|
feat(dnn): differentiate sass kernel with cuda version
GitOrigin-RevId: 40bb4423b8
|
3 years ago |
Megvii Engine Team
|
f78b60ec10
|
feat(bazel): make bazel gensass depend on cuda toolchain version automatically
GitOrigin-RevId: 9433f21a91
|
3 years ago |
Megvii Engine Team
|
f48227c07d
|
feat(mgb): show more details for cuda driver api call
GitOrigin-RevId: 40e63d9dac
|
3 years ago |
Megvii Engine Team
|
bb5af9b475
|
feat(lite): hide lar gflags symbols for static linking
GitOrigin-RevId: 28823da644
|
3 years ago |
Megvii Engine Team
|
d8bb3ff5b4
|
fix(cuda): fix fp16 tensorcore gemm split k workspace
GitOrigin-RevId: d04a0e0985
|
3 years ago |
Megvii Engine Team
|
597efed40b
|
feat(lite): add get last error code interface in lite c
GitOrigin-RevId: 280cc88092
|
3 years ago |
Megvii Engine Team
|
90c8a58cca
|
docs(docstring): add pad docstring
GitOrigin-RevId: eaf6a87456
|
3 years ago |
Megvii Engine Team
|
5f4501e0f3
|
fix(gopt): fix conv bias fuse 2 noline
GitOrigin-RevId: a6ab9f4e5e
|
3 years ago |
Megvii Engine Team
|
ac2f548c9a
|
docs(imperative/dataloader): update preload description
GitOrigin-RevId: 523618b1e1
|
3 years ago |