2402 Commits (7582157a3b18f92b53ff26f8d83d3856784dac93)
 

Author SHA1 Message Date
  zhanghaolong 7582157a3b chore(deps): change flatbuffers repo from google to MegEngine 3 years ago
  Megvii Engine Team 8fa9a8defa fix(imperative): fix dot-op implement 3 years ago
  Megvii Engine Team 6c413ba943 refactor(mge): refactor physical tensor 3 years ago
  Megvii Engine Team d56570d929 fix(megbrain): add rdnn to copybara 3 years ago
  Megvii Engine Team 7de1bb11ab fix(mge/utils): disable memory forwarding for subgraph 3 years ago
  Megvii Engine Team b7c9361f81 perf(mge/functional): add infer_output_attrs_fallible for some ops 3 years ago
  Megvii Engine Team a4327c4d25 perf(imperative): add dim_expansion transform for conv/bn1d 3 years ago
  Megvii Engine Team 72a70dd6a7 perf(imperative): specialize convolution implementation 3 years ago
  Megvii Engine Team 12a3ef8d01 refactor(fastrun): decouple fastrun from computing graph 3 years ago
  Megvii Engine Team 0a6f4a880e fix(mge/dtr): fix dtr problem 3 years ago
  Megvii Engine Team 529b394f9c fix(imperative): fix profiler problem 3 years ago
  Megvii Engine Team e64536a31e fix(imperative): fix the dtype promote problem when amp 3 years ago
  Megvii Engine Team 2b80806f21 perf(imperative/src): improve dot performance 3 years ago
  Megvii Engine Team 2f3bc2db9d perf(mge/utils): move astensor1d into C++ 3 years ago
  Megvii Engine Team fa62f6c06e perf(mge/utils): move convert_input into C++ 3 years ago
  Megvii Engine Team d98be08030 perf(mge): move Const into C++ 3 years ago
  Megvii Engine Team 1709b3940b perf(mge/functional): speed up Broadcast and Reshape 3 years ago
  Megvii Engine Team 0f736a0ab4 perf(mge/functional): speed up Dimshuffle 3 years ago
  Megvii Engine Team 3e5e08b0b4 perf(mge/functional): speed up RemoveAxis 3 years ago
  Megvii Engine Team a4d473c99a perf(mge/functional): speed up AddAxis 3 years ago
  Megvii Engine Team 3e206d899b perf(mge/functional): speed up Split 3 years ago
  Megvii Engine Team 730ddc2d81 perf(interpreter): improve interpreter performance 3 years ago
  Megvii Engine Team 729242f9f8 refactor(imperative): move typecvt code of several ops to c++ 3 years ago
  Megvii Engine Team 3c3fc6f33c refactor(imperative): move python code of elemwise/reduce/conv2d/bn to c++ 3 years ago
  Megvii Engine Team 8446626193 perf(imperative/src): improve elemwise 3 years ago
  Megvii Engine Team e400b7ffe5 perf(imperative): enable memory forwarding for imperative 3 years ago
  Megvii Engine Team 84d1a440f0 fix(imperative): do not use output_desc in rng ops 3 years ago
  Megvii Engine Team 1ce78aa09b fix(imperative): destruct dnn handles at last 3 years ago
  Megvii Engine Team 0cb60d646d feat(imperative): add output_descs for apply_on_physical_tensor 3 years ago
  Megvii Engine Team c7ded2fe2f refactor(imperative): remove unnecessary reserve in small vector 3 years ago
  Megvii Engine Team 8c2b916ef5 refactor(imperative): remove some methods in proxy graph 3 years ago
  Megvii Engine Team 2348a963f2 refactor(imperative): apply workspace limit hook to mini graph 3 years ago
  Megvii Engine Team fea46ea9a4 perf(imperative): add opr cache for apply_on_physical_tensor 4 years ago
  Megvii Engine Team ea4e6ab93a fix(mgb/opr): fix shape cache of NvOF 4 years ago
  Megvii Engine Team 3228fb75a5 fix(cuda): conv algo heuristic choose 3 years ago
  Megvii Engine Team 8c415f4ed7 feat(dnn): cuda nhwc nearest resize support not 1 or 3 channel 3 years ago
  Megvii Engine Team 0447574446 feat(opencl): add OpenCL cache compat level api 3 years ago
  Megvii Engine Team 6fb5a34360 build(flatbuffer/cx2): fix cx2 build and fix uclibc build flatbuffer 3 years ago
  Megvii Engine Team 87de704a46 feat(gopt): fuse conv h_swish 3 years ago
  Megvii Engine Team 4adba37867 feat(lite): add example script and some small change for lar 3 years ago
  Megvii Engine Team 87f00232f2 fix(mge/gm): fix missing dtype checking while attach tensors 3 years ago
  Megvii Engine Team 3726f5cc92 feat(gopt): merge consecutive relayout and dimshuffle to one relayout to optimize CD4 performance 3 years ago
  Megvii Engine Team 1fead9b6b0 feat(gopt): merge consecutive dimshuffle and relayout to one relayout to optimize CD4 performance 3 years ago
  Megvii Engine Team 26d1e4f7ed feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible 3 years ago
  Megvii Engine Team ac26bdcef5 fix(cuda): fix direct conv speed and memory problem 3 years ago
  Megvii Engine Team f7994683bd feat(cuda): add large kernel direct conv to heuristic algo chooser 3 years ago
  Megvii Engine Team 6dc0c0b9cc fix(dnn): fix the sync problem in some kernels 3 years ago
  Megvii Engine Team 04193e3bd1 feat(dnn): add nearest mode for remap and resize 3 years ago
  Megvii Engine Team 69b89388e8 docs(mge/functional): fix debug_param set_execution_strategy docstring 3 years ago
  Megvii Engine Team 93c7e45188 feat(arm): delete the redundant implementation 3 years ago