715 Commits (d52ba79d8906a1a758051f3c3189c7440ecc1adc)

Author SHA1 Message Date
  Megvii Engine Team b9cbc10120 feat(lite): add pack model 3 years ago
  Megvii Engine Team 7927e98fd6 perf(mge): speed up PixelShuffle 3 years ago
  Megvii Engine Team 1c2a323e78 feat(mge): add warning message when mismatched cuda sm is detected 3 years ago
  Megvii Engine Team 877bda4180 perf(mge): improve cross stream memory borrowing 3 years ago
  Megvii Engine Team 484e1f1173 fix(build): fix riscv64 gcc build with > O0 3 years ago
  Megvii Engine Team 14e9ad625d fix(megdnn): emit define-but-not-referenced and extra-;-ignored warning on cuda9.0~cuda9.1 3 years ago
  Megvii Engine Team c2435d1561 perf(imperative): specialize adaptive pooling 3 years ago
  Megvii Engine Team c0b267fff6 refactor(cuda-stub): opt cuda-stub log 3 years ago
  Megvii Engine Team d9c4ef59fe perf(imperative): using simple hash key in heuristic cache 3 years ago
  Megvii Engine Team 3949d425fb feat(core): always show MegEngine version and git commit id 3 years ago
  Megvii Engine Team fd6f8e58b0 feat(mgb/dtype): add dtype qint1 3 years ago
  Megvii Engine Team 5ebc9d50b7 fix(pylite): fix lite global layout transform and fast run conflict error 3 years ago
  Megvii Engine Team 2a900a69cb perf(imperative): improve reduce op performance 3 years ago
  Megvii Engine Team 273c0e8745 fix(autodiff): fix some bugs in relation to 2nd order grad 3 years ago
  Megvii Engine Team d56570d929 fix(megbrain): add rdnn to copybara 3 years ago
  Megvii Engine Team 12a3ef8d01 refactor(fastrun): decouple fastrun from computing graph 3 years ago
  Megvii Engine Team 2b80806f21 perf(imperative/src): improve dot performance 3 years ago
  Megvii Engine Team 1709b3940b perf(mge/functional): speed up Broadcast and Reshape 3 years ago
  Megvii Engine Team 3e206d899b perf(mge/functional): speed up Split 3 years ago
  Megvii Engine Team 8446626193 perf(imperative/src): improve elemwise 3 years ago
  Megvii Engine Team e400b7ffe5 perf(imperative): enable memory forwarding for imperative 3 years ago
  Megvii Engine Team 0cb60d646d feat(imperative): add output_descs for apply_on_physical_tensor 3 years ago
  Megvii Engine Team fea46ea9a4 perf(imperative): add opr cache for apply_on_physical_tensor 4 years ago
  Megvii Engine Team ea4e6ab93a fix(mgb/opr): fix shape cache of NvOF 4 years ago
  Megvii Engine Team 87de704a46 feat(gopt): fuse conv h_swish 3 years ago
  Megvii Engine Team 3726f5cc92 feat(gopt): merger consecutive relayout and dimshuffle to one relayout to optimize CD4 performarce 3 years ago
  Megvii Engine Team 1fead9b6b0 feat(gopt): merge consecutive dimshuffle and relayout to one relayout to optimize CD4 performace 3 years ago
  Megvii Engine Team 26d1e4f7ed feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible 3 years ago
  Megvii Engine Team 5f4501e0f3 fix(gopt): fix conv bias fuse 2 noline 3 years ago
  Megvii Engine Team 7d2063e35a perf(cuda): speedup conv backward data with small feature map and large filter size 3 years ago
  Megvii Engine Team 28d48f2f7a fix(mgb/src): fix megbrain cmake unsupport android_nn 3 years ago
  Megvii Engine Team 187c1dc081 fix(jit): copy aux var when shallow copying JITExecutor 3 years ago
  Megvii Engine Team b6ce02a152 fix(subgraph): fallback back to cg if jit unsupported 3 years ago
  Megvii Engine Team c55fda9a7c fix(fastrun): don't kill profiling worker 3 years ago
  Megvii Engine Team aa587446fc feat(subgraph): support shape inference for CompiledOp 3 years ago
  Megvii Engine Team bdb853ee6f fix(mgb): fix extra device malloc when load MultipleDeviceTensorWithFormatHolder 3 years ago
  Megvii Engine Team e2b79ea00e feat(mgb): reduce the number of trtruntimeopr create contexts 3 years ago
  Megvii Engine Team 95ac055538 feat(dnn,mgb,imperative): add diag opr implement 3 years ago
  Megvii Engine Team cbbca5fb10 feat(mge): add softmax op use cudnn api 3 years ago
  Megvii Engine Team 20b42a8c3b fix(dnn): add naive lstm kernel 3 years ago
  Megvii Engine Team 2faa6ea5a9 Merge pull request #213 from kxz18:rnn 3 years ago
  Megvii Engine Team 85ea882cb5 fix(mgb/ops): immutable tensor support empty storage 3 years ago
  Megvii Engine Team 4b0ecb5deb fix(ops/recv): use std::vector to store shape to support scalar 3 years ago
  Megvii Engine Team f4f20046c4 fix(mgb): fix tensorrt runtimeopr get output var shape bug 3 years ago
  Megvii Engine Team 1999307015 feat(mgb/opr): add dropout kernel 3 years ago
  Megvii Engine Team a93741815b feat(mgb/opr): add layernorm forward and backward kernel 3 years ago
  Megvii Engine Team 1657b8e881 fix(fastrun): fix persistent_cache in redis 3 years ago
  Megvii Engine Team a404cd7d06 fix(mgb/src): add tensorRT version check 3 years ago
  Megvii Engine Team c53cad2049 feat(cmake): format all cmake file 3 years ago
  Megvii Engine Team 6011f51001 style(all): fix clang-format for MGB_DEFINE inside another macro 3 years ago