2268 Commits (v1.9.0)
 

Author SHA1 Message Date
  Megvii Engine Team 150a6a6151 perf(dispatch/trace): remove unnecessary h2d for constant 3 years ago
  Megvii Engine Team 81d8c73a41 perf(dispatch/trace): serval tricks to speed up trace 3 years ago
  Megvii Engine Team 4fa6162027 perf(dispatch): improve performance of dispatch system 3 years ago
  Megvii Engine Team ca00177719 perf(dispatch): speed up dispatch system 3 years ago
  Megvii Engine Team 187c1dc081 fix(jit): copy aux var when shallow copying JITExecutor 3 years ago
  Megvii Engine Team 7bd848ce04 fix(subgraph): fix hand-written backward for serval jit-elemwise ops 3 years ago
  Megvii Engine Team 7be7656c9f fix(imperative): explicitly manage global structures 3 years ago
  Megvii Engine Team 62034fb262 fix(imperative): make CompNode finalize happens before global object destructor 3 years ago
  Megvii Engine Team 59cbf9583d fix(subgraph): use CompiledOp in cpu to avoid workspace error 3 years ago
  Megvii Engine Team b6ce02a152 fix(subgraph): fallback back to cg if jit unsupported 3 years ago
  Megvii Engine Team 21f5a7fcc0 fix(subgraph): fix device recognition and scalar propagate 3 years ago
  Megvii Engine Team 27346b0b65 test(opr): add scalar check for opr_test 3 years ago
  Megvii Engine Team 225045236b perf(imperative): improve shape inference 3 years ago
  Megvii Engine Team df3474ca1d perf(functional): rewrite serval elemwise ops with jit subgraph 3 years ago
  Megvii Engine Team c55fda9a7c fix(fastrun): don't kill profiling worker 3 years ago
  Megvii Engine Team 2775f4580c feat(subgraph): subgraph builder supports jit and custom grad 3 years ago
  Megvii Engine Team 3c61e0e02a feat(ops): add JITFusion op 3 years ago
  Megvii Engine Team aa587446fc feat(subgraph): support shape inference for CompiledOp 3 years ago
  Megvii Engine Team 1c1e9b002d fix(rng): init layout strides 3 years ago
  Megvii Engine Team 9527859cc8 feat(opcache): add ndim and has_value to cache key 3 years ago
  Megvii Engine Team cbb47089a6 perf(interpreter): add fastpath for GetVarShape 3 years ago
  Megvii Engine Team b458178847 feat(opr): add mutable tensor opr 3 years ago
  Megvii Engine Team 47fe766310 feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
  Megvii Engine Team dcc9693582 feat(dnn/cuda): add heuristic rule for implicit batched gemm large kernel dwconv2d kernels 3 years ago
  Megvii Engine Team 6cefabe734 fix(dnn/cuda): fix ci 3 years ago
  Megvii Engine Team 888f4e46ae feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels 3 years ago
  Megvii Engine Team 08d8635ff5 feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl 3 years ago
  Megvii Engine Team 93ceb80ad2 refactor(imperative): fix broadcast,reshape,reduce 3 years ago
  Megvii Engine Team d919aaebc7 test(imperative): reopen special interpolate test and sync when test rng 3 years ago
  Megvii Engine Team ca2deebc0f fix(imperative/tensor): make @ operator has the same functionality as matmul functional 3 years ago
  Megvii Engine Team e860a08386 refactor(mge/indexing): move indexing into c++ 3 years ago
  Megvii Engine Team e6706be23a refactor(imperative): remove infer_output_mem_desc 3 years ago
  Megvii Engine Team a5af35c18c refactor(imperative): remove command buffer 3 years ago
  Megvii Engine Team bdb853ee6f fix(mgb): fix extra device malloc when load MultipleDeviceTensorWithFormatHolder 3 years ago
  Megvii Engine Team 406115dba0 fix(imperative): syncbn fp16 support 3 years ago
  Megvii Engine Team d5ef792309 perf(lite): optimized lite tensor get data by share 3 years ago
  huangxinda ce9ad07a27 feat(ci): update ci and readme 3 years ago
  Megvii Engine Team 884865703d test(trace): test subtensor on unknown shape 3 years ago
  Megvii Engine Team c34a75d0f4 fix(trace): assume result is not scalar when shape is valid 3 years ago
  Megvii Engine Team bebb2cf4c3 Merge pull request #428 from P2Oileen:fix-pad 3 years ago
  Megvii Engine Team e2b79ea00e feat(mgb): reduce the number of trtruntimeopr create contexts 3 years ago
  Megvii Engine Team 6157d9cfef fix(traced_module): fix Module compatible issue and traced module getattr check 3 years ago
  Megvii Engine Team 26b52a61de feat(lite): add get model infomation before create network interface 3 years ago
  Megvii Engine Team 5e17b3e4c6 Merge pull request #426 from Qsingle:fix-pixel_suffle 3 years ago
  Megvii Engine Team 2bebe80e93 fix(imperative): fix the default pickle protocol version of save 3 years ago
  Xinran Xu f02cd2d28b
Merge pull request #436 from bealwang/master 3 years ago
  王彪 df4153dc71 docs(readme): add more badges 3 years ago
  XindaH ea91babbce
Merge pull request #435 from MegEngine/try-import 3 years ago
  Megvii Engine Team 8e94af9d78 Merge pull request #400 from jieli-matrix:docstring-svd 3 years ago
  Megvii Engine Team 260923e11c perf(aarch64): optimize aarch64 uint16 relayout with block_w==3 3 years ago