315 Commits (47138c06cf473e8e108aa01ff1bf421c37eeff55)

Author SHA1 Message Date
  Megvii Engine Team 052a600f03 feat(mge/module): add python wrapper for unfold 4 years ago
  Megvii Engine Team 4fa9096d20 feat(megbrain): add correlation opr 4 years ago
  Megvii Engine Team b3aae4bb16 feat(dnn/cuda): add correlation kernel 4 years ago
  Megvii Engine Team af62add02d fix(api_cache): fix serialization for conv_desc 4 years ago
  Megvii Engine Team d0a68c431a fix(api_cache): lock api cache for thread safety 4 years ago
  Megvii Engine Team e8061e6628 perf(cuda/conv): cache several cudnn api 4 years ago
  Megvii Engine Team fe8488bf0a feat(dnn/apicache): add generic apicache 4 years ago
  Megvii Engine Team e5d8e4279b feat(mge): add a tool which can analyze the file generated by compare_binary_iodump.py 4 years ago
  Megvii Engine Team 747b53c018 feat(dnn): add conv_bwd_data and conv_bwd_filter accuracy shake check 4 years ago
  Megvii Engine Team ecb11202f5 fix(build): fix naive build 4 years ago
  Megvii Engine Team 8109a05a5e fix(mgb/dnn): fix cub potential issues 4 years ago
  Megvii Engine Team 3f5238fb38 feat(mgb/dnn): add accuracy shake checker 4 years ago
  Megvii Engine Team be6fb6b7c1 feat(mgb/dnn): add accuracy_depend_on_batch attribute 4 years ago
  Megvii Engine Team 1cadf9d8d7 fix(mgb): add usable-depend-on-shape attr 4 years ago
  Megvii Engine Team 0083f4c4f3 build(rocm): support rocm-3.9 4 years ago
  Megvii Engine Team 928a57f83c build(rocm): partially support hcc compilation 4 years ago
  Megvii Engine Team 100a502764 fix(dnn): replace kernel launch syntax with macro for hcc 4 years ago
  Megvii Engine Team 07ab8cb6b6 feat(dnn): add param_pack for rocm 4 years ago
  Megvii Engine Team 4b2b623b8b fix(dnn/cuda): fix cutlass matmul splitk limit 4 years ago
  Megvii Engine Team ef9aa80074 fix(mgb/dnn): fix cuda naive matmul algo 4 years ago
  Megvii Engine Team 2d18074a70 fix(mgb): fix spell error 4 years ago
  Megvii Engine Team ff755451d2 refactor(mgb): move algo's name from info to desc and delete some algo's unnecessary param() method 4 years ago
  Megvii Engine Team 756c1eb7f2 fix(mgb/dnn): add cuda float naive matmul algo 4 years ago
  Megvii Engine Team 04b1a45af4 fix(dnn): fix cudnn crash when finalize called after cudnn dtor 4 years ago
  Megvii Engine Team c338e876ec refactor(mgb/dnn): add negative attribute for algo 4 years ago
  Megvii Engine Team ec1a99acc2 refactor(mgb/dnn): replace reproducible with attribute 4 years ago
  Megvii Engine Team 0d165399e6 fix(mgb): fix fastrun for imperative 4 years ago
  Megvii Engine Team 94401ce44a chore(dotprod): dotprod is enabled by default on the android platform 4 years ago
  Megvii Engine Team 85b41a90df feat(dnn): add checksum opr and test 4 years ago
  Megvii Engine Team a49f4a66b7 feat(dnn): add indexing_one_hot and indexing_set_one_hot opr 4 years ago
  Megvii Engine Team 2fd3fa8834 feat(cmake): update for enflame cmake compile 4 years ago
  Megvii Engine Team 9f2af2099c feat(mgb): add enflame comp node 4 years ago
  Megvii Engine Team 33da8de12b build(dnn/cuda): split compilation for cutlass wrapper 4 years ago
  Megvii Engine Team 420672beca fix(mgb/dnn): fix x86 matmul midout decl 4 years ago
  Megvii Engine Team b717606989 fix(dnn/cuda): add block size limit for cutlass gemm algo 4 years ago
  Megvii Engine Team 55974e8cf9 feat(log): opt log 4 years ago
  Megvii Engine Team 58c8746e30 fix(opr): fix fast-run error in cuda 4 years ago
  Megvii Engine Team ba2ad46e54 feat(gopt): add deconv nchw4 int8 opt pass, add deconv nchw int8 4 years ago
  Megvii Engine Team 5d350fc843 feat(dnn/cuda): add deconv int8 and fix cutlass conv wrapper based on modified cutlass 2.4 4 years ago
  Megvii Engine Team a3ea1f153c feat(mgb/opr): add fast profile and combined Execution strategy 4 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 4 years ago
  Megvii Engine Team 652ec9f251 fix(mgb/dnn): fix backward computation of tqt 4 years ago
  Megvii Engine Team f2b42bf09e chore(dotprod): add arm dotprod attribute for easy use 4 years ago
  Megvii Engine Team c33a717314 feat(dnn): replace is_reproducible with algo attribute in opencl, cpu, rocm and cuda 4 years ago
  Megvii Engine Team 2de2222e46 feat(dnn/cuda): add cutlass batched gemv kernel for matmul operator 4 years ago
  Megvii Engine Team 973d2a0ac2 feat(dnn/cuda): add cutlass matmul using split k parallel 4 years ago
  Megvii Engine Team 03c921f7c4 feat(dnn/cuda): add cutlass matmul impls 4 years ago
  Megvii Engine Team 5b62acfa01 feat(dnn/armv7): add new matmul strategy k8x8x4 4 years ago
  Megvii Engine Team 9cc732f82d fix(opencl): fix opencl search algo negative stride support 4 years ago
  Megvii Engine Team c69359d00d fix(dnn/cuda): disable cudnn conv_bias kernels for NCHW4_NCHW tensor format 4 years ago