318 Commits (a3ea1f153c8e7f057147ad4d7270f48a51717d28)

Author SHA1 Message Date
  Megvii Engine Team a3ea1f153c feat(mgb/opr): add fast profile and combined Execution strategy 4 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 4 years ago
  Megvii Engine Team 652ec9f251 fix(mgb/dnn): fix backward computation of tqt 4 years ago
  Megvii Engine Team f2b42bf09e chore(dotprod): add arm dotprod attribute for easy use 4 years ago
  Megvii Engine Team c33a717314 feat(dnn): repalce is_reproducible with algo attribute in opencl, cpu, rocm and cuda 4 years ago
  Megvii Engine Team 97beae2fd8 fix(megdnn): fix megdnn benchmark testcase 4 years ago
  Megvii Engine Team 2de2222e46 feat(dnn/cuda): add cutlass batched gemv kernel for matmul operator 4 years ago
  Megvii Engine Team 973d2a0ac2 feat(dnn/cuda): add cutlass matmul using split k parallel 4 years ago
  Megvii Engine Team 03c921f7c4 feat(dnn/cuda): add cutlass matmul impls 4 years ago
  Megvii Engine Team 5b62acfa01 feat(dnn/armv7): add new matmul strategy k8x8x4 4 years ago
  Megvii Engine Team ad87f78a14 chore(imperative): refine tblgen for generating op name 4 years ago
  Megvii Engine Team 9cc732f82d fix(opencl): fix opencl search algo negative stride support 4 years ago
  Megvii Engine Team cf27dd642c fix(cuda): use cudnn8.0.4 as cu111 default libs 4 years ago
  Megvii Engine Team 649e4dd750 test(cuda): fix test for cu111 4 years ago
  Megvii Engine Team c69359d00d fix(dnn/cuda): disable cudnn conv_bias kernels for NCHW4_NCHW tensor format 4 years ago
  Megvii Engine Team 2e4b9a42f7 fix(mgb/gopt): fix folding conv dimshuffle opt pass 4 years ago
  Megvii Engine Team 0e3a6329ff build(cuda): support cu111 build 4 years ago
  Megvii Engine Team e9db061e45 fix(mgb): fix compiling error for cuda-11.1 4 years ago
  Megvii Engine Team cd7090acbb fix(opencl): enable image on mali(cl2.1) 4 years ago
  Megvii Engine Team c51a687cef chore(mge): update copyright years 4 years ago
  Megvii Engine Team af42ce7e69 fix(megdnn): some fixes of execution policy 4 years ago
  Megvii Engine Team 7afa422df4 refactor(megdnn): refactor sub opr setter 4 years ago
  Megvii Engine Team 821656aa4b refactor(megdnn): refactor brute force algo in batched matmul 4 years ago
  Megvii Engine Team 08ff62deb6 refactor(megdnn): refactor batched matmul algo in conv bias 4 years ago
  Megvii Engine Team 8773926ef8 refactor(megdnn): refactor matmul algo in conv bias 4 years ago
  Megvii Engine Team e4b71bdf64 refactor(megdnn): remove unnessary 1x1 algo 4 years ago
  Megvii Engine Team 44c8d2d16f refactor(megdnn): refactor matmul algo in deformable conv 4 years ago
  Megvii Engine Team b04ad06f84 refactor(megdnn): refactor matmul algo in conv backward filter 4 years ago
  Megvii Engine Team 25089e520e refactor(megdnn): refactor matmul algo in conv backward data 4 years ago
  Megvii Engine Team 0d720653ac refactor(megdnn): add default algo for convolution forward 4 years ago
  Megvii Engine Team 659217acd2 refactor(megdnn): refactor bfloat16 convbias to recursive inteface 4 years ago
  Megvii Engine Team 4a1d52c9c6 refactor(megdnn): refactor bfloat16 matmul to recursive inteface 4 years ago
  Megvii Engine Team b8febaf91f refactor(megdnn): refactor bfloat16 convolutionbackwardfilter to recursive inteface 4 years ago
  Megvii Engine Team f14e0c17e7 feat(mgb): add recursive for fastrun and megdnn test 4 years ago
  Megvii Engine Team 85fa988348 refactor(dnn): add get_algorithm_from_desc interface 4 years ago
  Megvii Engine Team 2b8150ab52 fix(dnn): fix bazel build issue for cambricon platform 4 years ago
  Megvii Engine Team 0e8b81c20e fix(dnn/opencl): fix elemwise negative stride support 4 years ago
  Megvii Engine Team 329306b031 fix(cmake/cuda): fix build at cuda `copy` env caused by b278a69e1 4 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 4 years ago
  Megvii Engine Team ae8b38f634 fix(cmake/whl): reduce wheel size 4 years ago
  Megvii Engine Team 3bda334798 fix(dnn/fallback): fix segmentfault caused by im2col/conv1x1 using 4 years ago
  Megvii Engine Team 87ff58f7fc fix(megdnn): add algo for matmul/batchedmatrixmul of naive and opencl 4 years ago
  Megvii Engine Team a3caa5d3b7 fix(mgb(dnn)): fix convbias cudnnConvBiasActivation 4 years ago
  Megvii Engine Team 55042195d4 chore(winograd): add Convolutionv2 param 4 years ago
  Megvii Engine Team 409a877267 feat(dnn): add algo interface for rocm&fallback matmul and batched matrix mul 4 years ago
  Megvii Engine Team 8f7f52ae4d feat(jit): add memfwd in jit executor opr 4 years ago
  Megvii Engine Team dfb2b2ce49 fix(dnn): change pooling window size smaller than padding constraint to log_error 4 years ago
  Megvii Engine Team d1fbec4fe2 feat(dnn/atlas): add atlas stub 4 years ago
  Megvii Engine Team a85531dd0f feat(mgb/opr): add tqt opr 4 years ago
  Megvii Engine Team c3a4b2225d feat(dnn/cuda): add cutlass impls for fused convolution reformat operation 4 years ago