279 Commits (ae8b38f634434504f79fc05cca7917062e871a44)

Author SHA1 Message Date
  Megvii Engine Team ae8b38f634 fix(cmake/whl): reduce wheel size 4 years ago
  Megvii Engine Team 3bda334798 fix(dnn/fallback): fix segmentfault caused by im2col/conv1x1 using 4 years ago
  Megvii Engine Team 87ff58f7fc fix(megdnn): add algo for matmul/batchedmatrixmul of naive and opencl 4 years ago
  Megvii Engine Team a3caa5d3b7 fix(mgb(dnn)): fix convbias cudnnConvBiasActivation 4 years ago
  Megvii Engine Team 55042195d4 chore(winograd): add Convolutionv2 param 4 years ago
  Megvii Engine Team 409a877267 feat(dnn): add algo interface for rocm&fallback matmul and batched matrix mul 4 years ago
  Megvii Engine Team 8f7f52ae4d feat(jit): add memfwd in jit executor opr 4 years ago
  Megvii Engine Team dfb2b2ce49 fix(dnn): change pooling window size smaller than padding constraint to log_error 4 years ago
  Megvii Engine Team d1fbec4fe2 feat(dnn/atlas): add atlas stub 4 years ago
  Megvii Engine Team a85531dd0f feat(mgb/opr): add tqt opr 4 years ago
  Megvii Engine Team c3a4b2225d feat(dnn/cuda): add cutlass impls for fused convolution reformat operation 4 years ago
  Megvii Engine Team 5f44203d7b feat(dnn/cuda): add a cutlass impl for fusing convolution and dimshuffle 4 years ago
  Megvii Engine Team 61f917fb8e feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle 4 years ago
  Megvii Engine Team eb826422c4 fix(dnn): forbid pooling window size smaller than padding 4 years ago
  Megvii Engine Team fc0fcd2f7f chore(winograd): remove winograd transform code 4 years ago
  Megvii Engine Team d1adc9a22f fix(dnn): fix opencl algo search 4 years ago
  Megvii Engine Team 7e2b2dbffc fix(dnn/test): delete large size in ARM_COMMON.FP32_GEVM 4 years ago
  Megvii Engine Team 69e3e32240 feat(imperative): auto generated opdef header and python binding 4 years ago
  Megvii Engine Team 0398a7867f fix(build/windows/cuda/llvm): fix windows bazel build with cuda 4 years ago
  Megvii Engine Team 3bf73ff16f feat(dnn): add cuda preprocess fusion 4 years ago
  Megvii Engine Team 86cf7490ec feat(dnn/aarch64): add quantizeds4 matmul int4x4x16_k8x8x8 4 years ago
  Megvii Engine Team 142f31a875 perf(dnn/cuda): change conv_bias heu, prefer dnn chanwise impl, dislike dnn batch gemm conv1x1 4 years ago
  Megvii Engine Team f214e14695 refactor(mgb/cuda): use single implementation of get_device_prop from utils 4 years ago
  Megvii Engine Team 54e79dd1d9 perf(mgb/cuda): do not call cudaGetDeviceProperties to avoid io traffic 4 years ago
  Megvii Engine Team 98a74e4a7b refactor(dnn): refactor opr proxy in test 4 years ago
  Megvii Engine Team 7066ad5ba6 feat(dnn): add uint16 support 4 years ago
  Megvii Engine Team a1877ee0fa refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm 4 years ago
  Megvii Engine Team 6f5d0febf1 perf(dnn/cuda): enhance performance for pooling forward 4 years ago
  Megvii Engine Team 0560a218af chore(dnn/test): refactor megdnn arm_common test 4 years ago
  Megvii Engine Team 6856ce9ce2 feat(dnn): support conv bias activation for nchw4 input tensor format and nchw output tensor format 4 years ago
  Megvii Engine Team 60c6d59fc9 feat(mbg/core): support bias preprocess in conv_bias 4 years ago
  Megvii Engine Team ff8ef9eda7 docs(dnn): add comments of weight prerpocess interface 4 years ago
  Megvii Engine Team 1f75c7ade4 ci(midout): fix midout and reopen midout test 4 years ago
  Megvii Engine Team 1e71e0afe0 refactor(dnn): refactor deconv algo 4 years ago
  Megvii Engine Team 89ad33aeb3 feat(dnn/cuda): support weight preprocessing for cutlass algorithms 4 years ago
  Megvii Engine Team c03249c059 feat(dnn/opr): add megdnn fake quant opr 4 years ago
  Megvii Engine Team 739f927c4c feat(dnn/cuda): opt dp4a conv for small channel base on cutlass 4 years ago
  Megvii Engine Team 1f8e40753f fix(mkl): fix windows mkl LOG compute exception 4 years ago
  Megvii Engine Team 4aa277a203 refactor(dnn/cuda): misc 4 years ago
  Megvii Engine Team f7b2bdae1a refactor(dnn): refactor algorithm type interface 4 years ago
  Megvii Engine Team 18ec5341f2 refactor(dnn): remove unused costmodel in cuda 4 years ago
  Megvii Engine Team e39f938662 refactor(dnn): remove ProfileCache and matmul algo in x86 4 years ago
  Megvii Engine Team 89303cd829 feat(megdnn/rocm): add bn for rocm backend 4 years ago
  Megvii Engine Team aea829c9fa feat(megdnn/rocm): add average inclusive mode for pooling 4 years ago
  Megvii Engine Team 1217801133 perf(mge): add opdef for broadcast 4 years ago
  Megvii Engine Team 2a3f4d099a refactor(dnn/arm): refactor CPU heuristic algo selection 4 years ago
  Megvii Engine Team ba66e1d039 feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct 4 years ago
  Megvii Engine Team a9f98e9c66 refactor(meg/internal): move interal codes back to megbrain 4 years ago
  Megvii Engine Team 44b27f0d6e build(3516): fix some cpu flags build failed and fix 3516 ycm 4 years ago
  Megvii Engine Team 8764a6c8ff feat(dnn/cuda): add volta dp4a int8 sass kernel 4 years ago