228 Commits (1999307015a5035d3d3f0c34bfcf5cb3d7bfac02)

Author SHA1 Message Date
  Megvii Engine Team 1999307015 feat(mgb/opr): add dropout kernel 3 years ago
  Megvii Engine Team a93741815b feat(mgb/opr): add layernorm forward and backward kernel 3 years ago
  Megvii Engine Team a404cd7d06 fix(mgb/src): add tensorRT version check 3 years ago
  Megvii Engine Team 2881934cb8 feat(dnn/check_non_finite): addmul scale to check_non_finite opr 3 years ago
  Megvii Engine Team 0d16952470 fix(mgb/cuda): fix conv error when the input tensor is too large 3 years ago
  Megvii Engine Team 2696e4efaa feat(dnn): add float16 for remap backward 3 years ago
  Megvii Engine Team 1f0cc891b0 feat(dnn): enable eye to support bool 3 years ago
  Megvii Engine Team 11d75fecb5 feat(dnn/check_non_finite): add batch check_non_finite 3 years ago
  Megvii Engine Team 2d54ad185b feat(lite): add global layout transform interface for load and run 3 years ago
  Megvii Engine Team ba2f0c2e48 fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases 3 years ago
  Megvii Engine Team c85631aa77 feat(dnn): use ref ptr interface for all backends 3 years ago
  Megvii Engine Team 89186edc5d fix(dnn): correct reduce/argmxx/fakequant calculation with nan 3 years ago
  Megvii Engine Team 68cdabd288 feat(opr): indexing_multi_axis_vec support nd index 3 years ago
  Megvii Engine Team 9b4cd92ba3 fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256 3 years ago
  Megvii Engine Team 849f0ece9d fix(dnn): drop batched matmul cublas algo when batch is 1 3 years ago
  Megvii Engine Team 5af52746f7 fix(mgb): fix bug caused by conv filter size is too big 3 years ago
  Megvii Engine Team 10af44abba fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout 3 years ago
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 3 years ago
  Megvii Engine Team bfb30dcb81 chore(format): fix compile bugs after code format 3 years ago
  Megvii Engine Team eeccf2bc0d ci(check): add clang-format in check stage 3 years ago
  Megvii Engine Team 177dec94c5 feat(mgb/opr): add bgr2gray mode for cvtcolor opr 3 years ago
  Megvii Engine Team f5cb21ed3a fix(mgb/opr): add non finite check 3 years ago
  Megvii Engine Team fca195351c feat(gopt): add nhwc fuse conv typecvt optpass 3 years ago
  Megvii Engine Team 2fc7358517 Revert "feat(dnn/apicache): add generic apicache" 3 years ago
  Megvii Engine Team de363c04af Revert "perf(cuda/conv): cache serval cudnn api" 3 years ago
  Megvii Engine Team 729ee64988 Revert "fix(api_cache): lock api cache for thread safety" 3 years ago
  Megvii Engine Team 64c922c4bb Revert "fix(api_cache): fix serialization for conv_desc" 3 years ago
  Megvii Engine Team b3e54eade1 feat(dnn/bn): use new cudnn BN kernel to support NHWC 3 years ago
  Megvii Engine Team 3977b7aa0b feat(mgb/shuffle): add shuffle opr 3 years ago
  Megvii Engine Team eca6e1d931 fix(ci): fixes for ci 3 years ago
  Megvii Engine Team 8b40f57738 feat(mgb/dnn): add conv1x1 algo for matrix mul 3 years ago
  Megvii Engine Team d69b59035d feat(dnn): add an get_all_algorithms_safe interface 3 years ago
  Megvii Engine Team 8b94f49328 fix(dnn/cuda): fix elemwise and relayout int4 bug when last shape is 1 3 years ago
  Megvii Engine Team 694aa1bd92 feat(dnn): add heuristic cache 3 years ago
  Megvii Engine Team 722aecd437 feat(mgb): support fp16 nhwc backward 3 years ago
  Megvii Engine Team 0708bc780c fix(dnn/cuda): disallow implicit dtype conversion in cublaslt matmul algos 3 years ago
  Megvii Engine Team 7b855dc64a fix(dnn/cuda): fix compilation for windows bazel 3 years ago
  Megvii Engine Team 4c13bc7e1b feat(dnn/cuda): add nhwc int8 deconv 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team a0231a7920 fix(dnn/cuda): fix algo matmul for conv bwd filter 3 years ago
  Megvii Engine Team 67575d582c feat(mge/opr): add interpolate bilinear mode 3 years ago
  Megvii Engine Team 0558b2123d feat(mge/opr): add interpolate nearest mode 3 years ago
  Megvii Engine Team ff0e6be7b9 fix(dnn/cuda): fix cutlass tensorop kernels 3 years ago
  Megvii Engine Team 336761253d feat(dnn/cuda): add tensorcore matmul for fp16 data type 3 years ago
  Megvii Engine Team cc07b96f82 perf(dnn/relayout): disable copy_last_contiguous when contiguous_size is 3 years ago
  Megvii Engine Team d195fdec71 refactor(mgb): refactor has-usable-algo function for global optimizer 3 years ago
  Megvii Engine Team 604bb2a569 feat(mgb/dnn): add int atomic add for megdnn 3 years ago
  Megvii Engine Team eab6afab47 feat(mgb): add padding opr for megbrain 4 years ago
  Megvii Engine Team 9b4b910dc1 feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers 3 years ago
  Megvii Engine Team b18feaab33 feat(dnn/cuda): use cutlass remove shared load imma conv kernel 4 years ago