692 Commits (release-1.10)

Author SHA1 Message Date
  Megvii Engine Team 8b40f57738 feat(mgb/dnn): add conv1x1 algo for matrix mul 3 years ago
  Megvii Engine Team fb49a2834f refactor(mgb/dnn): refactor enum used in serializing 3 years ago
  Megvii Engine Team d69b59035d feat(dnn): add an get_all_algorithms_safe interface 3 years ago
  Megvii Engine Team 103d7f33ba refactor(dnn/rocm): update hip license header 4 years ago
  Megvii Engine Team 5aa52d3863 feat(dnn/rocm): add adaptive pooling opr 3 years ago
  Megvii Engine Team 83cf4ee64e refactor(dnn/rocm): remove some useless includes 3 years ago
  Megvii Engine Team 323a4642e6 feat(dnn/rocm): add topk opr 3 years ago
  Megvii Engine Team f4784f4af1 feat(dnn/rocm): add argsort opr 3 years ago
  Megvii Engine Team 6082c353e7 feat(dnn/rocm): support bool in type_cvt and elemwise 4 years ago
  Megvii Engine Team 8b94f49328 fix(dnn/cuda): fix elemwise and relayout int4 bug when last shape is 1 3 years ago
  Megvii Engine Team 694aa1bd92 feat(dnn): add heuristic cache 3 years ago
  Megvii Engine Team bc9cfc277a feat(mgb): add arm resize nchwxx and naive nearest interp 3 years ago
  Megvii Engine Team 722aecd437 feat(mgb): support fp16 nhwc backward 3 years ago
  Megvii Engine Team 0708bc780c fix(dnn/cuda): disallow implicit dtype conversion in cublaslt matmul algos 3 years ago
  Megvii Engine Team 1e83ab638e feat(dnn): add channelwise conv for fp16 nchw88 3 years ago
  Megvii Engine Team 7b855dc64a fix(dnn/cuda): fix compilation for windows bazel 3 years ago
  Megvii Engine Team 3abe0b2462 fix(mgb): fix rocm pooling 3 years ago
  Megvii Engine Team 16678bb998 fix(dnn): fix_short_cutlass_name_gemm 3 years ago
  Megvii Engine Team 4c13bc7e1b feat(dnn/cuda): add nhwc int8 deconv 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team a0231a7920 fix(dnn/cuda): fix algo matmul for conv bwd filter 3 years ago
  Megvii Engine Team 56c1b626bf refactor(dnn): move arch-dependant code to arch.h 3 years ago
  Megvii Engine Team 67575d582c feat(mge/opr): add interpolate bilinear mode 3 years ago
  Megvii Engine Team 0558b2123d feat(mge/opr): add interpolate nearest mode 3 years ago
  Megvii Engine Team 127870a926 feat(dnn/opencl): add heuristic rule for batched matmul 3 years ago
  Megvii Engine Team c25125e3d2 perf(dnn/cuda): sass int8 epilogue remove shared load 3 years ago
  Megvii Engine Team 55efc8e197 feat(mgb/gopt): add reformat emitter 4 years ago
  Megvii Engine Team c9d060307f feat(dnn/common): add named tensor shape 4 years ago
  Megvii Engine Team ff0e6be7b9 fix(dnn/cuda): fix cutlass tensorop kernels 3 years ago
  Megvii Engine Team 336761253d feat(dnn/cuda): add tensorcore matmul for fp16 data type 3 years ago
  Megvii Engine Team 2c4ee99227 fix(dnn): short cutlass filename in windows 3 years ago
  Megvii Engine Team 432592374d build(dnn/cuda): fix cmake compile dependency for cutlass kernels 3 years ago
  Megvii Engine Team cc07b96f82 perf(dnn/relayout): disable copy_last_contiguous when contiguous_size is 3 years ago
  Megvii Engine Team d195fdec71 refactor(mgb): refactor has-usable-algo function for global optimizer 3 years ago
  Megvii Engine Team 604bb2a569 feat(mgb/dnn): add int atomic add for megdnn 3 years ago
  Megvii Engine Team eab6afab47 feat(mgb): add padding opr for megbrain 4 years ago
  Megvii Engine Team 66c18f6054 fix(ci): fix bazel compile error in new macos 3 years ago
  Megvii Engine Team c88a4e5b32 fix(mgb): fix get env macro 3 years ago
  Megvii Engine Team 9b4b910dc1 feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers 3 years ago
  Megvii Engine Team b18feaab33 feat(dnn/cuda): use cutlass remove shared load imma conv kernel 4 years ago
  Megvii Engine Team 1af350c6d2 feat(dnn): add fill kernel 3 years ago
  Megvii Engine Team 3eb0505f9b feat(imperative): add support for quantized conv transpose2d 3 years ago
  Megvii Engine Team c68e669530 feat(bazel/windows/xp/sp2/inference): implement inference on windows xp 3 years ago
  Megvii Engine Team 3b452d8c16 feat(mgb): cuda conv support nhwc format and fp16 dtype 3 years ago
  Megvii Engine Team 10bcf75767 feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88 3 years ago
  Megvii Engine Team ddba5c9674 fix(core): fix nr_threads is zero 3 years ago
  Megvii Engine Team 67f117882b perf(arm_common): add elemwise unary multithread support 3 years ago
  Megvii Engine Team 3afa3893d7 perf(arm_common): optimize arm common pooling 9x9 and 13x13 3 years ago
  Megvii Engine Team 2c4ff5431b fix(mgb): fix cudnn ConvolutionBackwardData 3 years ago
  Megvii Engine Team 287cab49c2 fix(mgb/sereg): fix rng operator compatibility 3 years ago