Megvii Engine Team
eca6e1d931
fix(ci): fixes for ci
GitOrigin-RevId: b0a432bd2e
3 years ago
Megvii Engine Team
c14e5719f8
feat(mgb/gopt): add profile impl for global layout transform pass
GitOrigin-RevId: 8ef62baf79
3 years ago
Megvii Engine Team
8a3eb05a1b
refactor(mgb/gopt): refactor tensor reformat opt pass
GitOrigin-RevId: a1b1e89b76
3 years ago
Megvii Engine Team
c33126ab5c
feat(mgb/gopt): add reformat manager
GitOrigin-RevId: b9791b131a
3 years ago
Megvii Engine Team
4f28e14684
fix(dnn): fix compatibility broken of convolution format
GitOrigin-RevId: c17f02d03d
3 years ago
Megvii Engine Team
8b40f57738
feat(mgb/dnn): add conv1x1 algo for matrix mul
GitOrigin-RevId: 585b2c045a
3 years ago
Megvii Engine Team
fb49a2834f
refactor(mgb/dnn): refactor enum used in serializing
GitOrigin-RevId: e57af4a59c
3 years ago
Megvii Engine Team
d69b59035d
feat(dnn): add an get_all_algorithms_safe interface
GitOrigin-RevId: e3734e4531
3 years ago
Megvii Engine Team
103d7f33ba
refactor(dnn/rocm): update hip license header
GitOrigin-RevId: 79d684755d
4 years ago
Megvii Engine Team
5aa52d3863
feat(dnn/rocm): add adaptive pooling opr
GitOrigin-RevId: e844b3e770
3 years ago
Megvii Engine Team
83cf4ee64e
refactor(dnn/rocm): remove some useless includes
GitOrigin-RevId: 3d2c315a36
3 years ago
Megvii Engine Team
323a4642e6
feat(dnn/rocm): add topk opr
GitOrigin-RevId: 5ecb079854
3 years ago
Megvii Engine Team
f4784f4af1
feat(dnn/rocm): add argsort opr
GitOrigin-RevId: b4c3eb4707
3 years ago
Megvii Engine Team
6082c353e7
feat(dnn/rocm): support bool in type_cvt and elemwise
GitOrigin-RevId: ad5ec7bc1c
4 years ago
Megvii Engine Team
8b94f49328
fix(dnn/cuda): fix elemwise and relayout int4 bug when last shape is 1
GitOrigin-RevId: e7d64c4987
3 years ago
Megvii Engine Team
694aa1bd92
feat(dnn): add heuristic cache
GitOrigin-RevId: 35e942b5e3
3 years ago
Megvii Engine Team
bc9cfc277a
feat(mgb): add arm resize nchwxx and naive nearest interp
GitOrigin-RevId: d5fbd59a30
3 years ago
Megvii Engine Team
722aecd437
feat(mgb): support fp16 nhwc backward
GitOrigin-RevId: 954ac6405a
3 years ago
Megvii Engine Team
0708bc780c
fix(dnn/cuda): disallow implicit dtype conversion in cublaslt matmul algos
disable tensor op matmul kernels when input and output tensors are in f32 data type to avoid potential accuracy loss
GitOrigin-RevId: 36859cba5a
3 years ago
Megvii Engine Team
1e83ab638e
feat(dnn): add channelwise conv for fp16 nchw88
GitOrigin-RevId: 1bb64f82c5
3 years ago
Megvii Engine Team
7b855dc64a
fix(dnn/cuda): fix compilation for windows bazel
GitOrigin-RevId: 2023dea19c
3 years ago
Megvii Engine Team
3abe0b2462
fix(mgb): fix rocm pooling
GitOrigin-RevId: 44876d398e
3 years ago
Megvii Engine Team
16678bb998
fix(dnn): fix_short_cutlass_name_gemm
GitOrigin-RevId: cc0a2db9da
3 years ago
Megvii Engine Team
4c13bc7e1b
feat(dnn/cuda): add nhwc int8 deconv
GitOrigin-RevId: ad361a0f81
3 years ago
Megvii Engine Team
11f022ff7c
feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt
GitOrigin-RevId: 229e1eb4be
3 years ago
Megvii Engine Team
a0231a7920
fix(dnn/cuda): fix algo matmul for conv bwd filter
fix fastrun workspace size not available exception and device OOM error caused by the incorrect workspace size calculation of algo matmul of conv bwd filter
GitOrigin-RevId: de96b4fe11
3 years ago
Megvii Engine Team
56c1b626bf
refactor(dnn): move arch-dependant code to arch.h
GitOrigin-RevId: 52350144b1
3 years ago
Megvii Engine Team
67575d582c
feat(mge/opr): add interpolate bilinear mode
GitOrigin-RevId: f7023a3fd3
3 years ago
Megvii Engine Team
0558b2123d
feat(mge/opr): add interpolate nearest mode
GitOrigin-RevId: d384b87f50
3 years ago
Megvii Engine Team
127870a926
feat(dnn/opencl): add heuristic rule for batched matmul
GitOrigin-RevId: bd152428e6
3 years ago
Megvii Engine Team
c25125e3d2
perf(dnn/cuda): sass int8 epilogue remove shared load
GitOrigin-RevId: 2b49f5069b
3 years ago
Megvii Engine Team
55efc8e197
feat(mgb/gopt): add reformat emitter
GitOrigin-RevId: 937b20a57c
4 years ago
Megvii Engine Team
c9d060307f
feat(dnn/common): add named tensor shape
GitOrigin-RevId: 918928b8ba
4 years ago
Megvii Engine Team
ff0e6be7b9
fix(dnn/cuda): fix cutlass tensorop kernels
do not compile cutlass tensorop kernels, when using cuda version less than 10.2
GitOrigin-RevId: d4c37d5f41
3 years ago
Megvii Engine Team
336761253d
feat(dnn/cuda): add tensorcore matmul for fp16 data type
GitOrigin-RevId: 025c591f75
3 years ago
Megvii Engine Team
2c4ee99227
fix(dnn): short cutlass filename in windows
GitOrigin-RevId: 83a43fdf87
3 years ago
Megvii Engine Team
432592374d
build(dnn/cuda): fix cmake compile dependency for cutlass kernels
GitOrigin-RevId: ebe71f5a12
3 years ago
Megvii Engine Team
cc07b96f82
perf(dnn/relayout): disable copy_last_contiguous when contiguous_size is
small
GitOrigin-RevId: 7bd5e793a4
3 years ago
Megvii Engine Team
d195fdec71
refactor(mgb): refactor has-usable-algo function for global optimizer
GitOrigin-RevId: 6610516650
3 years ago
Megvii Engine Team
604bb2a569
feat(mgb/dnn): add int atomic add for megdnn
GitOrigin-RevId: 00d5d752d3
3 years ago
Megvii Engine Team
eab6afab47
feat(mgb): add padding opr for megbrain
GitOrigin-RevId: 490e0c5d5a
4 years ago
Megvii Engine Team
66c18f6054
fix(ci): fix bazel compile error in new macos
GitOrigin-RevId: d22cc87fec
3 years ago
Megvii Engine Team
c88a4e5b32
fix(mgb): fix get env macro
GitOrigin-RevId: 79c602f79c
3 years ago
Megvii Engine Team
9b4b910dc1
feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers
GitOrigin-RevId: 2a70335441
3 years ago
Megvii Engine Team
b18feaab33
feat(dnn/cuda): use cutlass remove shared load imma conv kernel
GitOrigin-RevId: 0b5574f526
4 years ago
Megvii Engine Team
1af350c6d2
feat(dnn): add fill kernel
GitOrigin-RevId: d2cee3a7a0
3 years ago
Megvii Engine Team
3eb0505f9b
feat(imperative): add support for quantized conv transpose2d
GitOrigin-RevId: ffd6431299
3 years ago
Megvii Engine Team
c68e669530
feat(bazel/windows/xp/sp2/inference): implement inference on windows xp
(os vesion >= sp2) build with bazel
* bazel build support(define __DEPLOY_ON_XP_SP2__ when deploy on xp sp2):
(dbg)./bazel build //brain/megbrain:load_and_run --cpu='x86_windows_xp'
--compiler='clang_cl' -c dbg --copt "-D__DEPLOY_ON_XP_SP2__=1"
(opt)./bazel build //brain/megbrain:load_and_run --cpu='x86_windows_xp'
--compiler='clang_cl' -c opt --copt "-D__DEPLOY_ON_XP_SP2__=1"
* internal behavior:
will define MGB_HAVE_THREAD=0 when enable __DEPLOY_ON_XP_SP2__
* refer to
https://docs.microsoft.com/en-us/cpp/build/configuring-programs-for-windows-xp?view=msvc-160
xp sp2(x86) do not support vc runtime fully, casused by KERNEL32.dll do not
implement some base apis for c++ std function, for example,
std::mutex/std::thread/std::condition_variable as a workround, we will
disable some MegEngine features on xp sp2 env, for exampe, multi-thread etc!
* about DNN_MUTEX/MGB_MUTEX, if your code will build in inference
code (even CPU backends), please replace std::mutex to DNN_MUTEX/MGB_MUTEX,
* about multi-thread, if you code need multi-thread support, please
enable it when MGB_HAVE_THREAD=1
* about test build env status
1: Visual Studio 2019(MSVC version <= 14.26.28801)---- pass
2: Visual Studio 2019(MSVC version > 14.26.28801) ---- failed
caused by this 'new' version will put VCR depends on win7
KERNEL32.DLL, this may be fixed at Visual Studio 2019 later version
but we do not test at this MR merge point
3: Visual Studio 2017 ---------- pass
4: Visual Studio 2014 ---------- pass
GitOrigin-RevId: 65ac48b95e
3 years ago
Megvii Engine Team
3b452d8c16
feat(mgb): cuda conv support nhwc format and fp16 dtype
GitOrigin-RevId: b8ddcd108a
3 years ago
Megvii Engine Team
10bcf75767
feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88
GitOrigin-RevId: 613a18dd91
3 years ago