MegEngine

Commit Graph

Author	SHA1	Message	Date
Megvii Engine Team	eca6e1d931	fix(ci): fixes for ci GitOrigin-RevId: `b0a432bd2e`	3 years ago
Megvii Engine Team	c14e5719f8	feat(mgb/gopt): add profile impl for global layout transform pass GitOrigin-RevId: `8ef62baf79`	3 years ago
Megvii Engine Team	8a3eb05a1b	refactor(mgb/gopt): refactor tensor reformat opt pass GitOrigin-RevId: `a1b1e89b76`	3 years ago
Megvii Engine Team	c33126ab5c	feat(mgb/gopt): add reformat manager GitOrigin-RevId: `b9791b131a`	3 years ago
Megvii Engine Team	4f28e14684	fix(dnn): fix compatibility broken of convolution format GitOrigin-RevId: `c17f02d03d`	3 years ago
Megvii Engine Team	8b40f57738	feat(mgb/dnn): add conv1x1 algo for matrix mul GitOrigin-RevId: `585b2c045a`	3 years ago
Megvii Engine Team	fb49a2834f	refactor(mgb/dnn): refactor enum used in serializing GitOrigin-RevId: `e57af4a59c`	3 years ago
Megvii Engine Team	d69b59035d	feat(dnn): add an get_all_algorithms_safe interface GitOrigin-RevId: `e3734e4531`	3 years ago
Megvii Engine Team	103d7f33ba	refactor(dnn/rocm): update hip license header GitOrigin-RevId: `79d684755d`	4 years ago
Megvii Engine Team	5aa52d3863	feat(dnn/rocm): add adaptive pooling opr GitOrigin-RevId: `e844b3e770`	3 years ago
Megvii Engine Team	83cf4ee64e	refactor(dnn/rocm): remove some useless includes GitOrigin-RevId: `3d2c315a36`	3 years ago
Megvii Engine Team	323a4642e6	feat(dnn/rocm): add topk opr GitOrigin-RevId: `5ecb079854`	3 years ago
Megvii Engine Team	f4784f4af1	feat(dnn/rocm): add argsort opr GitOrigin-RevId: `b4c3eb4707`	3 years ago
Megvii Engine Team	6082c353e7	feat(dnn/rocm): support bool in type_cvt and elemwise GitOrigin-RevId: `ad5ec7bc1c`	4 years ago
Megvii Engine Team	8b94f49328	fix(dnn/cuda): fix elemwise and relayout int4 bug when last shape is 1 GitOrigin-RevId: `e7d64c4987`	3 years ago
Megvii Engine Team	694aa1bd92	feat(dnn): add heuristic cache GitOrigin-RevId: `35e942b5e3`	3 years ago
Megvii Engine Team	bc9cfc277a	feat(mgb): add arm resize nchwxx and naive nearest interp GitOrigin-RevId: `d5fbd59a30`	3 years ago
Megvii Engine Team	722aecd437	feat(mgb): support fp16 nhwc backward GitOrigin-RevId: `954ac6405a`	3 years ago
Megvii Engine Team	0708bc780c	fix(dnn/cuda): disallow implicit dtype conversion in cublaslt matmul algos disable tensor op matmul kernels when input and output tensors are in f32 data type to avoid potential accuracy loss GitOrigin-RevId: `36859cba5a`	3 years ago
Megvii Engine Team	1e83ab638e	feat(dnn): add channelwise conv for fp16 nchw88 GitOrigin-RevId: `1bb64f82c5`	3 years ago
Megvii Engine Team	7b855dc64a	fix(dnn/cuda): fix compilation for windows bazel GitOrigin-RevId: `2023dea19c`	3 years ago
Megvii Engine Team	3abe0b2462	fix(mgb): fix rocm pooling GitOrigin-RevId: `44876d398e`	3 years ago
Megvii Engine Team	16678bb998	fix(dnn): fix_short_cutlass_name_gemm GitOrigin-RevId: `cc0a2db9da`	3 years ago
Megvii Engine Team	4c13bc7e1b	feat(dnn/cuda): add nhwc int8 deconv GitOrigin-RevId: `ad361a0f81`	3 years ago
Megvii Engine Team	11f022ff7c	feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt GitOrigin-RevId: `229e1eb4be`	3 years ago
Megvii Engine Team	a0231a7920	fix(dnn/cuda): fix algo matmul for conv bwd filter fix fastrun workspace size not available exception and device OOM error caused by the incorrect workspace size calculation of algo matmul of conv bwd filter GitOrigin-RevId: `de96b4fe11`	3 years ago
Megvii Engine Team	56c1b626bf	refactor(dnn): move arch-dependant code to arch.h GitOrigin-RevId: `52350144b1`	3 years ago
Megvii Engine Team	67575d582c	feat(mge/opr): add interpolate bilinear mode GitOrigin-RevId: `f7023a3fd3`	3 years ago
Megvii Engine Team	0558b2123d	feat(mge/opr): add interpolate nearest mode GitOrigin-RevId: `d384b87f50`	3 years ago
Megvii Engine Team	127870a926	feat(dnn/opencl): add heuristic rule for batched matmul GitOrigin-RevId: `bd152428e6`	3 years ago
Megvii Engine Team	c25125e3d2	perf(dnn/cuda): sass int8 epilogue remove shared load GitOrigin-RevId: `2b49f5069b`	3 years ago
Megvii Engine Team	55efc8e197	feat(mgb/gopt): add reformat emitter GitOrigin-RevId: `937b20a57c`	4 years ago
Megvii Engine Team	c9d060307f	feat(dnn/common): add named tensor shape GitOrigin-RevId: `918928b8ba`	4 years ago
Megvii Engine Team	ff0e6be7b9	fix(dnn/cuda): fix cutlass tensorop kernels do not compile cutlass tensorop kernels, when using cuda version less than 10.2 GitOrigin-RevId: `d4c37d5f41`	3 years ago
Megvii Engine Team	336761253d	feat(dnn/cuda): add tensorcore matmul for fp16 data type GitOrigin-RevId: `025c591f75`	3 years ago
Megvii Engine Team	2c4ee99227	fix(dnn): short cutlass filename in windows GitOrigin-RevId: `83a43fdf87`	3 years ago
Megvii Engine Team	432592374d	build(dnn/cuda): fix cmake compile dependency for cutlass kernels GitOrigin-RevId: `ebe71f5a12`	3 years ago
Megvii Engine Team	cc07b96f82	perf(dnn/relayout): disable copy_last_contiguous when contiguous_size is small GitOrigin-RevId: `7bd5e793a4`	3 years ago
Megvii Engine Team	d195fdec71	refactor(mgb): refactor has-usable-algo function for global optimizer GitOrigin-RevId: `6610516650`	3 years ago
Megvii Engine Team	604bb2a569	feat(mgb/dnn): add int atomic add for megdnn GitOrigin-RevId: `00d5d752d3`	3 years ago
Megvii Engine Team	eab6afab47	feat(mgb): add padding opr for megbrain GitOrigin-RevId: `490e0c5d5a`	4 years ago
Megvii Engine Team	66c18f6054	fix(ci): fix bazel compile error in new macos GitOrigin-RevId: `d22cc87fec`	3 years ago
Megvii Engine Team	c88a4e5b32	fix(mgb): fix get env macro GitOrigin-RevId: `79c602f79c`	3 years ago
Megvii Engine Team	9b4b910dc1	feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers GitOrigin-RevId: `2a70335441`	3 years ago
Megvii Engine Team	b18feaab33	feat(dnn/cuda): use cutlass remove shared load imma conv kernel GitOrigin-RevId: `0b5574f526`	4 years ago
Megvii Engine Team	1af350c6d2	feat(dnn): add fill kernel GitOrigin-RevId: `d2cee3a7a0`	3 years ago
Megvii Engine Team	3eb0505f9b	feat(imperative): add support for quantized conv transpose2d GitOrigin-RevId: `ffd6431299`	3 years ago
Megvii Engine Team	c68e669530	feat(bazel/windows/xp/sp2/inference): implement inference on windows xp (os vesion >= sp2) build with bazel * bazel build support(define __DEPLOY_ON_XP_SP2__ when deploy on xp sp2): (dbg)./bazel build //brain/megbrain:load_and_run --cpu='x86_windows_xp' --compiler='clang_cl' -c dbg --copt "-D__DEPLOY_ON_XP_SP2__=1" (opt)./bazel build //brain/megbrain:load_and_run --cpu='x86_windows_xp' --compiler='clang_cl' -c opt --copt "-D__DEPLOY_ON_XP_SP2__=1" * internal behavior: will define MGB_HAVE_THREAD=0 when enable __DEPLOY_ON_XP_SP2__ * refer to https://docs.microsoft.com/en-us/cpp/build/configuring-programs-for-windows-xp?view=msvc-160 xp sp2(x86) do not support vc runtime fully, casused by KERNEL32.dll do not implement some base apis for c++ std function, for example, std::mutex/std::thread/std::condition_variable as a workround, we will disable some MegEngine features on xp sp2 env, for exampe, multi-thread etc! * about DNN_MUTEX/MGB_MUTEX, if your code will build in inference code (even CPU backends), please replace std::mutex to DNN_MUTEX/MGB_MUTEX, * about multi-thread, if you code need multi-thread support, please enable it when MGB_HAVE_THREAD=1 * about test build env status 1: Visual Studio 2019(MSVC version <= 14.26.28801)---- pass 2: Visual Studio 2019(MSVC version > 14.26.28801) ---- failed caused by this 'new' version will put VCR depends on win7 KERNEL32.DLL, this may be fixed at Visual Studio 2019 later version but we do not test at this MR merge point 3: Visual Studio 2017 ---------- pass 4: Visual Studio 2014 ---------- pass GitOrigin-RevId: `65ac48b95e`	3 years ago
Megvii Engine Team	3b452d8c16	feat(mgb): cuda conv support nhwc format and fp16 dtype GitOrigin-RevId: `b8ddcd108a`	3 years ago
Megvii Engine Team	10bcf75767	feat(dnn/x86): add algo for x86 max pooling for Window size bigger than 10 and S1 under NCHW88 GitOrigin-RevId: `613a18dd91`	3 years ago

1 2 3 4 5 ...

497 Commits (eca6e1d9313803bf7f822722abf9ad7e092a5203)