Megvii Engine Team
|
fea46ea9a4
|
perf(imperative): add opr cache for apply_on_physical_tensor
GitOrigin-RevId: fc5d5fb34d
|
4 years ago |
Megvii Engine Team
|
ea4e6ab93a
|
fix(mgb/opr): fix shape cache of NvOF
GitOrigin-RevId: 456ba478e9
|
4 years ago |
Megvii Engine Team
|
87de704a46
|
feat(gopt): fuse conv h_swish
GitOrigin-RevId: a3d12991fb
|
3 years ago |
Megvii Engine Team
|
3726f5cc92
|
feat(gopt): merger consecutive relayout and dimshuffle to one relayout to optimize CD4 performarce
GitOrigin-RevId: a058776be3
|
3 years ago |
Megvii Engine Team
|
1fead9b6b0
|
feat(gopt): merge consecutive dimshuffle and relayout to one relayout to optimize CD4 performace
GitOrigin-RevId: 16f22baa80
|
3 years ago |
Megvii Engine Team
|
26d1e4f7ed
|
feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible
GitOrigin-RevId: 6580dedca7
|
3 years ago |
Megvii Engine Team
|
5f4501e0f3
|
fix(gopt): fix conv bias fuse 2 noline
GitOrigin-RevId: a6ab9f4e5e
|
3 years ago |
Megvii Engine Team
|
7d2063e35a
|
perf(cuda): speedup conv backward data with small feature map and large filter size
GitOrigin-RevId: 85592bca6b
|
3 years ago |
Megvii Engine Team
|
28d48f2f7a
|
fix(mgb/src): fix megbrain cmake unsupport android_nn
GitOrigin-RevId: 037c197912
|
3 years ago |
Megvii Engine Team
|
187c1dc081
|
fix(jit): copy aux var when shallow copying JITExecutor
GitOrigin-RevId: 3b331e1c17
|
3 years ago |
Megvii Engine Team
|
b6ce02a152
|
fix(subgraph): fallback back to cg if jit unsupported
GitOrigin-RevId: 853a00a402
|
3 years ago |
Megvii Engine Team
|
c55fda9a7c
|
fix(fastrun): don't kill profiling worker
GitOrigin-RevId: 99a0f11a5a
|
3 years ago |
Megvii Engine Team
|
aa587446fc
|
feat(subgraph): support shape inference for CompiledOp
GitOrigin-RevId: a96b8f3446
|
3 years ago |
Megvii Engine Team
|
bdb853ee6f
|
fix(mgb): fix extra device malloc when load MultipleDeviceTensorWithFormatHolder
GitOrigin-RevId: adf4a7f77a
|
3 years ago |
Megvii Engine Team
|
e2b79ea00e
|
feat(mgb): reduce the number of trtruntimeopr create contexts
GitOrigin-RevId: 14e5d1769e
|
3 years ago |
Megvii Engine Team
|
95ac055538
|
feat(dnn,mgb,imperative): add diag opr implement
GitOrigin-RevId: 43016ffa2b
|
3 years ago |
Megvii Engine Team
|
cbbca5fb10
|
feat(mge): add softmax op use cudnn api
GitOrigin-RevId: 7734ebf8c4
|
3 years ago |
Megvii Engine Team
|
20b42a8c3b
|
fix(dnn): add naive lstm kernel
GitOrigin-RevId: f08ef810cf
|
3 years ago |
Megvii Engine Team
|
2faa6ea5a9
|
Merge pull request #213 from kxz18:rnn
GitOrigin-RevId: 9e9215c115
|
3 years ago |
Megvii Engine Team
|
85ea882cb5
|
fix(mgb/ops): immutable tensor support empty storage
GitOrigin-RevId: 2851498fce
|
3 years ago |
Megvii Engine Team
|
4b0ecb5deb
|
fix(ops/recv): use std::vector to store shape to support scalar
GitOrigin-RevId: e1dac3c919
|
3 years ago |
Megvii Engine Team
|
f4f20046c4
|
fix(mgb): fix tensorrt runtimeopr get output var shape bug
GitOrigin-RevId: b830706a89
|
3 years ago |
Megvii Engine Team
|
1999307015
|
feat(mgb/opr): add dropout kernel
GitOrigin-RevId: d248bd2005
|
3 years ago |
Megvii Engine Team
|
a93741815b
|
feat(mgb/opr): add layernorm forward and backward kernel
GitOrigin-RevId: 0cd484e753
|
3 years ago |
Megvii Engine Team
|
1657b8e881
|
fix(fastrun): fix persistent_cache in redis
GitOrigin-RevId: ada5862b05
|
3 years ago |
Megvii Engine Team
|
a404cd7d06
|
fix(mgb/src): add tensorRT version check
GitOrigin-RevId: 7abfd30cab
|
3 years ago |
Megvii Engine Team
|
c53cad2049
|
feat(cmake): format all cmake file
GitOrigin-RevId: 0a4ecab99b
|
3 years ago |
Megvii Engine Team
|
6011f51001
|
style(all): fix clang-format for MGB_DEFINE inside another macro
GitOrigin-RevId: 8c2b6a2aed
|
3 years ago |
Megvii Engine Team
|
7231257efc
|
fix(imperative/fastrun): fix worksapce limit for cpu compnode
GitOrigin-RevId: 4583ce6d4b
|
3 years ago |
Megvii Engine Team
|
a72e0cb568
|
feat(imperative,src): add jit builder for custom op
GitOrigin-RevId: 3bb0b46311
|
3 years ago |
Megvii Engine Team
|
93310c0e4b
|
fix(mgb/gopt): fix cpu global layout transform fastrun error
GitOrigin-RevId: ea254297e5
|
3 years ago |
Megvii Engine Team
|
8624ec224b
|
fix(mgb): fix param merge bug that caused the weight statistics error
GitOrigin-RevId: f76a096832
|
3 years ago |
Megvii Engine Team
|
46d4bd8a59
|
feat(windows): let sdk do not care about more macro on win
GitOrigin-RevId: c522c2fd63
|
3 years ago |
Megvii Engine Team
|
202b407149
|
fix(core): fix output var replaced by optpass
GitOrigin-RevId: aea62de345
|
3 years ago |
Megvii Engine Team
|
e715423f20
|
feat(src/gopt): add optpass on arm for fusing typecvt and elemwise to elemwise multi type
GitOrigin-RevId: e6bcbbf91b
|
3 years ago |
Megvii Engine Team
|
d9a46ea47b
|
fix(dnn): correct behaviour of floor div for int tensor
GitOrigin-RevId: 1444f69cce
|
3 years ago |
Megvii Engine Team
|
cf1db2616e
|
fix(fastrun): replace py_redis with cpp_redis to avoid deadlock
GitOrigin-RevId: 9af7fa5c97
|
3 years ago |
Megvii Engine Team
|
390d2bb545
|
feat(mgb): tensorrt runtime opr support mutiple profiles
GitOrigin-RevId: 1157d34e4d
|
3 years ago |
Megvii Engine Team
|
1708ab2ec6
|
feat(mgb): add tensorrt runtime dynamic batch testcase
GitOrigin-RevId: 36372437ff
|
3 years ago |
Megvii Engine Team
|
87c845fd61
|
feat(mgb): tensorrt runtime opr support dynamic batch trt model
GitOrigin-RevId: 7461de704e
|
3 years ago |
Megvii Engine Team
|
ce119ef5a5
|
fix(lite): fix lite error when record level is 2
GitOrigin-RevId: 7dabfd8876
|
3 years ago |
Megvii Engine Team
|
0ad5eeaedd
|
feat(mgb/gopt): global layout transform support opencl
GitOrigin-RevId: 132605c7d9
|
3 years ago |
kxz@thumt102-1
|
8f48da7ffe
|
feat(mgb/opr): add cell level rnn/lstm and sequence level rnn/lstm
|
3 years ago |
Megvii Engine Team
|
2881934cb8
|
feat(dnn/check_non_finite): addmul scale to check_non_finite opr
GitOrigin-RevId: c35a219e52
|
3 years ago |
Megvii Engine Team
|
b8ccc6a211
|
fix(mgb): fix loss execution policy after opr shallow copy
GitOrigin-RevId: 4738136e4a
|
3 years ago |
Megvii Engine Team
|
c27f678230
|
feat(src/opr): add api of training of cpp and related test
GitOrigin-RevId: befb85fd43
|
3 years ago |
Megvii Engine Team
|
6bb5409976
|
feat(dnn/src): add images2neibs kernel of opencl and related test
GitOrigin-RevId: 82242b7437
|
3 years ago |
Megvii Engine Team
|
501eadc1db
|
fix(mgb): fix copybara mc20
GitOrigin-RevId: 2b491e2278
|
3 years ago |
Megvii Engine Team
|
2b8e7940b6
|
fix(lite/cambricon): fix cambricon models which have multiple comp node
GitOrigin-RevId: 624fd7f0ce
|
3 years ago |
Megvii Engine Team
|
cfad9a5df3
|
fix(mgb/cambricon): fix magicmind runtime opr when set workspace point second time
GitOrigin-RevId: 1ac9d0eaba
|
3 years ago |