Commit e66d328
CS-1187 add systemd and cgroups integration (#60)
* EH: CS-1188 control daemons with systemd
* avoid endless loop in case an invalid slice is given in the autoinstall template
// BelongsTo: CS-1188
* EH: CS-1192 at startup of daemons output the cgroups slice the service is running in
* fixed typo "deamon"
* EH: CS-1223 with systemd integration, move sge_shepherd processes out of the sge_execd service cgroup
* sd_bus method StartTransientUnit only starts a job that creates the unit
and returns before the action has actually finished.
Need to wait for the job to finish.
// BelongsTo: CS-1123
* - do not report systemd as init system on ulx-* as we cannot build systemd support in sge_execd, libsystemd.so is too old
  - fixed broken build on CentOS 8
* - sd_bus error was not reported to caller
  - error messages were truncated at 100 characters, introduced SFN4 macro for 400 character strings
  - fixed non-unique message ids
* EH: CS-1291 move shepherd child to its own scope
* shepherd tried to use systemd on hosts that have the systemd library but do not use systemd as init system (Antix Linux)
* EH: CS-1292 get job online usage information via systemd
* tried to connect to systemd on host not having systemd
* errors in StartTransientUnit were not always propagated to caller
* EH: CS-1294 set job limits via systemd
* EH: CS-1315 set binding via systemd
* cleanup
* EH: CS-1295 set device isolation via systemd
* EH: CS-1241 add profiling information for systemd operations
* - execd profiling could not be disabled again
  - cleanup, moved code to own module
// BelongsTo: CS-1241
* EH: CS-1318 allow to run jobs under systemd control even if sge_execd itself is not started as systemd service
* EH: CS-1319 make running jobs under systemd control configurable
* added ENABLE_SYSTEMD to sge_conf.5 man page
// BelongsTo: CS-1319
* EH: CS-1322 the job specific scopes need to contain the toplevel slice name to be unique
* EH: CS-1300 do not add and handle the additional group id for jobs running under systemd
* BF: CS-1325 possible race condition between calling StartTransientUnit and waiting for the corresponding job to finish
* EH: CS-1296 kill jobs via systemd
* EH: CS-1321 allow to configure a hybrid usage data collection (both via systemd and the pdc)
* fixed memory leaks
* BF: CS-1335 need special handling for interrupted system call
* EH: CS-1342 add systemd specific settings (toplevel slice name) to the installation guide
* cleanup and added systemd integration to the release notes
* cleanup
* - addressed review comments
  - fixed a race condition leading to multiple execd children trying to create the shepherds.scope
* added more details of the systemd integration to the release notes
* addressed review comments
* refactoring and documentation with Doxygen headers
* EH: CS-1408 USAGE_COLLECTION mode must be kept consistent for running jobs
* EH: CS-1419 disable systemd integration if sge_execd is started as non privileged user
* with HYBRID usage collection, non-systemd hosts did not report cpu and rss
* reprioritization code was broken by systemd integration
// SeeAlso: CS-1421
* - improved diagnostics when ptf job / osjob cannot be found
  - enforce cleanup in execd only when KEEP_ACTIVE is changed to FALSE
* BF: CS-1019 sge_execd logs errors when running tightly integrated parallel jobs
* BF: CS-1425 backup/restore does not handle $SGE_ROOT/$SGE_CELL/slice_name
* BF: CS-1429 sge_qmaster can segfault on qdel -f
* BF: CS-1019 sge_execd logs errors when running tightly integrated parallel jobs
* BF: CS-1430 running tightly integrated parallel jobs leaves systemd slices
// + additional cleanup
* fix to the fix for CS-1019
* added missing files
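
The item about truncated error messages can be illustrated with a minimal sketch. It assumes that the existing `SFN` format macro limits a string argument to 100 characters through a printf precision and that the new `SFN4` macro does the same for 400 characters; both definitions below, and the helper `format_message`, are illustrative and not taken from the commit.

```cpp
#include <cstdio>
#include <string>

// Assumed definitions: a printf precision caps how many characters of the
// string argument are printed. The commit only states that SFN4 is meant
// for 400 character strings.
#define SFN  "%-.100s"
#define SFN4 "%-.400s"

// Hypothetical helper: formats one string argument into a message.
// The buffer is large enough for a short prefix plus 400 characters.
std::string format_message(const char *fmt, const std::string &detail) {
    char buf[512];
    std::snprintf(buf, sizeof(buf), fmt, detail.c_str());
    return std::string(buf);
}
```

With a 250 character detail string, a message built with `SFN` keeps only the first 100 characters of the detail, while one built with `SFN4` keeps all of them.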
---------
Co-authored-by: Joachim Gabler <joga.oge@gabler-net.de>
1 parent 1c0f383 commit e66d328
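
The commit message notes that StartTransientUnit only queues a job and that there was a possible race between calling it and waiting for the job to finish (CS-1325). One common way to close such a race is to subscribe to systemd's `JobRemoved` signal before the call and to remember every reported result, so that a signal arriving before the caller knows its job path is not lost. The sketch below models only that bookkeeping, without any sd_bus calls; the class `JobResultCache` and its methods are hypothetical and are not the code from this commit.

```cpp
#include <map>
#include <string>

// Hypothetical helper: stores the result string of every JobRemoved signal,
// keyed by the job object path (e.g. "/org/freedesktop/systemd1/job/42").
class JobResultCache {
public:
    // Intended to be called from the JobRemoved signal handler, which is
    // assumed to be registered before StartTransientUnit is invoked.
    void on_job_removed(const std::string &job_path, const std::string &result) {
        results_[job_path] = result;
    }

    // Intended to be called once StartTransientUnit has returned the job path.
    // Returns true and fills 'result' if the job has already finished,
    // false if the caller still has to wait for the signal.
    bool take_result(const std::string &job_path, std::string &result) {
        std::map<std::string, std::string>::iterator it = results_.find(job_path);
        if (it == results_.end()) {
            return false;
        }
        result = it->second;
        results_.erase(it);
        return true;
    }

private:
    std::map<std::string, std::string> results_;
};
```

In this model the caller keeps processing bus messages until `take_result` succeeds; systemd reports the result `done` for a job that finished successfully.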
100 files changed
Lines changed: 4593 additions & 883 deletions
File tree
- cmake
- doc/markdown
  - manual
    - development-guide
    - installation-guide
    - platform-guide
    - release-notes
  - man/man5
- source
  - common
  - daemons
    - common
    - execd
    - qmaster
    - shepherd
  - dist
    - util
      - install_modules
      - rctemplates
  - libs
    - cull
    - sgeobj
      - cull
      - json
    - uti
  - security/gss/doc