Univention Bugzilla – Bug 29463
Nagios monitoring of DRS replication
Last modified: 2016-10-04 17:05:28 CEST
A disruption of DRS replication can cause numerous problems; for example, a DC that has been "disconnected" for a longer time can replicate old changes and thereby revert more recent ones. A Nagios check for DRS replication would be good, e.g. by evaluating the output of "samba-tool drs showrepl" (or better, by querying samba4 directly?).

Typical failure situations:
- unreachable hosts (e.g. due to firewall changes etc.)
- Kerberos problems (in the simplest case caused by a wrong system time)
Requested again in Ticket#2013031321000951
A customer requested a Nagios check for DRS replication in a phone call.
See also Ticket #2016082321000589.
* univention-nagios-samba
  new package, installs the nagios plugin check_univention_samba_drs_failures, creates the nagios service UNIVENTION_SAMBA_DRS_FAILURES and adds the host to this service

  the plugin returns:
    NAGIOS_STATE_OK - if samba4/autostart is not true
    NAGIOS_STATE_CRITICAL - if there are "consecutive_sync_failures"
    NAGIOS_STATE_CRITICAL - for every other error
    NAGIOS_STATE_OK - if everything is fine

  the service config is:
    normalCheckInterval=10
    retryCheckInterval=1
    maxCheckAttempts=10

  in case of a failure, nagios rechecks the plugin up to 10 times (maxCheckAttempts) at an interval of one minute (retryCheckInterval) before sending an email. So DRS replication can be broken for ca. 10 minutes before nagios triggers the alarm, is that OK?

  => merged to 4.2.0

* univention-samba4
  added univention-nagios-samba to recommended packages
  => merged to 4.2-0

* univention-dvd
  added univention-nagios-samba to task-ucs413 (4.1-3)
  added univention-nagios-samba to task-ucs413, task-ucs420 (4.2-0)

* trigger list
  added univention-nagios-samba to ucs_4.1-0-ucs4.1-3.txt

univention-samba4.yaml 5.0.1-40.671.201608251703
univention-nagios-samba 1.0.0-1.1.201608251657
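The return-value rules listed above can be illustrated with a minimal sketch. This is not the code of check_univention_samba_drs_failures; it only assumes that the text output of "samba-tool drs showrepl" contains lines of the form "N consecutive failure(s)." and that samba4/autostart counts as true for the values in TRUE_VALUES. The names evaluate, is_true and run are hypothetical, and treating output without any failure counters as CRITICAL is one possible reading of "every other error".

```python
import re
import subprocess
import sys

OK, CRITICAL = 0, 2  # standard Nagios plugin exit codes

# Assumption: showrepl prints lines such as "3 consecutive failure(s)."
FAILURES_RE = re.compile(r"(\d+)\s+consecutive failure\(s\)")
# Assumption: these spellings are treated as "true" for samba4/autostart
TRUE_VALUES = {"true", "yes", "1", "enable", "enabled", "on"}


def is_true(value):
    """Return True if a UCR-style value reads as enabled."""
    return value.strip().lower() in TRUE_VALUES


def evaluate(autostart, showrepl_output):
    """Map the two inputs to a (state, message) pair."""
    if not is_true(autostart):
        return OK, "OK: samba4/autostart is not true, check skipped"
    counts = [int(n) for n in FAILURES_RE.findall(showrepl_output)]
    if not counts:
        # Assumption: output without counters is handled as an error
        return CRITICAL, "CRITICAL: no replication status found"
    failed = sum(1 for n in counts if n > 0)
    if failed:
        return CRITICAL, "CRITICAL: %d link(s) with sync failures" % failed
    return OK, "OK: no DRS replication failures"


def run(cmd):
    """Run a command and return its stdout; raises on a non-zero exit."""
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout


if __name__ == "__main__":
    try:
        autostart = run(["ucr", "get", "samba4/autostart"])
        # Only query Samba when it is expected to be running
        output = ""
        if is_true(autostart):
            output = run(["samba-tool", "drs", "showrepl"])
        state, message = evaluate(autostart, output)
    except (OSError, subprocess.CalledProcessError) as exc:
        state, message = CRITICAL, "CRITICAL: %s" % exc
    print(message)
    sys.exit(state)
```

Keeping the decision logic in evaluate, separate from the command calls, lets the rules be tested against captured showrepl output without a running Samba DC.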
It is announced as unmaintained:

/var/univention/buildsystem2/mirror/testing/4.1/unmaintained/component/4.1-3-errata-test/amd64/univention-nagios-samba_1.0.0-1.2.201608261028_amd64.deb
/var/univention/buildsystem2/mirror/testing/4.1/unmaintained/component/4.1-3-errata-test/i386/univention-nagios-samba_1.0.0-1.2.201608261028_i386.deb
added univention-nagios-samba to
  ucs_4.1-3_i386.maintained
  ucs_4.1-3_amd64.maintained

-> apt-cache policy univention-nagios-samba
univention-nagios-samba:
  Installiert: (keine)
  Installationskandidat: 1.0.0-1.2.201608261028
  Versionstabelle:
     1.0.0-1.2.201608261028 0
        500 http://updates-test.software-univention.de/4.1/maintained/component/ 4.1-3-errata-test/amd64/ Packages
OK, it is now activated after my upgrade. One issue, you named the nagios check UNIVENTION_SAMBA_DRS_FAILURES. That is different from all other names and looks wrong in the nagios overview. Can you rename it to UNIVENTION_SAMBA_DRS or UNIVENTION_SAMBA_REPLICATION?
(In reply to Stefan Gohmann from comment #7)
> OK, it is now activated after my upgrade. One issue, you named the nagios
> check UNIVENTION_SAMBA_DRS_FAILURES. That is different from all other names
> and looks wrong in the nagios overview. Can you rename it to
> UNIVENTION_SAMBA_DRS or UNIVENTION_SAMBA_REPLICATION?

OK, renamed the nagios service to UNIVENTION_SAMBA_REPLICATION (updated univention-nagios-samba in errata-4.1-3, 4.2-0, updated yaml)
Code review: OK
ucs-test: OK
Tests: OK
YAML: OK
<http://errata.software-univention.de/ucs/4.1/241.html> <http://errata.software-univention.de/ucs/4.1/257.html>