Bug 49269 - Nagios is not always recognizing a drs repl failure
Status: NEW
Product: UCS
Classification: Unclassified
Component: Monitoring (Prometheus or Nagios)
Version: UCS 4.4
Hardware: Other Linux
Importance: P5 normal
Target Milestone: ---
Assigned To: UCS maintainers
QA Contact: UCS maintainers
Depends on:
Blocks:
Reported: 2019-04-09 16:09 CEST by Nico Stöckigt
Modified: 2021-08-30 17:53 CEST
CC List: 0 users

See Also:
What kind of report is it?: Bug Report
What type of bug is this?: 5: Major Usability: Impairs usability in key scenarios
Who will be affected by this bug?: 4: Will affect most installed domains
How will those affected feel about the bug?: 2: A Pain – users won’t like this once they notice it
User Pain: 0.229
Enterprise Customer affected?: Yes
School Customer affected?:
ISV affected?:
Waiting Support:
Flags outvoted (downgraded) after PO Review:
Ticket number: 2019040821000798
Bug group (optional):
Max CVSS v3 score:


Attachments
nagios.log (3.54 KB, text/x-log)
2019-04-09 16:09 CEST, Nico Stöckigt
drs_showrepl.log (3.02 KB, text/x-log)
2019-04-09 16:10 CEST, Nico Stöckigt
log.samba (2.75 MB, text/plain)
2019-04-09 16:13 CEST, Nico Stöckigt

Description Nico Stöckigt univentionstaff 2019-04-09 16:09:24 CEST
Created attachment 9960 [details]
nagios.log

In a customer's environment (UCS 4.4-0 e33), DRS replication stalled because of an error while re-indexing a restored object. 'samba-tool drs showrepl' clearly displayed an error, while the Nagios check stayed completely quiet.

============================================================
root@SLAVE:~# samba-tool drs replicate -d1 $(hostname) "$(ucr get ldap/master)" "DC=domain,DC=tld" --full-sync --local
Partition[DC=domain,DC=tld] objects[403] linked_values[0]
Partition[DC=domain,DC=tld] objects[805] linked_values[0]
Partition[DC=domain,DC=tld] objects[1207] linked_values[0]
../../ldb_key_value/ldb_kv_index.c:2338: unique index violation on objectSid in CN=laptop001,OU=Win10,OU=PC,OU=Company Computer,DC=domain,DC=tld, conficts with objectGUID 3f4932c4-e313-436e-a790-5b7a2cee1383 in @INDEX:OBJECTSID::AQUAAAAAAAUVAAAA/7qOp66+13vfhnwLpwgAAA==
Failed to apply records: Failed to locally apply remote add of CN=laptop001,OU=Win10,OU=PC,OU=Company Computer,DC=domain,DC=tld: ../../ldb_key_value/ldb_kv_index.c:2506: Failed to re-index objectSid in CN=laptop001,OU=Win10,OU=PC,OU=Company Computer,DC=domain,DC=tld - ../../ldb_key_value/ldb_kv_index.c:2351: unique index violation on objectSid in CN=laptop001,OU=Win10,OU=PC,OU=Company Computer,DC=domain,DC=tld: Operations error
Failed to commit objects: WERR_GEN_FAILURE
ERROR(runtime): Error replicating DN DC=domain,DC=tld - (31, "Failed to process 'chunk' of DRS replicated objects: WERR_GEN_FAILURE")
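
To illustrate what a stricter check could look like, here is a minimal Python sketch that scans captured samba-tool output and maps it to a Nagios exit code. It is not the existing UCS Nagios plugin. The function name classify_drs_output is hypothetical, and the "<n> consecutive failure(s)." line format for 'samba-tool drs showrepl' is an assumption that should be verified against the attached drs_showrepl.log. The WERR_GEN_FAILURE string is taken from the output above.

```python
import re

# Standard Nagios plugin exit codes.
OK, CRITICAL = 0, 2

# Assumed showrepl line format: "<n> consecutive failure(s)."
FAILURES_RE = re.compile(r"(\d+) consecutive failure\(s\)")
# WERR_* result codes as seen in the output above (e.g. WERR_GEN_FAILURE).
WERR_RE = re.compile(r"\bWERR_[A-Z0-9_]+\b")


def classify_drs_output(text):
    """Return (exit_code, message) for captured samba-tool output (hypothetical helper)."""
    # Sum the failure counters reported for all replication neighbours.
    failures = sum(int(n) for n in FAILURES_RE.findall(text))
    # Collect distinct error codes; ignore a success code if one is present.
    errors = sorted({code for code in WERR_RE.findall(text) if code != "WERR_OK"})
    if failures or errors:
        detail = ", ".join(errors) if errors else "no WERR code found"
        return CRITICAL, "CRITICAL: %d consecutive failure(s); %s" % (failures, detail)
    return OK, "OK: no replication failures found"


if __name__ == "__main__":
    # Last line of the replicate output quoted in this report.
    sample = ("ERROR(runtime): Error replicating DN DC=domain,DC=tld - "
              "(31, \"Failed to process 'chunk' of DRS replicated objects: "
              "WERR_GEN_FAILURE\")")
    code, message = classify_drs_output(sample)
    print(code, message)
```

A check built this way would treat any non-zero failure counter or any WERR_* error code as CRITICAL, rather than relying on a single indicator.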
Comment 1 Nico Stöckigt univentionstaff 2019-04-09 16:10:21 CEST
Created attachment 9961 [details]
drs_showrepl.log
Comment 2 Nico Stöckigt univentionstaff 2019-04-09 16:13:25 CEST
Created attachment 9962 [details]
log.samba