Bug 55045 - S4-BIND startup timeout with huge/many zones
Summary: S4-BIND startup timeout with huge/many zones
Status: NEW
Alias: None
Product: UCS
Classification: Unclassified
Component: DNS
Version: UCS 5.0
Hardware: Other Linux
: P5 normal
Target Milestone: ---
Assignee: UCS maintainers
QA Contact: UCS maintainers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2022-07-29 15:23 CEST by Dirk Schnick
Modified: 2024-06-28 13:30 CEST (History)
3 users (show)

See Also:
What kind of report is it?: Bug Report
What type of bug is this?: 2: Improvement: Would be a product improvement
Who will be affected by this bug?: 2: Will only affect a few installed domains
How will those affected feel about the bug?: 2: A Pain – users won’t like this once they notice it
User Pain: 0.046
Enterprise Customer affected?: Yes
School Customer affected?: Yes
ISV affected?:
Waiting Support:
Flags outvoted (downgraded) after PO Review:
Ticket number:
Bug group (optional):
Customer ID: 00009
Max CVSS v3 score:


Attachments
bootup logfile of comment-2 (18.52 KB, text/plain)
2024-06-28 13:30 CEST, office
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Dirk Schnick univentionstaff 2022-07-29 15:23:51 CEST
On some servers of a customer the bind9 is not willing to start after the update to UCS5. After boot a manual start via systemctl start bind9.service is working.

If the timeout of /usr/lib/univention-bind/samba4 

wait-for-startup)
                /usr/bin/timeout 30 /bin/sh -c \
                        "until rndc -p 953 status | grep --quiet 'server is up and running'; do sleep 1; done"
                ;;

is increased from 30 to 60 seconds the bind service is also starting during boot.
We have a UCR for backend-ldap please introduce this timeout UCR also for samba backend.
Comment 1 Philipp Hahn univentionstaff 2022-10-13 12:00:49 CEST
See Bug #54108 comment 6:
- For "dns/backend=ldap" there is UCRV "dns/timeout-start"
Comment 2 office 2024-06-28 13:28:58 CEST
I'm also affected by this and it seems there are also some forum reports

* https://help.univention.com/t/bind9-service-dont-start-after-reboot/19748
* https://help.univention.com/t/bind9-fails-to-start-automatically/20758

It seems to be caused by some timeout on starting bind or race-condition between bind and samba4. My attached logs also indicate this.
I get this error when booting my VM-host and all guests firing up in parallel. It looks like the IO becomes to slow for the individual system as a consequence.
Comment 3 office 2024-06-28 13:30:13 CEST
Created attachment 11223 [details]
bootup logfile of comment-2