• Introducing WebPros Cloud - a fully managed infrastructure platform purpose-built to simplify the deployment of WebPros products !  WebPros Cloud enables you to easily deliver WebPros solutions — without the complexity of managing the infrastructure.
    Join the pilot program today!
  • Support for BIND DNS has been removed from Plesk for Windows due to security and maintenance risks.
    If a Plesk for Windows server is still using BIND, the upgrade to Plesk Obsidian 18.0.70 will be unavailable until the administrator switches the DNS server to Microsoft DNS.

Issue Alma 8/9 increasing number of server crashes for apparently "no reason"

Bitpalast

Plesk addicted!
Plesk Guru
We are seeing an increasing number of random server crashes on Alma 8 and 9 systems since about April 2025. It seems that the load on servers strongly increases within seconds so that the system becomes unresponsive. However, there is no traffic increase seen, no suspicious log entries, just nothing that points to a cause in websites, database or PHP usage. We suspect that the wave of updates that were delivered during the past months by Alma could have something to do with it, but it's just a guess, because that's the only thing that has changed.

My question here is whether others on Alma 8 or 9 have experienced similar symptoms lately.
 
Peter, FWIW, I cannot see any unusual activity in the support queue reporting abnormalities with the server load and random crashes.
 
Yes, we are seeing the same since upgrading a cpanel server to Alma last month.
Prior to that no crashes in months, now at least once a week it just stops responding.
Can’t log into whm, nothing. We have to do a reset in vcenter.
 
Hello Peter, maybe a Problem on the Infrastructure Level? We had such situations already at HE and H…
I think that on one server we might have caught a real hardware error with the PCIe interface on the RAID controller. But: We still saw this on two other machines lately. On one of them it might have been caused by too many processes, but we'll still need to see what will happen on that machine during the next few weeks. We're doing snapshots of the process list there every five seconds, so hopefully we'll gain some insights if the issue appears again. However, on one other machine, RAID controller is totally different and operating system is different, too, plus there are no signs for any extensive use by customers. It still feels unsafe, but with nothing logged it'll be awfully hard to find out more.
 
Back
Top