Affected: Lightning server
Operational from 5:49 PM to 4:01 AM
- Resolved
We are extremely happy to announce that the server is back online at full capacity. With the collaboration of our staff and the CloudLinux staff, we were able to recover the boot partition and bring the server back online.
We found that the corruption of the boot partition happened before the hardware migration took effect, so we are hoping the new hardware will resolve the random reboots on this server. We also have the CloudLinux staff reviewing a kernel dump from another server that has a similar issue with random reboots. Once they analyze the kdump from that server, we should have more information on what is causing them. Although from what we have seen on this server, it may be unrelated and was most likely a failing hardware component.
We thank everyone for their patience during all this. This was certainly one of our worst and most challenging outages since we have been in business. We really felt it was going to lead to a server reinstall, but our team and the CloudLinux team pulled together and managed to rebuild and restore the boot partition.
- Update
We have discovered that after the crash of the server, one of the partitions was corrupted. We are working to repair the partition and will go from there. If we are unable to repair the partition, we will most likely have to do a full reinstall and restore from backups.
We anticipate that neither process will be quick, so you should prepare for a lengthy outage. Do not panic: we have full backups of all accounts from today, and they will be restored if we need to reinstall the server.
- Update
We received notice that the drives were migrated to the new hardware, but we are now facing issues getting the server to boot the OS. We are working on this along with the datacenter to get the server back online.
In an absolute worst-case scenario, we do have up-to-date backups if it comes to that, but we are not at that stage yet.
- Update
We are still waiting on confirmation that the hardware swap is complete. We will update as soon as we hear anything.
- Update
We are going to proceed with the hardware replacement to hopefully prevent further outages. You can expect around 2 hours of downtime while this takes place. We will update further once the server is back online.
- Update
We are in talks about replacing the hardware on the server and moving the hard drives to the new machine. Once we decide to move forward, we will update this incident.
- Identified
The Lightning server appears to have crashed again. We have the datacenter investigating the cause of the crash. Updates will follow as more information comes in.
- Resolved
We are still working with Cloudlinux to troubleshoot these issues, but for now we are going to close this post to unclutter our client portal. We will still post updates as more information comes in or if there are any further issues.
- Update
We have brought in CloudLinux technicians to take a further look at this. There have been similar issues on other servers since installing CloudLinux, and we are doing everything possible to get to the root of the problem.
- Monitoring
The Lightning server is now back online. The datacenter did not detect any hardware issues, so we are going to dig through the server logs and see what is causing this. We will be monitoring the server very closely for any further disruptions.
- Update
The datacenter has reported that they are investigating the issue. We will update when we get more details.
- Update
We are still not sure of the issue, but the server is not coming up after a hardware reset, so we have contacted the datacenter to investigate further. We suspect it may be a failed hardware component, but we will wait for them to confirm.
- Investigating
We are aware of an issue with the Lightning server. It rebooted several times in quick succession and then did not come back up. We are investigating and will update when we have more information.