MonsterMegs - Notice history

All systems operational

About This Site

Welcome to MonsterMegs' status page. If you want to keep on top of any disruptions in service to our website, control panel, or hosting platform, this is the page to check. We report minor/major outages and scheduled maintenance to this status page. However, issues affecting a small number of customers may be reported directly to affected customers in MyMonsterMegs (https://my.monstermegs.com). If you're experiencing an issue you do not see reported on this page, please log into MyMonsterMegs to view any alerts our team has added to your account.

100% - uptime

Website - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

Customer Portal - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024
100% - uptime

Thunder - Operational

100% - uptime
Apr 2024 · 99.09%May · 100.0%Jun · 99.69%
Apr 2024
May 2024
Jun 2024

Hurricane - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

Storm - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

Lightning - Operational

100% - uptime
Apr 2024 · 99.83%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024
100% - uptime

DNS-1 - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

DNS-2 - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

DNS-3 - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

DNS-4 - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024
100% - uptime

US Backup Storage Daily - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

US Backup Storage Weekly - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

EU Backup Storage Daily - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

EU Backup Storage Weekly - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024
100% - uptime

Zabbix Monitoring Server US - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

Zabbix Monitoring Server EU - Operational

100% - uptime
Apr 2024 · 100.0%May · 100.0%Jun · 100.0%
Apr 2024
May 2024
Jun 2024

Third Party: Cloudflare → Cloudflare Sites and Services → CDN/Cache - Operational

Notice history

Jun 2024

Investigating issue with Thunder server
  • Resolved
    Resolved

    The motherboard and memory have been replaced. The server is now back online. With all new hardware except for the hard drives. This should resolve the reboot issues. So we are gonna consider this closed.

  • Identified
    Identified

    There have been a few random reboots, but not quite as frequently as they were. Since this is still ongoing, the datacenter is going to replace the memory and motherboard, that we have replaced all components with new ones.

    The server is going offline now and is expected to be down for about 30-60 minutes. We will update when the server is back online.

  • Monitoring
    Monitoring

    We are monitoring the server for random reboots to track down the cause. There may be short interruptions though out the day as we attempt to capture logs on the random reboots.

  • Resolved
    Resolved

    The server seems to be stable at this time. We will reopen this incident if there are any further issues.

  • Monitoring
    Monitoring

    Upon a kernel crash, the server did not come up properly. We initiated a manual reboot and now the server is back online. We will monitor closely and update if there are any further issues.

  • Investigating
    Investigating

    We are currently investigating an issue with the server Thunder. Our engineers have been alerted and further details will be provided if necessary.

Investigating issue with Thunder server
  • Resolved
    Resolved

    We are gonna close this incident for now. Meanwhile we will monitor the server closely and if any further issues arrive, we will once again reopen it with additional updates.

  • Monitoring
    Monitoring

    The cpu has been replaced and we will continue to monitor closely to watch for any further reboots.

  • Identified
    Identified

    Well the server was running fine for about 1 1/2 hours with no reboots and then the issued returned. We are gonna try to swap out the cpu and hopefully this resolves the problem. If that does not work, the last resort would be to change out the chassis completely.

    The server will be going down in about 5 minutes to replace the cpu.

  • Resolved
    Resolved

    It appears the PSU replacement did the trick and the server is no longer rebooting every few minutes. We will continue to monitor closely over the next few hours and will reopen this incident if the issue arises again.

  • Monitoring
    Monitoring

    The Power supply has been replaced and the server is back online. We are going to monitor closely for any further repeated reboots over the next couple hours.

  • Identified
    Update

    In talks with the datacenter, it appears that the power supply may have become faulty. They are going to take the server down and replace the power supply.

  • Identified
    Identified

    We are working out an issue with repeated reboots on the server. We will update when we have more information.

  • Investigating
    Investigating

    We are currently investigating an issue with the server Thunder. Our engineers have been alerted and further details will be provided if necessary.

May 2024

Reboot and Failed Raid Drive - Storm
  • Resolved
    Resolved

    The firmware upgrades have been completed and everything is back online. We found there was an issue with earlier versions of the Samsung 990 Pro's that were allowing the disk to fail much earlier than its lifespan. We applied the latest firmware that addresses that issue as well as both of the drives in the server as of now, are the latest production version of the hard drives.

    So with that said, we do not anticipate any further hard drive failures.

  • Identified
    Update

    The server has been back online for about 30 minutes now. We are waiting for the raid to finish rebuilding and then we will proceed with the firmware updates.

  • Identified
    Update

    The server has just went down for the hard drive replacement. Please anticipate several shorter downtimes over the next couple hours as we apply these firmware updates in rescue mode.

  • Identified
    Update

    We have checked the server in recue mode and the drive did not show. We have gotten the server back online with the single drive. So the datacenter will be replacing this drive very shortly. Once the server is back online, we will performing firmware updated so all internal server components to hope this rectifies the hard drives failing or bricking themselves before their life span.

  • Identified
    Identified

    We are going to perform an emergency reboot to check a reported failed hard drive in our raid setup. While doing this, we will most likely be doing firmware updates on the motherboard and hard drives to try and resolve these hard drives that keep getting reported as failed.

    We will update as we determine the course of action.

Apr 2024

Emergency Downtime - Storm
  • Completed
    April 18, 2024 at 10:46 AM
    Completed
    April 18, 2024 at 10:46 AM

    Not sure why our last update did not post, but the hard drive replacement has been completed. We also modified boot records across all drives to fix issues with recent boot records becoming unavailable. This should lead to a more stable environment as compared to the last couple months.

  • Update
    April 18, 2024 at 12:48 AM
    In progress
    April 18, 2024 at 12:48 AM

    The server is back online temporarily. We first had to reinstall boot records to the single drive. Now that we know it boots, we are gonna contact the datacenter to replace the failed hard drive. After they replace the drive, the server should boot normally and we will readd the drive to the raid array. Depending on how fast the datacenter can install the new drive will determine when the maintenance window will close. Until they can install the hard drive, the server will be online.

    The reinstall of the boot records tooo longer than expected due to the server moving the boot folder and modifying the boot entries to the folders new location, which caused havoc on regenerating the boot records due files listing the wrong folder locations. This is not something that usually happens, so it took awhile to track down.

    We would anticipate the datacenter will replace the drive within the next hour or 2.

  • Update
    April 18, 2024 at 12:17 AM
    In progress
    April 18, 2024 at 12:17 AM

    The maintenance is still in progress, but it is taking longer than we expected. We still hope to have the server back up by the close of the maintenance window. If there will be any further delays, we will update accordingly.

  • Update
    April 17, 2024 at 11:00 PM
    In progress
    April 17, 2024 at 11:00 PM
    Maintenance is now in progress
  • In progress
    April 17, 2024 at 11:00 PM
    In progress
    April 17, 2024 at 11:00 PM
    Maintenance is now in progress
  • Planned
    April 17, 2024 at 11:00 PM
    Planned
    April 17, 2024 at 11:00 PM

    We are scheduling an emergency downtime window of 2 hours, starting at 6pm CST and lasting until 8pm CST. during this emergency downtime, we will be taking the server offline to investigate a failed/offline hard drive in our raid array and also will be doing some maintenance to resolve some of the crashes and reboot issues we have suffered in the past.

    While we are scheduling a 2 hour window, we believe this will be resolved within 30-40 minutes or so. We understand this is short notice, but we do not want to take the risk of the other drive failing and being forced to reinstall and restore backups.

Investigating issue with Thunder server
  • Resolved
    Resolved

    We are closing this incident for now. We are still working with the datacenter for a fix. This seems to be related to Almalinux and the way it handles the grub and boot partitions when the disk drives are in a raid format. We are exploring options to bypass this can copy the EFI boot partitions across all drives.

  • Monitoring
    Update

    We found that the server for some reason just dropped all hard drives from the server. Upon doing a full server reset, the drives then showed back up. We are having the datacenter looking for a resolution so this does not happen again. Hopefully it can be a firmware update of some sort and not actually the motherboard itself. Either way, we will update as we gather more information on a resolution to this.

  • Monitoring
    Monitoring

    The server is back online, but we still need to investigate the reason for the crash and boot into bios. We will update this when we have more info.

  • Investigating
    Update

    We have found that the server crashed and rebooted to the bios screen. The datacenter is currently looking into this to see if it is a hardware issue.

  • Investigating
    Investigating

    We are currently investigating an issue with the server Thunder. Our engineers have been alerted and further details will be provided if necessary.

Unable to access Cpanels on all servers
  • Resolved
    Resolved

    We just got ahold of cpanel and the licenses have been restored.

  • Identified
    Identified

    We just came to notice that all servers are not able to access cpanel and WHM. After further investigation we found that all our cpanel licenses are listed as suspended. This appears to be a billing error on Cpanel's side and we are already in contact with them.

    They recently migrated to a new billing system and it appears to be an error as outlined in this forums post on the cpanel's website.

    https://support.cpanel.net/hc/en-us/community/posts/22571293184023

Apr 2024 to Jun 2024

Next