Going Live with a Linux Server

Based on past mistakes by myself and others, here is a check-list before putting a Linux (or other Unix) server online:

  1. Run memtest86+ (or an equivalent program for other architectures) before going live, ideally run it before installing the OS. Run it again every time you upgrade the RAM.
  2. Reboot the machine after every significant change. EG if you install a new daemon then reboot it to make sure that the daemon starts correctly. It’s better to have 5 minutes of down-time for a scheduled reboot than a few hours of down-time after something goes wrong at 2AM.
  3. Make sure that every account that is used for cron jobs has it’s email directed somewhere that a human will see it. Make sure that root has it’s mail sent somewhere useful even if you don’t plan to have any root cron jobs.
  4. Make sure that ntpd is running and has at least two servers to look at. If you have a big site then run two NTP servers yourself and have each of them look to two servers in the outside world or one server and a GPS.
  5. Make sure that you have some sort of daily cron job doing basic log analysis. The Red Hat logwatch program is quite effective, then you need to have some way of making sure that you notice if an email stops being sent (getting 11 instead of 12 messages from logwatch in the morning won’t be noticed by most people).
  6. Make sure that when (not if) a hard drive in your RAID array dies then you will notice it.

Any suggestions on other things I can add?

3 comments to Going Live with a Linux Server

  • ZOG

    yes the memtest thing has bitten me a few times..
    also maybe (some of these are more basic security things):
    run nmap from a remote location to make sure no un-needed services are running.
    lock down ssh via sshd_config (no root logins, etc)
    if possible check switch port duplex settings are all matching what the eth interface thinks it should be, and after blasting with traffic no errors on ethernet interface or packet loss to nearby host.

  • Jon Holdsworth

    Keep a separate copy of /boot on a separate disk, and keep it up to date with the _exact_ same contents as the real /boot. Have a rescue disk handy with a grub installer on it and PRACTICE using it before you install anything.

  • Jamie

    May I make a suggestion? Using only two NTP servers is not that ideal. If one is wrong, which one do you believe? Ideally there should be one NTP server which is considered the “source of truth”, or at least three NTP servers so there is a tie-breaker in the event of a conflict.

Leave a Reply

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>