Insights

CodeBase: Nagios Plugin > LSI Hardware RAID Controller

Category: Resources

A simple and efficient way of grabbing the smart health of a scalable amount of disks on an LSI Hardware RAID Controller, exiting to Nagios statuses.

We are providing the following code to be used as an information resource only. It is licensed under GNU GPL. Any use of, or development of the resources provided is strictly unsupported and VooServers Ltd will not be liable for any loss of or damage to business or personal assets.

#!/bin/bash
#
#Dave Byrne
#LSI Hardware RAID S.M.A.R.T Check
#

#Where is storcli?
storclibin="/opt/MegaRAID/storcli/storcli64"

#Check if storcli actually exists, echo and exit warning if it isnt
if ! [ -e "$storclibin" ]
then
  echo "WARNING - StorCli Not Found"
  exit 1
fi

#Get number of underlying physical disks from storcli
underlyingdisks=`$storclibin /c0 show|grep "Physical Drives"|awk '{ print $4}'`

#Loop to grab the smart health result from all physical disks
counter=0
while [ $counter -lt $underlyingdisks ]; do
  #echo counter at $counter
  smartctl -d sat+megaraid,$counter -a /dev/sda|grep "SMART overall-health self-assessment test result" >> /usr/results.txt
  let counter=counter+1
done

#Analyse results and start giving exit codes

if grep -q FAILED "/usr/results.txt";
then
    #failed string found, time to work out what disk it was
    line=`awk '/FAILED/{ print NR; exit }' /usr/results.txt`
    #subtract 1 from line number as disks start at 0
    let line=line-1
    diskinfo=`smartctl -d sat+megaraid,$line -a /dev/sda|grep -E 'Device Model|Serial Number'`
    printf "CRITICAL - Device ID #$line FAILURE ImminentnDISK INFO: n$diskinfo n"
    rm -f /usr/results.txt
    exit 2
else
    #failed string not found, echo and exit ok
    echo "OK - All array disks healthy"
    rm -f /usr/results.txt
    exit 0
fi

This plugin is also published in Nagios Exchange HERE

More articles

Empty modern server room with large servers, wires and buttons. Data storage, cloud storage, mining farm.

What’s the latest on Cloud Repatriation?

Cloud repatriation is something that has been gathering momentum in 2024, particularly with companies looking to take control of their tech and I.T. infrastructure costs. In addition, many companies are becoming more conscious of their cyber security and want to have greater control over their defenses.

Read the article
blog default img

Securing Your Data in 2024: The Ransomware Threat

Today’s vast digital landscapes put more and more data into positions that are vulnerable to ransomware attacks. The new flavour of choice for attackers sees your data rendered unusable, encrypted or locked away until a considerable ransom is paid. The backup, disaster recovery and data protection strategies of old might not be enough to keep you safe.

Read the article