RAID 제어기 명령

Adapatec® CLI를 사용하여 RAID 컨트롤러 명령을 실행할 수 있습니다. 다음 명령은 사용할 수 있는 가장 일반적인 RAID 제어기 명령입니다.

/usr/Adaptec_Event_Monitor/arcconf getstatus 1

_GETSTATUS_는 작업 유형, 논리 드라이브 번호, 논리 드라이브 크기 및 작업 진행률을 나열합니다. 다음 항목과 같이 실행 중인 백그라운드 명령의 상태도 볼 수 있습니다.

최신 다시 빌드
동기화
논리 드라이브 마이그레이션
압축/확장

/usr/Adaptec_Event_Monitor/arcconf getconfig 1

_GETCONFIG_는 제어기, 논리 드라이브 및 물리 드라이브에 대한 정보를 나열합니다. 다음 항목과 같은 정보가 표시됩니다.

제어기 유형
BIOS, 부트 블록, 디바이스 드라이버 및 펌웨어 버전
물리적 디바이스 유형, 디바이스 ID, PFA 유무
물리적 디바이스 상태
격납장치 정보: 팬, 전원 공급 장치 및 온도

/usr/Adaptec_Event_Monitor/arcconf getlogs 1 device tabular

_GETLOGS_은(는) 제어기의 상태 및 이벤트 로그에 대한 액세스를 제공합니다. _DEVICE xxx_은(는) 제어기가 발견하는 모든 디바이스 오류에 대한 로그를 표시합니다.

GETLOGS 명령을 사용하여 작성되는 출력에 대해서는 다음 예를 참조하십시오.

driveErrorEntry
smartError.. ............................ false
vendorID ................................ WDC
serialNumber ............................ WD-XXX
wwn ..................................... xxxxxxxxxxxxxxxx - CC_FILTER
deviceID ................................ 10
productID ............................... WD1003FB
numParityErrors ......................... 0
linkFailures ............................ 0
hwErrors ................................ 0
abortedCmds ............................. 7
mediumErrors ............................ 20
smartWarning ............................ 0

/opt/MegaRAID/storcli/storcli64 /c0/eall/sall show all | grep -iE "det|cou|tem|SN|S.M|fir”

이 명령을 사용하여 특정 드라이브와 가능한 드라이브 오류를 보여줍니다. 다음 예제에서는 출력을 보여줍니다.

Drive /c0/e252/s0 - Detailed Information:
Shield Counter = 0
Media Error Count = 0
Other Error Count = 0
Drive Temperature = 24C (75.20 F)
Predictive Failure Count = 0
S.M.A.R.T alert flagged by drive = No
SN = XXXX
Firmware Revision = SN04

Drive /c0/e252/s1 - Detailed Information:
Shield Counter = 0
Media Error Count = 0
Other Error Count = 0
Drive Temperature = 22C (71.60 F)
Predictive Failure Count = 0
S.M.A.R.T alert flagged by drive = No
SN = xxxx
Firmware Revision = SN03

Drive /c0/e252/s2 - Detailed Information:
Shield Counter = 0
Media Error Count = 0
Other Error Count = 0
Drive Temperature = 21C (69.80 F)
Predictive Failure Count = 0
S.M.A.R.T alert flagged by drive = No
SN = xxxx
Firmware Revision = SN04

Drive /c0/e252/s3 - Detailed Information:
Shield Counter = 0
Media Error Count = 0
Other Error Count =
Drive Temperature = 23C (73.40 F)
Predictive Failure Count = 0
S.M.A.R.T alert flagged by drive = No
SN = xxxx
Firmware Revision = SN03

/opt/MegaRAID/storcli/storcli64 /c0/eall/sall show rebuild

이 명령은 모든 드라이브의 다시 빌드 상태 및 다시 빌드 완료에 대한 예상 시간을 표시합니다. 명령을 실행하면 다음 출력이 표시됩니다.

---------------------------------------------
Drive-ID Progress% Status Estimated Time Left
---------------------------------------------
/c0/e252/s0 - Not in progress
/c0/e252/s1 - Not in progress
/c0/e252/s2 - Not in progress
/c0/e252/s3 - Not in progress
---------------------------------------------

RAID alert "Spam"

기본 구성(/opt/Broadcom/mrmonitor/MegaMonitor/config-current.xml)의 '전역' 섹션을 변경합니다:

<global>
<severity level="FATAL">
<do-systemlog/>
<do-email/>
</severity>
<severity level="CRITICAL">
<do-email/>
<do-systemlog/>
</severity>
<severity level="WARNING">
<do-email/>
<do-systemlog/>
</severity>
<severity level="INFO"><do-systemlog/>
</severity>
</global>

다음과 같아 보이도록:

<global>
<severity level="FATAL">
<do-systemlog/>
<do-email/>
</severity>
<severity level="CRITICAL">
<do-email/>
<do-systemlog/>
</severity>
<severity level="WARNING">
<do-systemlog/>
</severity>
<severity level="INFO">
<do-systemlog/>
</severity>
</global>

""WARNING" 레벨에 대해 "do-email" 태그를 제거하십시오. 또는 보안 레벨을 "INFO"로 변경하십시오.

공통 드라이브 오류

가장 일반적인 드라이버 오류는 스마트 오류, 하드웨어 오류 및 매체 오류입니다. 드라이브에 장애가 발생하면 이 오류가 표시됩니다. 따라서 최대한 빨리 드라이브를 교체해야 합니다.

일반적이지는 않지만 중단된 명령은 또 다른 일반 오류입니다. 그러나 중단된 명령의 숫자가 증가하는 경우(예: 100개)에는 지원 케이스를 여십시오.

링크 오류에서 케이블을 재고정해야 하거나 교체해야 할 수도 있음을 나타냅니다.

지원 케이스 정보

지원 케이스를 열 때 다음 정보가 필요합니다.

Adaptec RAID 카드

지원 케이스를 열 때 arcconf getconfig 1/arcconf getlogs 1 device tabular의 전체 출력을 포함해야 합니다. 이 정보를 제공하면 지원 팀이 드라이브 순서, 어레이 멤버십, 어레이 기하학 및 케이블링 문제를 파악하는 데 도움이 됩니다. 이 정보는 유실된 RAID 구성 복구에 중요합니다. 초기 업데이트에서 재시작/전원 종료 권한을 부여하거나 핫스왑을 요청하면 지원 케이스 처리 속도가 빨라집니다.

Broadcom RAID 카드

다음 명령을 사용하여 Broadcom RAID 카드의 로그 파일을 가져올 수 있습니다. 지원 케이스에 이러한 로그 파일의 전체 출력을 포함시켜야 합니다.

/opt/MegaRAID/storcli/storcli64 /c0 show all
/opt/MegaRAID/storcli/storcli64 /c0 show TermLog
/opt/MegaRAID/storcli/storcli64 /c0 /eall /sall show all | grep -iE "det|cou|tem|SN|S.M|fir"
/opt/MegaRAID/storcli/storcli64 /c0 show TermLog

Storcli 설치

Linux에 대해서는 다음 단계를 따르세요.

서버에 대한 SSH
cd /tmp (또는 원하는 디렉토리)
wget http://downloads.service.softlayer.com/lsitools/1.14.12_StorCLI.zip
x.xx.xx_StorCLI.zip 압축 파일 추출
cd /tmp/storcli_all_os/Linux/(또는 다운로드한 디렉토리로 이동)
rpm -ivh storcli-x.xx.xx-x.noarch.rpm
스토어클리가 성공적으로 설치되었는지 확인

Vmware ESXi의 경우 다음 단계를 따르십시오.

/tmp 디렉토리로 이동합니다.

# cd /tmp
storcli를 다운로드하십시오.

# wget http://downloads.service.softlayer.com/lsitools/1.14.12_StorCLI.zip
파일의 압축을 푸십시오.

` # unzip 1.14.12_StorCLI.zip'
/tmp/storcli_all_os/Vmware-NDS/. 로 이동하십시오.

` # cd /tmp/storcli_all_os/Vmware-NDS/'
storcli를 설치하십시오.

# esxcli software vib install -v=/tmp/storcli_all_os/Vmware-NDS/vmware-esx-storcli-1.14.12.vib --no-sig-check

storcli가 설치된 후 다음 두 명령을 실행하여 서버의 디스크 상태를 확인할 수 있습니다.

ESXi의 경우 7.X, 다음 명령을 사용하십시오.

/opt/lsi/storcli64/storcli64 /c0 show all
/opt/lsi/storcli64/storcli64 /c0 show eventloginfo
/opt/lsi/storcli64/storcli64 /c0 /eall /sall show all | grep -iE "det|cou|tem|SN|S.M|fir"
/opt/lsi/storcli64/storcli64 /c0 show TermLog

RAID 구성 확인

다음 명령을 사용하여 RAID 구성을 확인하십시오.

/opt/lsi/storcli64/storcli64 /c0 show all
/opt/MegaRAID/storcli/storcli64 /c0 /eall /sall show all

출력에서 RAID 유형이 컬럼으로 나열되는 토폴로지 섹션을 찾으십시오.

문제를 해결하기 전에 모든 작업을 백업해야 합니다.