Fujitsu Notes

This covers specific setup notes for Fujitsu Primergy servers on EL6.


= iRMC Remote KVM Access Through a Firewall =
 
# Port-forward port 80 (i.e. <public ip>:41080 -> <ipmi ip>:80).
# Log in, then go to Network Settings -> Ports and Services -> HTTPS Port.
## The default is '443', but with multiple nodes they can't all listen on 443 (unless you have many external IPs). In this example, it is changed to '41443'.
## Save and log out.
# Update the firewall to forward <public ip>:41443 -> <ipmi ip>:41443.
# Connect to https://<public ip>:41443
 
Now iKVM will work.
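
The two port-forwards above can be written down as firewall rules. The following is a minimal sketch, assuming the firewall is a Linux machine using <span class="code">iptables</span>; the helper name <span class="code">irmc_forward_rules</span> and the sample addresses are hypothetical, and the ports are the ones used in the example above. It only prints the rules so they can be reviewed first. Depending on the firewall's existing policy, matching FORWARD rules may also be needed.

```shell
#!/bin/sh
# Sketch only: prints (does not run) iptables DNAT rules for the two
# port-forwards described above. irmc_forward_rules is a hypothetical helper.
irmc_forward_rules() {
    public_ip=$1   # external address of the firewall
    irmc_ip=$2     # address of the node's iRMC interface
    # <external port>:<iRMC port> pairs; 41080 -> 80 (HTTP), 41443 -> 41443 (HTTPS)
    for pair in 41080:80 41443:41443; do
        ext_port=${pair%%:*}
        int_port=${pair##*:}
        echo "iptables -t nat -A PREROUTING -d $public_ip -p tcp --dport $ext_port -j DNAT --to-destination $irmc_ip:$int_port"
    done
}
```

For example, <span class="code">irmc_forward_rules 203.0.113.10 10.20.4.1</span> prints one rule per forwarded port, using documentation-range addresses as stand-ins.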
 
= iRMC HTML5 Video Redirection =
 
Any server using iRMC S4 version 8.01 and up (8.24F current as of June 2016) has the option of using HTML5 rather than Java for video console redirection. To enable said functionality, from the iRMC web interface:
 
-> Console Redirection -> Video Redirection
 
And select HTML5 Viewer Enabled.
 
 
= Primergy RX200 S8 =
 
== BIOS ==
 
* <span class="code">Advanced</span>
** <span class="code">SATA Configuration</span>
*** <span class="code">SATA Mode</span> -> <span class="code">Disabled</span>
* <span class="code">Server Mgmt</span>
** <span class="code">Asset Tag</span> -> (short host name)
** <span class="code">Temperature Monitoring</span> -> <span class="code">Enabled</span>
* <span class="code">Boot</span>
** <span class="code">Bootup NumLock State</span> -> <span class="code">On</span>
** <span class="code">PXE Boot Option Retry</span> -> <span class="code">Enabled</span>
 
== RAID Controller (D3116C) ==
 
* <span class="code">Configuration Wizard</span>
** <span class="code">New Configuration</span> -> <span class="code">Next</span>
** Confirm clear config -> <span class="code">Yes</span>
** <span class="code">Manual Configuration</span> -> <span class="code">Next</span>
*** Press and hold <span class="code"><ctrl></span>, click to highlight all <span class="code">Drives</span> in the left pane -> <span class="code">Add to Array</span> -> <span class="code">Accept DG</span> -> <span class="code">Next</span>.
*** <span class="code">Add to SPAN</span> -> <span class="code">Next</span>
**** <span class="code">RAID Level</span>; 1-8 drives == <span class="code">RAID 5</span>, 9+ drives == <span class="code">RAID 6</span>
**** <span class="code">Write Policy</span> -> <span class="code">Write Back with BBU</span>
**** <span class="code">Select Size</span> -> Enter size in green text under right pane; <span class="code">R5</span> size for <span class="code">RAID 5</span>, <span class="code">R6</span> size for <span class="code">RAID 6</span>.
**** <span class="code">Accept</span> -> Confirm cache policy; <span class="code">Yes</span> -> <span class="code">Next</span>
** <span class="code">Accept</span>
** Save the configuration; <span class="code">Yes</span> -> Confirm existing data wipe; <span class="code">Yes</span>
** Click to select <span class="code">Set Boot Drive</span> -> <span class="code">Go</span> -> <span class="code">Back</span>
* <span class="code">Exit</span> -> Confirm exit; <span class="code">Yes</span>
 
Reboot.
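
The RAID level rule of thumb used above (up to 8 drives is RAID 5, 9 or more is RAID 6) can be sketched as a small helper that also estimates the usable array size, which is useful as a sanity check against the size shown in the controller's right pane. This is only an illustration; <span class="code">raid_plan</span> is a hypothetical name, not part of WebBIOS or MegaCli. It assumes identical drives, one drive's worth of parity for RAID 5, two for RAID 6, and at least three drives.

```shell
#!/bin/sh
# Sketch only: applies the rule of thumb from the list above
# (up to 8 drives -> RAID 5, 9 or more -> RAID 6) and estimates usable size.
# raid_plan is a hypothetical helper, not part of WebBIOS or MegaCli.
raid_plan() {
    drives=$1    # number of identical drives in the array
    size_gb=$2   # coerced size of one drive, in GB
    if [ "$drives" -lt 3 ]; then
        echo "RAID 5 needs at least 3 drives" >&2
        return 1
    fi
    awk -v n="$drives" -v s="$size_gb" 'BEGIN {
        # RAID 5 loses one drive to parity, RAID 6 loses two
        if (n <= 8) { level = "RAID 5"; parity = 1 }
        else        { level = "RAID 6"; parity = 2 }
        printf "%s %.3f\n", level, (n - parity) * s
    }'
}
```

Call it as <span class="code">raid_plan <number of drives> <drive size in GB></span>; it prints the chosen RAID level followed by the estimated usable size in GB.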
 
= Primergy RX300 S7 =  


== RAID ==

<source lang="bash">
rpm -Uvh https://alteeve.ca/files/Lib_Utils-1.00-09.noarch.rpm https://alteeve.ca/files/MegaCli-8.04.07-1.noarch.rpm


# This makes MegaCli64 app available without the full path
</source>

Once installed, verify that you can see your hardware:
* [[MegaCli64 Cheat Sheet]]
 
== Replacing a Failed Drive ==
 
Replacing a failed drive involves two steps:
 
# Identify the drive that is failing and gather the data needed to request the RMA.
# Swap the actual drive when the replacement arrives on the client site.
 
=== Identify the Failing Drive ===
 
If the drive has failed entirely, the red front LED on the drive should be lit, making identification and RMA request simple.  
 
However, if the drive has not yet failed, identifying the drive and confirming its pending failure requires a little extra work.
 
Identify the failed drive:

<source lang="bash">
MegaCli64 PDList aAll
</source>
<source lang="text">
<snip>
Enclosure Device ID: 10
Slot Number: 5
Drive's postion: DiskGroup: 0, Span: 0, Arm: 5
Enclosure position: 1
Device Id: 7
WWN: 5000C50054AE9C38
Sequence Number: 2
Media Error Count: 0
Other Error Count: 2
Predictive Failure Count: 0
Last Predictive Failure Event Seq Number: 0
PD Type: SAS

Raw Size: 279.396 GB [0x22ecb25c Sectors]
Non Coerced Size: 278.896 GB [0x22dcb25c Sectors]
Coerced Size: 278.875 GB [0x22dc0000 Sectors]
Firmware state: Online, Spun Up
Device Firmware Level: 5301
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x5000c50054ae9c39
SAS Address(1): 0x0
Connected Port Number: 0(path0)
Inquiry Data: SEAGATE ST9300653SS    53016XN1EMF2    @#87980
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: 6.0Gb/s
Link Speed: 6.0Gb/s
Media Type: Hard Disk Device
Drive Temperature :29C (84.20 F)
PI Eligibility:  No
Drive is formatted for PI information:  No
PI: No PI
Port-0 :
Port status: Active
Port's Linkspeed: 6.0Gb/s
Port-1 :
Port status: Active
Port's Linkspeed: 6.0Gb/s
Drive has flagged a S.M.A.R.T alert : No
</source>

Notice the line:

<source lang="text">
Other Error Count: 2
</source>

This is a sign of impending failure, despite SMART showing the drive as still healthy. To gather more details (which Fujitsu will require to verify the impending failure), run:

<syntaxhighlight lang="bash">
MegaCli64 -AdpEventLog -GetEvents -f raid_events.log -aALL
</syntaxhighlight>

When this finishes gathering data, it will create a file called <span class="code">raid_events.log</span>. Send this file to your Fujitsu support rep. They will validate the pending failure and issue an RMA.

=== Identifying The Drive Prior to Replacement ===

If the drive has failed, identifying the drive is as simple as looking at the front of the node for the drive with the red error LED lit.

If the drive has not failed yet, then you can use the identify command to blink the LED. That is what we'll cover here.

In the previous section, we identified the failed drive using the <span class="code">MegaCli64</span> command. You need to note the following:

<source lang="bash">
MegaCli64 PDList aAll
</source>
<source lang="text">
Enclosure Device ID: 10
Slot Number: 5
...
Other Error Count: 2
</source>

The two bits of information you need are the enclosure ID and slot number. In this case, that is <span class="code">10</span> and <span class="code">5</span>, respectively.

With that info, you can trigger the drive's red LED using the following command:

<source lang="bash">
MegaCli64 -PdLocate -start -physdrv [10:5] -aALL
</source>
<source lang="text">
Adapter: 0: Device at EnclId-10 SlotId-5  -- PD Locate Start Command was successfully sent to Firmware
Exit Code: 0x00
</source>

Once you've located the drive, you can stop the "locate" command using:

<source lang="bash">
MegaCli64 -PdLocate -stop -physdrv [10:5] -aALL
</source>
<source lang="text">
Adapter: 0: Device at EnclId-10 SlotId-5  -- PD Locate Stop Command was successfully sent to Firmware

Exit Code: 0x00
</source>
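
Looking up the enclosure ID and slot number by eye gets tedious on nodes with many drives. The following is a minimal sketch that scans <span class="code">MegaCli64 PDList aAll</span> output for drives with a non-zero error counter and prints them in the <span class="code">[enclosure:slot]</span> form used by <span class="code">-physdrv</span>. The helper name <span class="code">flag_suspect_drives</span> is hypothetical, and it assumes the field names and field order shown in the output in this section. A non-zero counter is only a hint that a drive deserves a closer look.

```shell
#!/bin/sh
# Sketch only: reads `MegaCli64 PDList aAll` output on stdin and prints one
# line per drive that has a non-zero media, other or predictive-failure count.
# flag_suspect_drives is a hypothetical helper name.
flag_suspect_drives() {
    awk -F': *' '
        /^Enclosure Device ID:/      { encl = $2 }
        /^Slot Number:/              { slot = $2 }
        /^Media Error Count:/        { media = $2 }
        /^Other Error Count:/        { other = $2 }
        /^Predictive Failure Count:/ {
            # This is the last of the three counters printed for each drive
            pred = $2
            if (media + other + pred > 0)
                printf "[%s:%s] media=%d other=%d predictive=%d\n", encl, slot, media, other, pred
        }
    '
}
```

Typical use would be <span class="code">MegaCli64 PDList aAll | flag_suspect_drives</span>.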
Now that you know which drive has failed, you can OFFLINE it in preparation for replacing it.

<source lang="bash">
MegaCli64 -PDOffline -physdrv [10:5] -aALL
</source>
<source lang="text">
Adapter: 0: EnclId-10 SlotId-5 state changed to OffLine.

Exit Code: 0x00
</source>

You can now physically remove the failed disk and insert the replacement disk.

=== Monitoring the Rebuild ===

When a replacement disk is inserted, the array should recognise it and automatically begin rebuilding the array. You can monitor this operation by running:

<syntaxhighlight lang="bash">
MegaCli64 -PDRbld -ProgDsply -PhysDrv [10:5] -aALL
</syntaxhighlight>

This will display the rebuild progress as a textual bar graph. A rebuild of a 300 [[GB]] 15,000rpm SAS drive in a 6-drive array took about 30 minutes. How long it takes in your case will vary depending on disk speed, array size and load.

== Checking the BBU ==

Check the BBU:

<syntaxhighlight lang="bash">
MegaCli64 AdpBbuCmd aAll
</syntaxhighlight>

Trigger a re-learn cycle:

<syntaxhighlight lang="bash">
MegaCli64 AdpBbuCmd BbuLearn aAll
</syntaxhighlight>

Monitor a learn cycle:

<syntaxhighlight lang="bash">
watch "MegaCli64 AdpBbuCmd aAll | grep -i -e learn -e status -e charge -e operation"
</syntaxhighlight>

= Updating Firmware =

Tested on:
* RX200 S8

== Dependencies ==

<syntaxhighlight lang="bash">
yum install kernel-headers gcc gcc-c++ compat-libstdc++-33.i686 libstdc++.i686 libstdc++-devel.i686 kernel-devel
</syntaxhighlight>

== iRMC ==

{{warning|1=Update the iRMC before updating the BIOS.}}

Go to: [http://support.ts.fujitsu.com/IndexDownload.asp?lng=COM Fujitsu Downloads]

Select:
* "Product Search"
* Enter the server's model number in the 'Product Search' text field. For example, "<span class="code">RX2540M1</span>". Note that you can enter a partial model number and it will ask you to select the proper model from a list.
* Click the 'Selected operating system' drop down list and choose "Red Hat Enterprise Linux 6 (x86_64)". Note that you might need to click on the selection box twice to get the pop-up menu with the OS selection list to appear.
* Click on the "Driver" tab.
* Click to expand "Server Management Controller"
** Click to expand "iRMC S4 (Kronos 4) (Onboard on D3289-A1x)"
** Verify the title is 'iRMC S4 (Kronos4) Firmware - RX2540 M1 (ASP for Linux)'. If so, click on "Direct Download" on the left.

{{warning|1=Read and understand the notes and warnings!}}

* When you are ready, click to check the "<span class="code">Terms of Use</span>" checkbox and then click on "<span class="code">Download File</span>".
* Locate the file on your computer and extract the zip file.
* Copy the <span class="code">RX2540M1_MangtCtr_<version>.scexe</span> file to the node.

{{note|1=In this tutorial, the file name we're using is '<span class="code">RX2540M1_MangtCtr_FW0824F_SDR367.scexe</span>'.}}

Log into the node and run:

<syntaxhighlight lang="bash">
# -u == update only, don't reflash or downgrade
# -k == keep extracted files.
sh RX2540M1_MangtCtr_FW0824F_SDR367.scexe -u -k
</syntaxhighlight>

This will start the iRMC firmware update. You need to confirm:

<syntaxhighlight lang="text">
</syntaxhighlight>

'''The install will take a while, be patient!'''

<syntaxhighlight lang="text">
#...
</syntaxhighlight>

When it's done, you will hear the fans spin.

== BIOS ==

{{note|1=If you already selected the machine type in the previous section, you will not need to select the machine type or operating system a second time.}}

Go to: [http://support.ts.fujitsu.com/IndexDownload.asp?lng=COM Fujitsu Downloads]

Select:
* "Product Search"
* Enter the server's model number in the 'Product Search' text field. For example, "<span class="code">RX2540M1</span>". Note that you can enter a partial model number and it will ask you to select the proper model from a list.
* Click the 'Selected operating system' drop down list and choose "Red Hat Enterprise Linux 6 (x86_64)". Note that you might need to click on the selection box twice to get the pop-up menu with the OS selection list to appear.
* Click on the "BIOS" tab.
* Click to expand "Flash - BIOS"
** Click to expand "Flash BIOS for D3289-A1x" (the final ID may differ depending on your machine)
** Verify the title is 'Flash BIOS RX2540 M1 (ASP for Linux)'. If so, click on "Direct Download" on the left.

{{warning|1=Read and understand the notes and warnings!}}

* When you are ready, click to check the "<span class="code">Terms of Use</span>" checkbox and then click on "<span class="code">Download File</span>".

Extract on the target and run:

<syntaxhighlight lang="bash">
# -u == update only, don't reflash or downgrade
# -k == keep extracted files.
sh RX200S8_D3302_BiosV4654_R160.scexe -u -k
</syntaxhighlight>

<syntaxhighlight lang="text">
***************** PRIMERGY Autonomous Support Package ***************

Description: Flash BIOS RX200 S8
VersionMajor: V4.6.5.4
VersionMinor: R1.6.0
VersionBuild: 1.0.0
Software Class - Category: Flash - BIOS
Software Class - Name: (SV) Flash Bios
Vendor: Fujitsu Technology Solutions

*********************************************************************

Continue processing this ASP?
Please answer: yes/y or no/n
</syntaxhighlight>
 
ACK
 
<syntaxhighlight lang="text">
*********************************************************************


                          CAUTION!

        Currently a new version is being installed.

      The installation process will take a long time.

                  .... please wait ....

          Don't interrupt this installation process!



*********************************************************************
*********************************************************************


                          FAILED!

        The new version was not correctly installed.


*********************************************************************
</syntaxhighlight>

In this case, the BIOS was already up to date. Verify via 'dmidecode':

<syntaxhighlight lang="bash">
# dmidecode 2.12
SMBIOS 2.7 present.
88 structures occupying 4084 bytes.
Table at 0x7C8BD018.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
        Vendor: FUJITSU // American Megatrends Inc.
        Version: V4.6.5.4 R1.6.0 for D3302-A1x
        Release Date: 01/30/2014
        Address: 0xF0000
        Runtime Size: 64 kB
        ROM Size: 13248 kB
        Characteristics:
                PCI is supported
                BIOS is upgradeable
                BIOS shadowing is allowed
                Boot from CD is supported
                Selectable boot is supported
                EDD is supported
                Print screen service is supported (int 5h)
                Serial services are supported (int 14h)
                Printer services are supported (int 17h)
                ACPI is supported
                USB legacy is supported
                BIOS boot specification is supported
                Targeted content distribution is supported
                UEFI is supported
        BIOS Revision: 1.6
</syntaxhighlight>

And compare against the latest version on the download page under "BIOS Update - Admin Pack for D3302-A1x"

<syntaxhighlight lang="text">
V4.6.5.4 - R1.6.0
</syntaxhighlight>

{{warning|1=If the update succeeds, immediately reboot after.}}

Done!

== D3116C RAID Controller ==

{{warning|1=Immediately reboot when update completes!}}

Go to: [http://support.ts.fujitsu.com/Download/Index.asp?lng=COM&lnid=4 RX*** Downloads]

Select the machine, RHEL 6 x86_64, choose SAS RAID - RAID Ctrl SAS 6G 1GB (D3116C) - Firmware for RAID Ctrl SAS 6G D3116C (ASP for Linux)

Extract on the target and run:

<syntaxhighlight lang="bash">
# -u == update only, don't reflash or downgrade
# -k == keep extracted files.
sh FTS_FirmwareforRAIDCtrlSAS6GD3116CASPforLinu_2390003332201252420100_1106185.SCEXE -u -k
</syntaxhighlight>

<syntaxhighlight lang="text">
***************** PRIMERGY Autonomous Support Package ***************

Description: Firmware for RAID Ctrl SAS 6G D3116C
VersionMajor: 23.9.0-0033
VersionMinor: 3.220.125-2420
VersionBuild: 1.0.0
Software Class - Category: Flash - Firmware
Software Class - Name: (SV) Flash - Firmware
Vendor: LSI Logic

*********************************************************************

Continue processing this ASP?
Please answer: yes/y or no/n
</syntaxhighlight>

ACK

<syntaxhighlight lang="text">
*********************************************************************


                          CAUTION!

        Currently a new version is being installed.

      The installation process will take a long time.

                  .... please wait ....

          Don't interrupt this installation process!


and:


<source lang="bash">
MegaCli64 LDInfo Lall aAll
</source>
<source lang="text">
Adapter 0 -- Virtual Drive Information:
Virtual Drive: 0 (Target Id: 0)
Name                :
RAID Level          : Primary-5, Secondary-0, RAID Level Qualifier-3
Size                : 836.625 GB
Parity Size        : 278.875 GB
State              : Optimal
Strip Size          : 64 KB
Number Of Drives    : 4
Span Depth          : 1
Default Cache Policy: WriteBack, ReadAhead, Direct, No Write Cache if Bad BBU
Current Cache Policy: WriteBack, ReadAhead, Direct, No Write Cache if Bad BBU
Default Access Policy: Read/Write
Current Access Policy: Read/Write
Disk Cache Policy  : Disk's Default
Encryption Type    : None
Default Power Savings Policy: Controller Defined
Current Power Savings Policy: Automatic
Can spin up in 1 minute: Yes
LD has drives that support T10 power conditions: Yes
LD's IO profile supports MAX power savings with cached writes: Yes
Is VD Cached: No


*********************************************************************
*********************************************************************




Exit Code: 0x00
                      Attention!
</source>


and:
                    Successfully flashed.
                      Reboot your system.


<source lang="bash">
MegaCli64 AdpBbuCmd aAll
</source>
<source lang="text">
BBU status for Adapter: 0


BatteryType: iBBU-09
Voltage: 4084 mV
Current: 0 mA
Temperature: 30 C
Battery State    : Operational


BBU Firmware Status:


  Charging Status              : None
*********************************************************************
  Voltage                                : OK
</syntaxhighlight>
  Temperature                            : OK
  Learn Cycle Requested                   : No
  Learn Cycle Active                      : No
  Learn Cycle Status                      : OK
  Learn Cycle Timeout                    : No
  I2c Errors Detected                    : No
  Battery Pack Missing                    : No
  Battery Replacement required            : No
  Remaining Capacity Low                  : No
  Periodic Learn Required                : No
  Transparent Learn                      : No
  No space to cache offload              : No
  Pack is about to fail & should be replaced : No
  Cache Offload premium feature required  : No
  Module microcode update required        : No
BBU GasGauge Status: 0x0280
  Relative State of Charge: 100 %
  Charger System State: 1
  Charger System Ctrl: 0
  Charging current: 0 mA
  Absolute state of charge: 0 %
  Max Error: 0 %


  Battery backup charge time : 48 hours +
'''''REBOOT'''''


BBU Capacity Info for Adapter: 0
<syntaxhighlight lang="bash">
reboot
</syntaxhighlight>


  Relative State of Charge: 100 %
Once rebooted, reset the controller to factory defaults. This will _not_ effect the Logical Disk! If you followed the [[AN!Cluster Tutorial 2]], then you will not need to change any settings. If does require another reboot though.
  Absolute State of charge: 86 %
  Remaining Capacity: 1294 mAh
  Full Charge Capacity: 1294 mAh
  Run time to empty: Battery is not being discharged.
  Average time to empty: 2 Hour, 35 Min.  
  Estimated Time to full recharge: Battery is not being charged.
  Cycle Count: 3


BBU Design Info for Adapter: 0
Record the current settings (we'll diff after the reset):


  Date of Manufacture: 12/27, 2011
<syntaxhighlight lang="bash">
  Design Capacity: 1500 mAh
MegaCli64 AdpAllInfo aAll > Adapter.pre-reset
  Design Voltage: 4100 mV
MegaCli64 LDInfo Lall aAll > LD.pre-reset
  Specification Info: 0
</syntaxhighlight>
  Serial Number: 2641
  Pack Stat Configuration: 0x0000
  Manufacture Name: LS36691
  Device Name: iBBU-09
  Device Chemistry: LION
  Battery FRU: N/A
  Transparent Learn = 0
  App Data = 0


BBU Properties for Adapter: 0
Reset:


  Auto Learn Period: 30 Days
<syntaxhighlight lang="bash">
  Next Learn time: Sep 4 2012 01:24:20 
MegaCli64 AdpFacDefSet a0
  Learn Delay Interval:0 Hours
</syntaxhighlight>
  Auto-Learn Mode: Enabled
<syntaxhighlight lang="text">
  BBU Mode = 5
Adapter 0: Factory Default Set Successfully.
Please reboot the system for the changes to take effect


Exit Code: 0x00
Exit Code: 0x00
</source>
</syntaxhighlight>
 
Now reboot again.
 
<syntaxhighlight lang="bash">
reboot
</syntaxhighlight>
 
Now dump the new settings and check for changes:
 
<syntaxhighlight lang="bash">
MegaCli64 AdpAllInfo aAll > Adapter.post-reset
MegaCli64 LDInfo Lall aAll > LD.post-reset
</syntaxhighlight>
 
Check for differences:
 
<syntaxhighlight lang="bash">
diff -u Adapter.pre-reset Adapter.post-reset
</syntaxhighlight>
<syntaxhighlight lang="diff">
--- Adapter.pre-reset 2014-04-14 15:48:35.011122984 -0400
+++ Adapter.post-reset 2014-04-14 15:52:27.445603604 -0400
@@ -72,11 +72,11 @@
Temperature sensor for ROC    : Present
Temperature sensor for controller    : Absent
-ROC temperature : 74  degree Celsius
+ROC temperature : 75  degree Celsius
                Settings
                ================
-Current Time                    : 19:48:35 4/14, 2014
+Current Time                    : 19:52:26 4/14, 2014
Predictive Fail Poll Interval    : 300sec
Interrupt Throttle Active Count  : 16
Interrupt Throttle Completion    : 50us
</syntaxhighlight>
 
<syntaxhighlight lang="bash">
diff -u LD.pre-reset LD.post-reset
</syntaxhighlight>
<syntaxhighlight lang="bash">
# no output
</syntaxhighlight>


Done!


= Creating a Backup Set =
ToDo: Explain this...


<syntaxhighlight lang="bash">
mkdir -p /root/base
cd /root/base
mkdir -p /root/base/root
mkdir -p /root/base/etc/sysconfig/network-scripts
mkdir -p /root/base/etc/udev/rules.d
mkdir -p /root/base/etc/init.d
mkdir -p /root/base/var/spool/cron

# Root user
rsync -av /root/.bashrc   /root/base/root/
rsync -av /root/.ssh      /root/base/root/
rsync -av /root/an-cm*    /root/base/root/
rsync -av /root/archive_* /root/base/root/

# Directories
rsync -av /etc/ssh     /root/base/etc/
rsync -av /etc/apcupsd /root/base/etc/
rsync -av /etc/cluster /root/base/etc/
rsync -av /etc/drbd.*  /root/base/etc/
rsync -av /etc/an      /root/base/etc/
rsync -av /etc/yum     /root/base/etc/
rsync -av /etc/pki     /root/base/etc/
rsync -av --exclude 'archive' --exclude 'cache' --exclude 'backup' /etc/lvm /root/base/etc/

# Specific files.
rsync -av /etc/sysconfig/network-scripts/ifcfg-{eth*,bond*,vbr*} /root/base/etc/sysconfig/network-scripts/
rsync -av /etc/udev/rules.d/70-persistent-net.rules              /root/base/etc/udev/rules.d/
rsync -av /etc/sysconfig/network /root/base/etc/sysconfig/
rsync -av /etc/hosts             /root/base/etc/
rsync -av /etc/ntp.conf          /root/base/etc/
rsync -av /etc/init.d/apcupsd    /root/base/etc/init.d/
rsync -av /var/spool/cron/root   /root/base/var/spool/cron/

# Save recreating user accounts.
rsync -av /etc/passwd            /root/base/etc/
rsync -av /etc/group             /root/base/etc/
rsync -av /etc/shadow            /root/base/etc/
rsync -av /etc/gshadow           /root/base/etc/

# If you have the cluster built and want to back up its configs.
# ('-p' because the directory rsyncs above may already have created these.)
mkdir -p /root/base/etc/cluster
mkdir -p /root/base/etc/lvm
rsync -av /etc/cluster/cluster.conf /root/base/etc/cluster/

# NOTE: DRBD won't work until you've manually created the partitions.
rsync -av /etc/drbd.d /root/base/etc/

# If you're running RHEL and want to back up your registration info;
if [ -e "/etc/sysconfig/rhn" ]
then
        rsync -av /etc/sysconfig/rhn /root/base/etc/sysconfig/
fi

# Back up the logical and extended partition structure.
# The header is written once, before the loop, so that a second disk does not
# truncate the commands generated for the first. Single quotes stop an
# interactive shell from treating '!' as history expansion.
echo '#!/bin/bash' > /root/base/root/partition_drives.sh
for d in $(fdisk -l | grep 'Disk /dev' | grep -v mapper | sed 's/Disk \(.*\):.*/\1/')
do
        for i in $(parted -m -s -a opt $d "print free" | grep '^[4-9]')
        do
                if [ `echo $i | grep '^4:'` ]
                then
                        echo "$d:$i" | perl -pe 's/^(.*?):(\d+):(.*?):(.*?):.*/parted -s -a opt \1 "mkpart extended \3 \4"/'
                else
                        echo "$d:$i" | perl -pe 's/^(.*?):(\d+):(.*?):(.*?):.*/parted -s -a opt \1 "mkpart logical \3 \4"/'
                fi
        done
done >> /root/base/root/partition_drives.sh
chmod 755 /root/base/root/partition_drives.sh

# Pack it up
# NOTE: The archive is named after this node's short host name.
# Paths are stored relative to /root/base so that the archive can later be
# unpacked over the new install with 'tar -xvf ... -C /'.
cd /root/
tar -cvf /root/base_$(hostname -s).tar -C /root/base etc root var
ls -lah /root/base_*
</syntaxhighlight>
 
Now copy it to your [[PXE]] server. In my case, that is <span class="code">10.255.255.250</span>. I am backing up [[RHEL6]] nodes, so I store my backups in <span class="code">/var/www/html/rhel6/x86_64/files/</span>.
 
<syntaxhighlight lang="bash">
rsync -av /root/base_$(hostname -s).tar root@10.255.255.250:/var/www/html/rhel6/x86_64/files/
</syntaxhighlight>
<syntaxhighlight lang="text">
root@10.255.255.250's password:
sending incremental file list
base_an-c05n02.tar
 
sent 4045378 bytes  received 31 bytes  898979.78 bytes/sec
total size is 4044800  speedup is 1.00
</syntaxhighlight>
 
Now that it is on the server, I can use the following <span class="code">%post</span> [[kickstart]] script entry in dedicated, per-node kickstart scripts.
 
<syntaxhighlight lang="bash">
%post
# Download the backup files and load them.
cd ~
wget http://10.255.255.250/rhel6/x86_64/files/base_an-c05n01.tar
cp base_an-c05n01.tar /mnt/sysimage/root/
/etc/init.d/network stop
tar -xvf base_an-c05n01.tar -C /
rm -f /etc/udev/rules.d/70-persistent-net.rules
start_udev
/etc/init.d/network start
/mnt/systemroot/root/partition_drives.sh
</syntaxhighlight>
 
When the install finishes, that will load all the files we backed up. So when the node reboots, all of its old [[RHN]] registration, network configs and so on will be restored. This should dramatically reduce recovery time!
 
= Specific Fujitsu Model Notes =
 
== RX200 S7 ==
 
=== BIOS Changes for use in Anvil!s ===
 
To enter the [[BIOS]], press <span class="code">F2</span> during POST.
 
* <span class="code">Advanced</span>
** <span class="code">SATA Configuration</span>
*** <span class="code">SATA Mode</span> -> <span class="code">Disabled</span> (if no optical drive)
*** - OR -
*** <span class="code">SATA Mode</span> -> <span class="code">AHCI Mode</span> (if optical drive)
** <span class="code">Onboard Device Configuration</span>
*** <span class="code">LAN 2 Oprom</span> -> <span class="code">PXE</span>
*** <span class="code">Onboard SAS/SATA (SCU)</span> -> <span class="code">Disabled</span> (if no optical drive)
* <span class="code">Server Mgmt</span>
** <span class="code">Asset Tag</span> -> short host name of the node, lower case
** <span class="code">Temperature Monitoring</span> -> <span class="code">Enabled</span>
* <span class="code">Boot</span>
** <span class="code">Bootup NumLock State</span> -> <span class="code">On</span>
Save and Exit
 
=== LSI RAID Controller Setup ===
 
{{warning|1=These instructions assume NO existing data. This is a destructive process!}}
 
After [[POST]], press <span class="code"><ctrl></span> + <span class="code">H</span> to enter the controller's WebBIOS
 
* Choose the controller (usually only one available), click <span class="code">Start</span>
* Choose <span class="code">Configuration Wizard</span>
** Choose <span class="code">New Configuration</span> -> click <span class="code">Next</span>
** Confirm the clear by clicking <span class="code">Yes</span>
** Choose <span class="code">Manual Configuration</span> -> click <span class="code">Next</span>
*** Click on the first drive, usually <span class="code">Slot: 0,...</span>, press and hold <span class="code"><ctrl></span> button and then click to select the rest of the drives. Click <span class="code">Add to Array</span>.
*** Click on <span class="code">Accept DG</span> to create the drive group then click on <span class="code">Next</span>
*** Click on <span class="code">Add to SPAN</span>, then click on <span class="code">Next</span>
*** Configure the array;
**** For 1 to 8 drives;
***** <span class="code">RAID Level</span> -> <span class="code">RAID 5</span>
***** <span class="code">Write Policy</span> -> <span class="code">Write Back with BBU</span>
***** Look at the <span class="code">R5:xxxx</span> size on the right and enter that size in the <span class="code">Select Size</span> section. Be sure to match the <span class="code">GB</span> or <span class="code">TB</span> suffix.
**** For 9 or more drives;
***** <span class="code">RAID Level</span> -> <span class="code">RAID 6</span>
***** <span class="code">Write Policy</span> -> <span class="code">Write Back with BBU</span>
***** Look at the <span class="code">R6:xxxx</span> size on the right and enter that size in the <span class="code">Select Size</span> section. Be sure to match the <span class="code">GB</span> or <span class="code">TB</span> suffix.
**** Click on <span class="code">Accept</span> then click on <span class="code">Yes</span> to accept the warning.
**** The virtual disk will now be shown on the right. Click <span class="code">Next</span> to proceed.
*** Click <span class="code">Accept</span> and then click <span class="code">Yes</span> to save the configuration. Click <span class="code">Yes</span> to acknowledge the warning and initialize the drive.
** Click on <span class="code">Set Boot Drive</span> and then click <span class="code">Go</span>.
** Click on <span class="code">Home</span>.
* Click on <span class="code">Exit</span> and then on <span class="code">Yes</span>.
 
Reboot and you are done.






{{footer}}

Latest revision as of 21:50, 7 July 2016

This covers specific setup notes for Fujitsu Primergy servers on EL6.

iRMC Remote KVM Access Through a Firewall

  1. Port-forward port 80 (e.g. <public ip>:41080 -> <ipmi ip>:80).
  2. Log in; Network Settings -> Ports and Services -> HTTPS Port
    1. The default is '443', but with multiple nodes they can't all listen on 443 (unless you have many external IPs). So in this example, I change this to '41443'.
    2. Save, log out.
  3. Update the firewall to forward <public ip>:41443 -> <ipmi ip>:41443
  4. Connect to https://<public ip>:41443

Now iKVM will work.
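
If the firewall is a Linux machine, the two forwards above could be written as iptables DNAT rules. This is only a rough sketch: the function name irmc_forward_rules and the example addresses are placeholders of mine, not part of this setup, and it prints the rules for review rather than applying them.

```shell
# Print (not apply) DNAT rules for the two forwards described above.
# Usage: irmc_forward_rules PUBLIC_IP IPMI_IP
irmc_forward_rules() {
    public_ip="$1"
    ipmi_ip="$2"
    # HTTP:  <public ip>:41080 -> <ipmi ip>:80
    # HTTPS: <public ip>:41443 -> <ipmi ip>:41443
    for map in 41080:80 41443:41443; do
        public_port="${map%%:*}"
        ipmi_port="${map##*:}"
        echo "iptables -t nat -A PREROUTING -p tcp -d ${public_ip} --dport ${public_port} -j DNAT --to-destination ${ipmi_ip}:${ipmi_port}"
    done
}
```

For example, "irmc_forward_rules 203.0.113.10 10.20.1.1" prints two rules. Depending on the firewall's default policy, matching FORWARD rules may be needed as well.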

iRMC HTML5 Video Redirection

Any server using iRMC S4 version 8.01 and up (8.24F current as of June 2016) has the option of using HTML5 rather than Java for video console redirection. To enable said functionality, from the iRMC web interface:

-> Console Redirection -> Video Redirection

And select HTML5 Viewer Enabled.


Primergy RX200 S8

BIOS

  • Advanced
    • SATA Configuration
      • SATA Mode -> Disabled
  • Server Mgmt
    • Asset Tag -> (short host name)
    • Temperature Monitoring -> Enabled
  • Boot
    • Bootup NumLock State -> On
    • PXE Boot Option Retry -> Enabled

RAID Controller (D3116C)

  • Configuration Wizard
    • New Configuration -> Next
    • Confirm clear config -> Yes
    • Manual Configuration -> Next
      • Press and hold <ctrl>, Click to highlight all Drives in left pane -> Add to Array -> Accept DG -> Next.
      • Add to SPAN -> Next
        • RAID Level; 1-8 drives == RAID 5, 9+ drives == RAID 6
        • Write Policy -> Write Back with BBU
        • Select Size -> Enter size in green text under right pane; R5 size for RAID 5, R6 size for RAID 6.
        • Accept -> Confirm cache policy; Yes -> Next
    • Accept
    • Save the configuration; Yes -> Confirm existing data wipe; Yes
    • Click to select Set Boot Drive -> Go -> Back
  • Exit -> Confirm exit; Yes

Reboot.
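
The RAID level rule, and roughly what size to expect in the green text, can be sanity-checked with a little arithmetic. This is a sketch only: the function names are mine, it assumes all drives share one coerced size, and the size WebBIOS shows remains the one to enter.

```shell
# Pick the RAID level per the rule above: 1-8 drives -> RAID 5, 9+ -> RAID 6.
raid_level() {
    if [ "$1" -le 8 ]; then echo 5; else echo 6; fi
}

# Usable size = (drives - parity drives) * per-drive coerced size in GB.
# RAID 5 spends one drive's worth of space on parity, RAID 6 spends two.
# Usage: raid_usable_gb DRIVE_COUNT DRIVE_SIZE_GB
raid_usable_gb() {
    drives="$1"
    drive_gb="$2"
    level="$(raid_level "$drives")"
    awk -v n="$drives" -v s="$drive_gb" -v l="$level" \
        'BEGIN { p = (l == 5) ? 1 : 2; printf "%.3f\n", (n - p) * s }'
}
```

Calling "raid_usable_gb 4 278.875" works out (4 - 1) x 278.875 GB, which is the 836.625 GB that LDInfo reports for a four-drive RAID 5 array of 278.875 GB drives.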

Primergy RX300 S7

RAID

Install the MegaCLI tools;

Check for an updated MegaCLI from here (under "Management Software and Tools"). If there is an updated version, follow the steps below, adjusting the file names to match the version you downloaded.

mkdir ~/temp
cd ~/temp
# Download the updated 8.04.07_MegaCLI.zip here
unzip 8.04.07_MegaCLI.zip
unzip CLI_Lin_8.04.07.zip 
unzip MegaCliLin.zip
rpm -Uvh MegaCli-8.04.07-1.noarch.rpm Lib_Utils-1.00-09.noarch.rpm

# This makes MegaCli64 app available without the full path
ln -s /opt/MegaRAID/MegaCli/MegaCli64 /sbin/

If you want to install from the AN!Cache, you can do this;

rpm -Uvh https://alteeve.ca/files/Lib_Utils-1.00-09.noarch.rpm https://alteeve.ca/files/MegaCli-8.04.07-1.noarch.rpm

# This makes MegaCli64 app available without the full path
ln -s /opt/MegaRAID/MegaCli/MegaCli64 /sbin/

Once installed, verify that you can see your hardware:

Replacing a Failed Drive

Replacing a failed drive involves two steps;

  1. Identify the failing drive and gather the data needed to request the RMA.
  2. Swap the drive when the replacement arrives at the client site.

Identify the Failing Drive

If the drive has failed entirely, the red front LED on the drive should be lit, making identification and RMA request simple.

However, if the drive has not yet failed, identifying the drive and confirming its pending failure requires a little extra work.

Identify the failed drive:

MegaCli64 PDList aAll
<snip>

Enclosure Device ID: 10
Slot Number: 5
Drive's postion: DiskGroup: 0, Span: 0, Arm: 5
Enclosure position: 1
Device Id: 7
WWN: 5000C50054AE9C38
Sequence Number: 2
Media Error Count: 0
Other Error Count: 2
Predictive Failure Count: 0
Last Predictive Failure Event Seq Number: 0
PD Type: SAS

Raw Size: 279.396 GB [0x22ecb25c Sectors]
Non Coerced Size: 278.896 GB [0x22dcb25c Sectors]
Coerced Size: 278.875 GB [0x22dc0000 Sectors]
Firmware state: Online, Spun Up
Device Firmware Level: 5301
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x5000c50054ae9c39
SAS Address(1): 0x0
Connected Port Number: 0(path0)
Inquiry Data: SEAGATE ST9300653SS     53016XN1EMF2    @#87980
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: 6.0Gb/s
Link Speed: 6.0Gb/s
Media Type: Hard Disk Device
Drive Temperature :29C (84.20 F)
PI Eligibility:  No
Drive is formatted for PI information:  No
PI: No PI
Port-0 :
Port status: Active
Port's Linkspeed: 6.0Gb/s
Port-1 :
Port status: Active
Port's Linkspeed: 6.0Gb/s
Drive has flagged a S.M.A.R.T alert : No

Notice the line:

Other Error Count: 2

This is a sign of impending failure, despite SMART showing the drive as still healthy. To gather more details (which Fujitsu will require to verify the impending failure), run:

MegaCli64 -AdpEventLog -GetEvents -f raid_events.log -aALL

When this finishes gathering data, it will create a file called raid_events.log. Send this file to your Fujitsu support rep. They will validate the pending failure and issue an RMA.
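
With many drives, reading the whole PDList output by eye gets tedious. As a rough sketch (the function name pd_error_report is mine, and it assumes the field names shown in the output above), the following filters the listing down to drives with a non-zero error counter:

```shell
# Read "MegaCli64 PDList aAll" output on stdin and print one line for each
# non-zero Media, Other or Predictive Failure counter, tagged with the
# drive's [enclosure:slot].
pd_error_report() {
    awk -F': *' '
        /^Enclosure Device ID/ { encl = $2 }
        /^Slot Number/         { slot = $2 }
        /^(Media Error|Other Error|Predictive Failure) Count/ {
            if ($2 + 0 > 0) printf "[%s:%s] %s: %d\n", encl, slot, $1, $2
        }
    '
}
```

On a node, this would be used as "MegaCli64 PDList aAll | pd_error_report". No output means no drive reported a non-zero counter.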

Identifying The Drive Prior to Replacement

If the drive has failed, identifying the drive is as simple as looking at the front of the node for the drive with the red error LED lit.

If the drive has not failed yet, then you can use the identify command to blink the LED. That is what we'll cover here.

In the previous section, we identified the failed drive using the MegaCli64 command. You need to note the following;

MegaCli64 PDList aAll
Enclosure Device ID: 10
Slot Number: 5
...
Other Error Count: 2

The two bits of information you need are the enclosure ID and slot number. In this case, that is 10 and 5, respectively.

With that info, you can trigger the drive's red LED using the following command;

MegaCli64 -PdLocate -start -physdrv [10:5] -aALL
Adapter: 0: Device at EnclId-10 SlotId-5  -- PD Locate Start Command was successfully sent to Firmware 

Exit Code: 0x00

Once you've located the drive, you can stop the "locate" command using:

MegaCli64 -PdLocate -stop -physdrv [10:5] -aALL
Adapter: 0: Device at EnclId-10 SlotId-5  -- PD Locate Stop Command was successfully sent to Firmware 

Exit Code: 0x00

Now that you know which drive has failed, you can OFFLINE it in preparation for replacing it.

MegaCli64 -PDOffline -physdrv [10:5] -aALL
Adapter: 0: EnclId-10 SlotId-5 state changed to OffLine.

Exit Code: 0x00

You can now physically remove the failed disk and insert the replacement disk.
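
Because a mistyped enclosure or slot number would offline the wrong drive, it can help to generate the three commands above from one pair of values and review them before running anything. A small sketch; the function name pd_replace_commands is mine, and it only prints the commands:

```shell
# Print the locate-start, locate-stop and offline commands for one drive.
# Nothing is executed; copy the lines you want once you have checked them.
# Usage: pd_replace_commands ENCLOSURE_ID SLOT_NUMBER
pd_replace_commands() {
    for arg in "$1" "$2"; do
        case "$arg" in
            ''|*[!0-9]*)
                echo "usage: pd_replace_commands ENCLOSURE_ID SLOT_NUMBER" >&2
                return 1
                ;;
        esac
    done
    pd="[$1:$2]"
    echo "MegaCli64 -PdLocate -start -physdrv $pd -aALL"
    echo "MegaCli64 -PdLocate -stop -physdrv $pd -aALL"
    echo "MegaCli64 -PDOffline -physdrv $pd -aALL"
}
```

For the drive in this example, that would be "pd_replace_commands 10 5".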

Monitoring the Rebuild

When a replacement disk is inserted, the array should recognise it and automatically begin rebuilding the array. You can monitor this operation by calling;

MegaCli64 -PDRbld -ProgDsply -PhysDrv [10:5] -aALL

This will display the rebuild progress as a textual bar graph. A rebuild of a 300 GB 15,000rpm SAS drive in a 6-drive array took about 30 minutes. How long it takes in your case will vary depending on disk speed, array size and load.

Checking the BBU

Check the BBU;

MegaCli64 AdpBbuCmd aAll

Triggering a re-learn cycle.

MegaCli64 AdpBbuCmd BbuLearn aAll

Monitor a learn cycle:

watch "MegaCli64 AdpBbuCmd aAll | grep -i -e learn -e status -e charge -e operation"
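
For a scripted check rather than watching by eye, the BBU output can be reduced to a pass/fail result. This is a sketch under assumptions: the function name bbu_ok is mine, and it only knows the field wording seen on this controller ("Battery State : Operational", "Battery Replacement required : No", "Pack is about to fail & should be replaced : No"). Other firmware versions may word these differently.

```shell
# Read "MegaCli64 AdpBbuCmd aAll" output on stdin.
# Exit 0 only if a "Battery State" line was seen and nothing looks wrong;
# otherwise print the offending lines and exit 1.
bbu_ok() {
    awk '
        /^Battery State *:/ { seen = 1 }
        /^Battery State *:/ && !/: *Operational/            { bad = 1; print }
        /Battery Replacement required *:/ && !/: *No/       { bad = 1; print }
        /Pack is about to fail.*:/ && !/: *No/              { bad = 1; print }
        END { exit (bad || !seen) }
    '
}
```

On a node: "MegaCli64 AdpBbuCmd aAll | bbu_ok && echo BBU looks healthy".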

Updating Firmware

Tested on:

  • RX200 S8

Dependencies

yum install kernel-headers gcc gcc-c++ compat-libstdc++-33.i686 libstdc++.i686 libstdc++-devel.i686 kernel-devel

iRMC

Warning: Update the iRMC before updating the BIOS.

Go to: Fujitsu Downloads

Select:

  • "Product Search"
  • Enter the server's model number in the 'Product Search' text field. For example; "RX2540M1". Note that you can enter a partial model number and it will ask you to select the proper model from a list.
  • Click the 'Selected operating system' drop down list and choose "Red Hat Enterprise Linux 6 (x86_64)". Note that you might need to click on the selection box twice to get the pop-up menu with the OS selection list to appear.
  • Under the "Driver" tab;
  • Click to expand "Server Management Controller"
    • Click to expand "iRMC S4 (Kronos 4) (Onboard on D3289-A1x)"
    • Verify the title is 'iRMC S4 (Kronos4) Firmware - RX2540 M1 (ASP for Linux)'. If so, click on "Direct Download" on the left.
Warning: Read and understand the notes and warnings!
  • When you are ready, click to check the "Terms of Use" checkbox and then click on "Download File".
  • Locate the file on your computer and extract the zip file.
  • Copy the RX2540M1_MangtCtr_<version>.scexe to the node.
Note: In this tutorial, the file name we're using is 'RX2540M1_MangtCtr_FW0824F_SDR367.scexe'.

Log into the node and run:

# -u == update only, don't reflash or downgrade
# -k == keep extracted files.
sh RX2540M1_MangtCtr_FW0824F_SDR367.scexe -u -k

This will start the iRMC firmware update, you need to confirm:

The install will take a while, be patient!

#...

When it's done, you will hear the fans spin.

BIOS

Note: If you already selected the machine type in the previous section, you will not need to select the machine type or operating system a second time.

Go to: Fujitsu Downloads

Select:

  • "Product Search"
  • Enter the server's model number in the 'Product Search' text field. For example; "RX2540M1". Note that you can enter a partial model number and it will ask you to select the proper model from a list.
  • Click the 'Selected operating system' drop down list and choose "Red Hat Enterprise Linux 6 (x86_64)". Note that you might need to click on the selection box twice to get the pop-up menu with the OS selection list to appear.
  • Click on the "BIOS" tab.
  • Click to expand "Flash - BIOS"
    • Click to expand "Flash BIOS for D3289-A1x" (the final ID may differ depending on your machine)
    • Verify the title is 'Flash BIOS RX2540 M1 (ASP for Linux)'. If so, click on "Direct Download" on the left.
Warning: Read and understand the notes and warnings!
  • When you are ready, click to check the "Terms of Use" checkbox and then click on "Download File".

Extract on the target and run:

# -u == update only, don't reflash or downgrade
# -k == keep extracted files.
sh RX200S8_D3302_BiosV4654_R160.scexe -u -k
***************** PRIMERGY Autonomous Support Package ***************

Description: Flash BIOS RX200 S8  
VersionMajor: V4.6.5.4
VersionMinor: R1.6.0
VersionBuild: 1.0.0
Software Class - Category: Flash - BIOS 
Software Class - Name: (SV) Flash Bios 
Vendor: Fujitsu Technology Solutions 

*********************************************************************

Continue processing this ASP?
Please answer: yes/y or no/n

ACK

*********************************************************************


                          CAUTION!

         Currently a new version is being installed.

       The installation process will take a long time.

                   .... please wait ....

          Don't interrupt this installation process!



*********************************************************************
*********************************************************************


                           FAILED!

         The new version was not correctly installed.



*********************************************************************

In this case, the BIOS was already up to date. Verify via 'dmidecode':

# dmidecode 2.12
SMBIOS 2.7 present.
88 structures occupying 4084 bytes.
Table at 0x7C8BD018.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
        Vendor: FUJITSU // American Megatrends Inc.
        Version: V4.6.5.4 R1.6.0 for D3302-A1x
        Release Date: 01/30/2014
        Address: 0xF0000
        Runtime Size: 64 kB
        ROM Size: 13248 kB
        Characteristics:
                PCI is supported
                BIOS is upgradeable
                BIOS shadowing is allowed
                Boot from CD is supported
                Selectable boot is supported
                EDD is supported
                Print screen service is supported (int 5h)
                Serial services are supported (int 14h)
                Printer services are supported (int 17h)
                ACPI is supported
                USB legacy is supported
                BIOS boot specification is supported
                Targeted content distribution is supported
                UEFI is supported
        BIOS Revision: 1.6

And compare against the latest version on the download page under "BIOS Update - Admin Pack for D3302-A1x"

V4.6.5.4 - R1.6.0
Warning: If the update succeeds, immediately reboot after.

Done!
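
The comparison between the installed BIOS and the download page can also be scripted. A minimal sketch, assuming the two formats shown above ("V4.6.5.4 R1.6.0 for D3302-A1x" from dmidecode, "V4.6.5.4 - R1.6.0" on the download page); the function name bios_is_current is mine:

```shell
# Succeeds if the installed and latest BIOS version strings name the same
# "V..." and "R..." tokens. Nothing is flashed or changed.
# Usage: bios_is_current "INSTALLED" "LATEST"
bios_is_current() {
    # Keep only the version (V...) and release (R...) tokens so that the
    # dmidecode format and the download page format compare equal.
    installed="$(echo "$1" | tr ' ' '\n' | grep -E '^[VR][0-9]' | tr '\n' ' ')"
    latest="$(echo "$2" | tr ' ' '\n' | grep -E '^[VR][0-9]' | tr '\n' ' ')"
    [ -n "$installed" ] && [ "$installed" = "$latest" ]
}
```

On a node, the installed string can be read with "dmidecode -s bios-version", for example: bios_is_current "$(dmidecode -s bios-version)" "V4.6.5.4 - R1.6.0" && echo "BIOS is up to date".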

D3116C RAID Controller

Warning: Immediately reboot when update completes!

Go to: RX*** Downloads

Select the machine, RHEL 6 x86_64, choose SAS RAID - RAID Ctrl SAS 6G 1GB (D3116C) - Firmware for RAID Ctrl SAS 6G D3116C (ASP for Linux)

Extract on the target and run:

# -u == update only, don't reflash or downgrade
# -k == keep extracted files.
sh FTS_FirmwareforRAIDCtrlSAS6GD3116CASPforLinu_2390003332201252420100_1106185.SCEXE -u -k
***************** PRIMERGY Autonomous Support Package ***************

Description: Firmware for RAID Ctrl SAS 6G D3116C  
VersionMajor: 23.9.0-0033
VersionMinor: 3.220.125-2420
VersionBuild: 1.0.0
Software Class - Category: Flash - Firmware 
Software Class - Name: (SV) Flash - Firmware 
Vendor: LSI Logic 

*********************************************************************

Continue processing this ASP?
Please answer: yes/y or no/n

ACK

*********************************************************************


                          CAUTION!

         Currently a new version is being installed.

       The installation process will take a long time.

                   .... please wait ....

          Don't interrupt this installation process!



*********************************************************************
*********************************************************************


                       Attention! 

                     Successfully flashed.
                      Reboot your system.




*********************************************************************

REBOOT

reboot

Once rebooted, reset the controller to factory defaults. This will not affect the Logical Disk! If you followed the AN!Cluster Tutorial 2, then you will not need to change any settings. It does require another reboot though.

Record the current settings (we'll diff after the reset):

MegaCli64 AdpAllInfo aAll > Adapter.pre-reset
MegaCli64 LDInfo Lall aAll > LD.pre-reset

Reset:

MegaCli64 AdpFacDefSet a0
Adapter 0: Factory Default Set Successfully. 
Please reboot the system for the changes to take effect

Exit Code: 0x00

Now reboot again.

reboot

Now dump the new settings and check for changes:

MegaCli64 AdpAllInfo aAll > Adapter.post-reset
MegaCli64 LDInfo Lall aAll > LD.post-reset

Check for differences:

diff -u Adapter.pre-reset Adapter.post-reset
--- Adapter.pre-reset	2014-04-14 15:48:35.011122984 -0400
+++ Adapter.post-reset	2014-04-14 15:52:27.445603604 -0400
@@ -72,11 +72,11 @@
 Temperature sensor for ROC    : Present
 Temperature sensor for controller    : Absent
 
-ROC temperature : 74  degree Celsius
+ROC temperature : 75  degree Celsius
 
                 Settings
                 ================
-Current Time                     : 19:48:35 4/14, 2014
+Current Time                     : 19:52:26 4/14, 2014
 Predictive Fail Poll Interval    : 300sec
 Interrupt Throttle Active Count  : 16
 Interrupt Throttle Completion    : 50us
diff -u LD.pre-reset LD.post-reset
# no output

Done!
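
The only differences in the adapter diff above are the ROC temperature and the controller clock, which change on every run. To make real setting changes stand out, those lines can be filtered before diffing. A sketch; the function name adapter_diff is mine, and the two patterns are taken from the diff output above:

```shell
# Diff two AdpAllInfo dumps, ignoring lines that change on every run
# (the ROC temperature and the controller's current time).
# Exit status follows diff: 0 = no differences, 1 = differences found.
# Usage: adapter_diff PRE_FILE POST_FILE
adapter_diff() {
    tmp_pre="$(mktemp)" || return 2
    tmp_post="$(mktemp)" || { rm -f "$tmp_pre"; return 2; }
    grep -v -e '^ROC temperature' -e '^Current Time' "$1" > "$tmp_pre"
    grep -v -e '^ROC temperature' -e '^Current Time' "$2" > "$tmp_post"
    rc=0
    diff -u "$tmp_pre" "$tmp_post" || rc=$?
    rm -f "$tmp_pre" "$tmp_post"
    return $rc
}
```

With the files from this section: adapter_diff Adapter.pre-reset Adapter.post-reset. No output means the factory reset changed nothing that matters.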

Creating a Backup Set

ToDo: Explain this...

mkdir /root/base
cd /root/base
mkdir /root/base/root
mkdir -p /root/base/etc/sysconfig/network-scripts
mkdir -p /root/base/etc/udev/rules.d
mkdir -p /root/base/etc/init.d
mkdir -p /root/base/var/spool/cron

# Root user
rsync -av /root/.bashrc   /root/base/root/
rsync -av /root/.ssh      /root/base/root/
rsync -av /root/an-cm*    /root/base/root/
rsync -av /root/archive_* /root/base/root/

# Directories
rsync -av /etc/ssh     /root/base/etc/
rsync -av /etc/apcupsd /root/base/etc/
rsync -av /etc/cluster /root/base/etc/
rsync -av /etc/drbd.*  /root/base/etc/
rsync -av /etc/an      /root/base/etc/
rsync -av /etc/yum     /root/base/etc/
rsync -av /etc/pki     /root/base/etc/
rsync -av --exclude 'archive' --exclude 'cache' --exclude 'backup' /etc/lvm /root/base/etc/

# Specific files.
rsync -av /etc/sysconfig/network-scripts/ifcfg-{eth*,bond*,vbr*} /root/base/etc/sysconfig/network-scripts/
rsync -av /etc/udev/rules.d/70-persistent-net.rules              /root/base/etc/udev/rules.d/
rsync -av /etc/sysconfig/network /root/base/etc/sysconfig/
rsync -av /etc/hosts             /root/base/etc/
rsync -av /etc/ntp.conf          /root/base/etc/
rsync -av /etc/init.d/apcupsd    /root/base/etc/init.d/
rsync -av /var/spool/cron/root   /root/base/var/spool/cron/

# Save recreating user accounts.
rsync -av /etc/passwd            /root/base/etc/
rsync -av /etc/group             /root/base/etc/
rsync -av /etc/shadow            /root/base/etc/
rsync -av /etc/gshadow           /root/base/etc/

# If you have the cluster built and want to back up its configs.
# (-p avoids an error if the directories already exist from the rsync calls above.)
mkdir -p /root/base/etc/cluster
mkdir -p /root/base/etc/lvm
rsync -av /etc/cluster/cluster.conf /root/base/etc/cluster/

# NOTE: DRBD won't work until you've manually created the partitions.
rsync -av /etc/drbd.d /root/base/etc/

# If you're running RHEL and want to back up your registration info.
if [ -e "/etc/sysconfig/rhn" ]
then
	rsync -av /etc/sysconfig/rhn /root/base/etc/sysconfig/
fi

# Back up the logical and extended partition structure
# Write the script header once, before the loop, so that a second disk
# does not truncate the entries already written for the first disk.
echo '#!/bin/bash' > /root/base/root/partition_drives.sh
for d in $(fdisk -l | grep 'Disk /dev' | grep -v mapper | sed 's/Disk \(.*\):.*/\1/')
do
        for i in $(parted -m -s -a opt $d "print free" | grep '^[4-9]')
        do
                if [ `echo $i | grep '^4:'` ]
                then
                        echo "$d:$i" | perl -pe 's/^(.*?):(\d+):(.*?):(.*?):.*/parted -s -a opt \1 "mkpart extended \3 \4"/'
                else
                        echo "$d:$i" | perl -pe 's/^(.*?):(\d+):(.*?):(.*?):.*/parted -s -a opt \1 "mkpart logical \3 \4"/'
                fi
        done
done >> /root/base/root/partition_drives.sh
chmod 755 /root/base/root/partition_drives.sh

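# Pack it up
# NOTE: Change the name to suit your node.
# Paths are stored relative to /root/base so that extracting with '-C /'
# puts the files back in their original locations.
cd /root/
tar -cvf base_$(hostname -s).tar -C /root/base etc root var
ls -lah /root/base_*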
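
The %post step further down unpacks this archive with tar and -C /, so the member paths need to start at etc/, root/ or var/ rather than at the staging directory. A minimal sketch of a check to run before copying the archive anywhere, assuming GNU tar; check_backup_layout is a hypothetical helper name.

```shell
# Hypothetical helper: confirm that a backup archive will unpack cleanly
# over '/' (members under etc/, root/ or var/, with no staging prefix).
check_backup_layout() {
    archive="$1"
    # List member names; give up if tar cannot read the archive.
    members=$(tar -tf "$archive") || return 2
    # Any member outside etc/, root/ or var/ is unexpected.
    if printf '%s\n' "$members" | grep -Ev '^(etc|root|var)(/|$)' | grep -q .; then
        echo "unexpected top-level entries in $archive" >&2
        return 1
    fi
    # A root/base/ prefix means the staging directory itself was archived.
    if printf '%s\n' "$members" | grep -q '^root/base/'; then
        echo "staging path root/base/ found in $archive" >&2
        return 1
    fi
    return 0
}
```

For example, `check_backup_layout /root/base_$(hostname -s).tar && echo "layout OK"`.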

Now copy it to your PXE server. In my case, that is 10.255.255.250. I am backing up RHEL6 nodes, so I store my backups in /var/www/html/rhel6/x86_64/files/.

rsync -av /root/base_$(hostname -s).tar root@10.255.255.250:/var/www/html/rhel6/x86_64/files/
root@10.255.255.250's password: 
sending incremental file list
base_an-c05n02.tar

sent 4045378 bytes  received 31 bytes  898979.78 bytes/sec
total size is 4044800  speedup is 1.00

Now that it is on the server, I can use the following %post kickstart script entry in dedicated, per-node kickstart scripts.

%post
# Download the backup files and load them.
cd ~
wget http://10.255.255.250/rhel6/x86_64/files/base_an-c05n01.tar
cp base_an-c05n01.tar /mnt/sysimage/root/
/etc/init.d/network stop
# The archive is a plain (uncompressed) tar file, so no -z is needed.
tar -xvf base_an-c05n01.tar -C /
rm -f /etc/udev/rules.d/70-persistent-net.rules
start_udev
/etc/init.d/network start
/mnt/systemroot/root/partition_drives.sh

When the install finishes, that will load all the files we backed up. So when the node reboots, all of its old RHN registration, network configs and so on will be restored. This should dramatically reduce recovery time!
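
Each per-node kickstart script hard-codes the node's archive name. As a small sketch, the download URL can be derived from the node's short host name instead. The function name backup_url is hypothetical, and its defaults simply mirror the PXE server address and path used in this example.

```shell
# Hypothetical helper: build the download URL for a node's backup archive.
# The default server and path are the ones used in this example only.
backup_url() {
    node="$1"
    server="${2:-10.255.255.250}"
    path="${3:-/rhel6/x86_64/files}"
    if [ -z "$node" ]; then
        echo "usage: backup_url <short-host-name> [server] [path]" >&2
        return 1
    fi
    printf 'http://%s%s/base_%s.tar\n' "$server" "$path" "$node"
}
```

For example, `wget "$(backup_url an-c05n01)"` fetches the same file as the wget line above.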

Specific Fujitsu Model Notes

RX200 S7

BIOS Changes for use in Anvil!s

To enter the BIOS, press F2 during POST.

  • Advanced
    • SATA Configuration
      • SATA Mode -> Disabled (if no optical drive)
      • - OR -
      • SATA Mode -> AHCI Mode (if optical drive)
    • Onboard Device Configuration
      • LAN 2 Oprom -> PXE
      • Onboard SAS/SATA (SCU) -> Disabled (if no optical drive)
  • Server Mgmt
    • Asset Tag -> short host name of the node, lower case
    • Temperature Monitoring -> Enabled
  • Boot
    • Bootup NumLock State -> On

Save and Exit

LSI RAID Controller Setup

Warning: These instructions assume NO existing data. This is a destructive process!

After POST, press <ctrl> + H to enter the controller's WebBIOS.

  • Choose the controller (usually only one available), click Start
  • Choose Configuration Wizard
    • Choose New Configuration -> click Next
    • Confirm the clear by clicking Yes
    • Choose Manual Configuration -> click Next
      • Click on the first drive, usually Slot: 0,..., then press and hold the <ctrl> key and click to select the rest of the drives. Click Add to Array.
      • Click on Accept DG to create the drive group then click on Next
      • Click on Add to SPAN, then click on Next
      • Configure the array;
        • For 1 to 8 drives;
          • RAID Level -> RAID 5
          • Write Policy -> Write Back with BBU
          • Look at the R5:xxxx size on the right and enter that size in the Select Size section. Be sure to match the GB or TB suffix.
        • For 9 or more drives;
          • RAID Level -> RAID 6
          • Write Policy -> Write Back with BBU
          • Look at the R6:xxxx size on the right and enter that size in the Select Size section. Be sure to match the GB or TB suffix.
        • Click on Accept then click on Yes to accept the warning.
        • The virtual disk will now be shown on the right. Click Next to proceed.
      • Click Accept and then click Yes to save the configuration. Click Yes to acknowledge the warning and initialize the drive.
    • Click on Set Boot Drive and then click Go.
    • Click on Home.
  • Click on Exit and then on Yes.

Reboot and you are done.
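
The drive-count rule above, together with a rough capacity estimate, can be sketched as follows. This assumes all drives are the same size and relies only on the general fact that RAID 5 gives up one drive's worth of space to parity and RAID 6 gives up two. The function names are hypothetical, and the R5:xxxx or R6:xxxx figure shown in WebBIOS remains the value to enter.

```shell
# Hypothetical helper: pick the RAID level by drive count, following the
# rule in these notes (1 to 8 drives -> RAID 5, 9 or more -> RAID 6).
raid_level_for() {
    drives="$1"
    if [ "$drives" -ge 9 ]; then
        echo 6
    else
        echo 5
    fi
}

# Hypothetical helper: estimate usable capacity for equal-sized drives.
# Arguments: drive count, size of one drive in whole units (GB, for example).
usable_capacity() {
    drives="$1"
    size="$2"
    level=$(raid_level_for "$drives")
    if [ "$level" -eq 6 ]; then
        parity=2
    else
        parity=1
    fi
    echo $(( (drives - parity) * size ))
}
```

For example, `usable_capacity 4 600` estimates the usable size of four 600 GB drives in RAID 5.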


 

© Alteeve's Niche! Inc. 1997-2024   Anvil! "Intelligent Availability®" Platform
legal stuff: All info is provided "As-Is". Do not use anything here unless you are willing and able to take responsibility for your own actions.