RAID 降级问题,进来的统统 diameter_cm += 2, length_cm += 3

2019-06-05 09:51:07 +08:00
 qwerthhusn

Dell 服务器,用 MegaCli 查看状态提示已降级( Degraded ),机器正面也有一块硬盘在闪黄灯,其他的都是绿灯

smartctl -H /dev/bus/0 -d megaraid,N(所有盘,包括那块闪灯的)去看健康状态,都是 ok 的

这个是硬盘问题还是 RAID 配置问题,如何定位,请给个思路,我本来就是一个写增删改查的码畜,这些东西完全不知道该怎么玩。

1223 次点击
所在节点    问与答
4 条回复
onehao28
2019-06-05 10:11:56 +08:00
smartctl 看 是不准确的,应该 smartctl -a 看一下,看下具体参数。很有可能坏道太多。
lucifer9
2019-06-05 10:42:33 +08:00
找 Dell 工程师来处理,给钱了干嘛还要给自己找麻烦
qwerthhusn
2019-06-05 17:13:06 +08:00
@onehao28 这个能帮忙看一下么?
···
smartctl 6.7 2018-08-04 r4747 [x86_64-linux-3.10.0-957.12.2.el7.x86_64] (daily-20180804)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor: SEAGATE
Product: ST1200MM0099
Revision: ST31
Compliance: SPC-4
User Capacity: 1,200,243,695,616 bytes [1.20 TB]
Logical block size: 512 bytes
Formatted with type 2 protection
8 bytes of protection information per logical block
LU is fully provisioned
Rotation Rate: 10000 rpm
Form Factor: 2.5 inches
Logical Unit id: 0x5000c500bca87617
Serial number: WFK2RM60
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Wed Jun 5 09:10:55 2019 UTC
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Disabled or Not Supported
Read Cache is: Enabled
Writeback Cache is: Disabled

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK

Grown defects during certification <not available>
Total blocks reassigned during format <not available>
Total new blocks reassigned = 1
Power on minutes since format <not available>
Current Drive Temperature: 34 C
Drive Trip Temperature: 60 C

Manufactured in week 49 of year 2018
Specified cycle count over device lifetime: 10000
Accumulated start-stop cycles: 20
Specified load-unload count over device lifetime: 300000
Accumulated load-unload cycles: 225
Elements in grown defect list: 39

Vendor (Seagate Cache) information
Blocks sent to initiator = 2181055139
Blocks received from initiator = 2453981
Blocks read from cache and sent to initiator = 13569935
Number of read and write commands whose size <= segment size = 35257
Number of read and write commands whose size > segment size = 0

Vendor (Seagate/Hitachi) factory information
number of hours powered up = 12.98
number of minutes until next internal SMART test = 51

Error counter log:
Errors Corrected by Total Correction Gigabytes Total
ECC rereads/ errors algorithm processed uncorrected
fast | delayed rewrites corrected invocations [10^9 bytes] errors
read: 2180959242 1 0 2180959243 0 1134.149 0
write: 0 0 0 0 0 1.352 0
verify: 1237 0 0 1237 26 0.000 0

Non-medium error count: 0

SMART Self-test log
Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background long Failed in first segment 96 5 - [0x4 0x44 0x0]

Long (extended) Self-test duration: 6723 seconds [112.0 minutes]

scsiPrintBackgroundResults Failed [Input/output error]
Protocol Specific port log page for SAS SSP
relative target port id = 1
generation code = 2
number of phys = 1
phy identifier = 0
attached device type: SAS or SATA device
attached reason: unknown
reason: loss of dword synchronization
negotiated logical link rate: phy enabled; 12 Gbps
attached initiator port: ssp=1 stp=1 smp=1
attached target port: ssp=0 stp=0 smp=0
SAS address = 0x5000c500bca87615
attached SAS address = 0x54cd98f0469eb107
attached phy identifier = 7
Invalid DWORD count = 0
Running disparity error count = 0
Loss of DWORD synchronization = 6
Phy reset problem = 0
Phy event descriptors:
Invalid word count: 0
Running disparity error count: 0
Loss of dword synchronization count: 6
Phy reset problem count: 0
relative target port id = 2
generation code = 2
number of phys = 1
phy identifier = 1
attached device type: no device attached
attached reason: unknown
reason: unknown
negotiated logical link rate: phy enabled; unknown
attached initiator port: ssp=0 stp=0 smp=0
attached target port: ssp=0 stp=0 smp=0
SAS address = 0x5000c500bca87616
attached SAS address = 0x0
attached phy identifier = 0
Invalid DWORD count = 0
Running disparity error count = 0
Loss of DWORD synchronization = 0
Phy reset problem = 0
Phy event descriptors:
Invalid word count: 0
Running disparity error count: 0
Loss of dword synchronization count: 0
Phy reset problem count: 0
···
crc8
2019-06-05 17:55:37 +08:00
硬盘问题,显示 OK 只能表明还没到显示不 OK 的时候。

这是一个专为移动设备优化的页面(即为了让你能够在 Google 搜索结果里秒开这个页面),如果你希望参与 V2EX 社区的讨论,你可以继续到 V2EX 上打开本讨论主题的完整版本。

https://www.v2ex.com/t/570998

V2EX 是创意工作者们的社区,是一个分享自己正在做的有趣事物、交流想法,可以遇见新朋友甚至新机会的地方。

V2EX is a community of developers, designers and creative people.

© 2021 V2EX