DISK 2 (SDB) seem have issue or SATA HAT port KO N

Hi @setq
I’ve for 2nd time an issue with port 2 of SATA HAT.
I’ve change HDD because previous flag with error under SMART OVM and red light on SATA HAT Board.
I’ve reinstalled my system recently and changed all of my 4 disks.

Today, OVM show disk at “BAD” condition, no blue light on SATA HAT BOARD for port 2 (sdb).
image

It’s the 2nd times, then I’m un doubt if it’s bad luck with 2 disks for SATA HAT issue.
Your feedback, advices? Tips to test port 2 in command line?

smartctl 7.2 2020-12-30 r5155 [armv7l-linux-5.10.92-v7l+] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     HGST Travelstar 5K1000
Device Model:     HGST HTS541010A9E680
Serial Number:    JD1092DP06RSDU
LU WWN Device Id: 5 000cca 82dc30fa5
Firmware Version: JA0OA7G0
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ATA8-ACS T13/1699-D revision 6
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Feb 16 08:37:25 2022 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM level is:     254 (maximum performance)
Rd look-ahead is: Enabled
Write cache is:   Enabled
DSN feature is:   Unavailable
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Unknown

=== START OF READ SMART DATA SECTION ===
SMART Status not supported: Incomplete response, ATA output registers missing
SMART overall-health self-assessment test result: PASSED
Warning: This result is based on an Attribute check.

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (   45) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 222) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR-K   100   097   062    -    0
  2 Throughput_Performance  P-S--K   198   100   040    -    125
  3 Spin_Up_Time            PO---K   200   100   033    -    1
  4 Start_Stop_Count        -O--CK   100   100   000    -    1069
  5 Reallocated_Sector_Ct   PO--CK   100   100   005    -    0
  7 Seek_Error_Rate         POSR-K   100   100   067    -    0
  8 Seek_Time_Performance   P-S--K   115   100   040    -    34
  9 Power_On_Hours          -O--CK   097   097   000    -    1477
 10 Spin_Retry_Count        PO--CK   100   100   060    -    0
 12 Power_Cycle_Count       -O--CK   100   100   000    -    1029
183 Runtime_Bad_Block       -O--CK   100   100   000    -    0
184 End-to-End_Error        PO--CK   100   100   097    -    0
187 Reported_Uncorrect      -O--CK   100   100   000    -    85901901824
188 Command_Timeout         -O--CK   100   100   000    -    25770786816
190 Airflow_Temperature_Cel -O---K   054   049   045    -    46 (Min/Max 41/51)
191 G-Sense_Error_Rate      -O--CK   098   098   000    -    646
192 Power-Off_Retract_Count -O--CK   100   100   000    -    10354846
193 Load_Cycle_Count        -O--CK   099   099   000    -    14895
196 Reallocated_Event_Count -O--CK   100   100   000    -    0
197 Current_Pending_Sector  -O--CK   100   100   000    -    8
198 Offline_Uncorrectable   ----CK   100   100   000    -    0
199 UDMA_CRC_Error_Count    -OS-CK   100   100   000    -    0
223 Load_Retry_Count        -O-R-K   100   100   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x02           SL  R/O      1  Comprehensive SMART error log
0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters log
0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       256 (0x0100)
Device State:                        Active (0)
Current Temperature:                    46 Celsius
Power Cycle Min/Max Temperature:     41/51 Celsius
Lifetime    Min/Max Temperature:     12/51 Celsius
Specified Max Operating Temperature:    36 Celsius
Under/Over Temperature Limit Count:   0/0

SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -40/65 Celsius
Temperature History Size (Index):    128 (88)

Index    Estimated Time   Temperature Celsius
  89    2022-02-16 06:30    46  ***************************
  90    2022-02-16 06:31    45  **************************
 ...    ..(  2 skipped).    ..  **************************
  93    2022-02-16 06:34    45  **************************
  94    2022-02-16 06:35    46  ***************************
  95    2022-02-16 06:36    45  **************************
 ...    ..( 12 skipped).    ..  **************************
 108    2022-02-16 06:49    45  **************************
 109    2022-02-16 06:50    46  ***************************
 110    2022-02-16 06:51    45  **************************
 ...    ..( 42 skipped).    ..  **************************
  25    2022-02-16 07:34    45  **************************
  26    2022-02-16 07:35    44  *************************
  27    2022-02-16 07:36    45  **************************
 ...    ..(  2 skipped).    ..  **************************
  30    2022-02-16 07:39    45  **************************
  31    2022-02-16 07:40    44  *************************
  32    2022-02-16 07:41    45  **************************
  33    2022-02-16 07:42    45  **************************
  34    2022-02-16 07:43    44  *************************
  35    2022-02-16 07:44    44  *************************
  36    2022-02-16 07:45    45  **************************
  37    2022-02-16 07:46    44  *************************
 ...    ..( 26 skipped).    ..  *************************
  64    2022-02-16 08:13    44  *************************
  65    2022-02-16 08:14    45  **************************
  66    2022-02-16 08:15    46  ***************************
  67    2022-02-16 08:16    46  ***************************
  68    2022-02-16 08:17    47  ****************************
 ...    ..(  2 skipped).    ..  ****************************
  71    2022-02-16 08:20    47  ****************************
  72    2022-02-16 08:21    46  ***************************
 ...    ..(  3 skipped).    ..  ***************************
  76    2022-02-16 08:25    46  ***************************
  77    2022-02-16 08:26    47  ****************************
  78    2022-02-16 08:27    47  ****************************
  79    2022-02-16 08:28    48  *****************************
  80    2022-02-16 08:29    48  *****************************
  81    2022-02-16 08:30    48  *****************************
  82    2022-02-16 08:31    47  ****************************
  83    2022-02-16 08:32    47  ****************************
  84    2022-02-16 08:33    46  ***************************
 ...    ..(  3 skipped).    ..  ***************************
  88    2022-02-16 08:37    46  ***************************

SMART WRITE LOG does not return COUNT and LBA_LOW register
SCT (Get) Error Recovery Control command failed

Device Statistics (GP/SMART Log 0x04) not supported

Pending Defects log (GP Log 0x0c) not supported

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0009  2            3  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2            2  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000d  2            0  Non-CRC errors within host-to-device FIS

each of your 4 disks on port 2 raises the same issue?
did you check dmesg?