OVHcloud Network Status

rbx-g1-a9
Incident Report for Network & Infrastructure
Resolved
A line card on rbx-g1-a9 has just reloaded because of an ECC error.

RP/0/RSP0/CPU0:Feb 11 14:30:57 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:IOS XR FAILURE
RP/0/RSP0/CPU0:Feb 11 14:30:57 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:BRINGDOWN

RP/0/RSP1/CPU0:Feb 11 14:31:31 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is up
RP/0/RSP0/CPU0:Feb 11 14:31:32 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is up
RP/0/RSP0/CPU0:Feb 11 14:31:33 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is down
RP/0/RSP0/CPU0:Feb 11 14:31:33 CEST: ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 6 (LC_Slot_6) is up


Update(s):

Date: 2016-02-11 17:45:44 UTC
The RMA replacement has been received. The failed card carries only one production port; we are moving it.

Date: 2016-02-11 13:58:49 UTC
An RMA has been requested from Cisco.

Date: 2016-02-11 12:43:35 UTC
The card is indeed reloading in a loop.

To avoid any impact on the router, we are shutting the card down.

Date: 2016-02-11 12:40:38 UTC
RP/0/RSP0/CPU0:Feb 11 14:40:17 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-RUNNING
RP/0/RSP0/CPU0:Feb 11 14:40:17 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-RUNNING


Date: 2016-02-11 12:39:57 UTC
The card appears to be reloading in a loop.

RP/0/RSP0/CPU0:Feb 11 14:39:14 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR_HAL-6-BOOT_REQ_RECEIVED : Boot Request from 0/6/CPU0, RomMon Version: 1.3
RP/0/RSP0/CPU0:Feb 11 14:39:14 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-BOOTING
RP/0/RSP0/CPU0:Feb 11 14:39:14 CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:MBI-BOOTING

We will check the cause with Cisco and request an RMA if needed.
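
The state transitions reported by shelfmgr in this incident can be extracted mechanically when checking for a reload loop. The following is only a minimal sketch, assuming log lines in exactly the format quoted in this report; the names `node_states` and `boot_attempts` and the regular expression are illustrative and are not part of any OVH or Cisco tooling.

```python
import re

# Matches shelfmgr NODE_STATE_CHANGE messages as quoted in this report.
# Assumption: "<node> <card type> state:<state>" always follows the message tag.
STATE_CHANGE = re.compile(
    r"%PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : "
    r"(?P<node>\S+) (?P<card>\S+) state:(?P<state>.+?)\s*$"
)


def node_states(lines):
    """Return (node, state) pairs in log order, collapsing consecutive repeats."""
    out = []
    for line in lines:
        m = STATE_CHANGE.search(line)
        if not m:
            continue  # not a state-change message
        pair = (m.group("node"), m.group("state"))
        if not out or out[-1] != pair:
            out.append(pair)
    return out


def boot_attempts(lines, node):
    """Count how many times `node` entered the MBI-BOOTING state."""
    return sum(1 for n, s in node_states(lines) if n == node and s == "MBI-BOOTING")


# Sample lines copied from this report, in chronological order.
PREFIX = "RP/0/RSP0/CPU0:Feb 11 {t} CEST: shelfmgr[392]: %PLATFORM-SHELFMGR-6-NODE_STATE_CHANGE : 0/6/CPU0 A9K-8T-L state:{s}"
SAMPLE = [
    PREFIX.format(t="14:30:57", s="IOS XR FAILURE"),
    PREFIX.format(t="14:30:57", s="BRINGDOWN"),
    PREFIX.format(t="14:39:14", s="MBI-BOOTING"),
    PREFIX.format(t="14:39:14", s="MBI-BOOTING"),
    PREFIX.format(t="14:40:17", s="MBI-RUNNING"),
]

if __name__ == "__main__":
    for node, state in node_states(SAMPLE):
        print(node, state)
    print("boot attempts:", boot_attempts(SAMPLE, "0/6/CPU0"))
```

On the excerpt quoted here the sketch reports a single boot attempt for 0/6/CPU0; a count that keeps growing over a longer capture would be consistent with a reload loop.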


Date: 2016-02-11 12:35:46 UTC
However, no service impact has been observed.

Date: 2016-02-11 12:35:08 UTC
LC/0/6/CPU0:Feb 11 14:15:34 CEST: pfm_node_lc[279]: %PLATFORM-NP-0-HW_DOUBLE_ECC_ERROR : Set|prm_server_tr[159813]|0x1008000|NP DOUBLE ECC ERROR, NP=0, memId=17, subMemId=0x1
Posted Feb 11, 2016 - 12:32 UTC
This incident affected: Infrastructure || RBX (RBX1, RBX2, RBX3, RBX4, RBX5, RBX6, RBX7, RBX8).