Closed Bug 1007328 Opened 10 years ago Closed 9 years ago

Please run diagnostics on t-w732-ix-059

Categories

(Infrastructure & Operations :: DCOps, task)

x86
macOS
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: coop, Unassigned)

(Whiteboard: reimaging)

This slave is displaying the Pink Pixel of Death(tm), which usually indicates memory problems.
Running diagnostics.
colo-trip: --- → scl3
Whiteboard: hardware diagnostics
Host passed multiple diags, we can create a case with IX if the issue comes up again
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Product: mozilla.org → Infrastructure & Operations
(In reply to Salvador Espinoza [:sal] from comment #2)
> Host passed multiple diags, we can create a case with IX if the issue comes
> up again

OK, let's do that, please. It's hitting the same errors again.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
We fixed the 'pink pixel of death' issue last time by replacing the video card. Will swap it out on the next DC visit.
Well, the host doesn't seem to power on, so I opened a case with IX.

ticket ID #SPJ-937-91915
Whiteboard: hardware diagnostics → ticket ID #SPJ-937-91915
Update from IX:

"Sal,

Your node A1-27708 has been received in the testing area and will begin testing shortly."
Node is ready for pickup. It seems we might have a bad chassis.
Update from IX:

"Hello Sal,

We wanted to update you with our findings on the node we have in-house.

Since receiving the node it has completed over 20 burn-in tests in house without any recorded errors or any issues booting.

There has been no issues found, which points back to an issue within the chassis.

Previously, you stated you do not have a spare node to verify the slot. However, we have verified the node, so if upon return the node fails to boot the conclusion is a bad slot on the chassis that should be investigated.

If possible could the returning node and another node in the same chassis switch positions to see if the slot stays dead or follows the node.

As always, if you have any further questions or concerns, we are here to help.

Thanks,"
I'll pick up the blade on Monday when I pick up our RMA drives, as we have several outstanding tickets with them.
Picked up the host from IX; currently reimaging.
The video card was replaced, so hopefully that solves the issue.
Whiteboard: ticket ID #SPJ-937-91915 → reimaging
Host reimaged.

vhua$ ssh t-w732-ix-059.wintest.releng.scl3.mozilla.com
The authenticity of host 't-w732-ix-059.wintest.releng.scl3.mozilla.com (10.26.40.209)' can't be established.
RSA key fingerprint is 1d:a5:4f:93:5d:c1:3a:d0:c4:8f:fc:1d:4d:75:0a:89.
Are you sure you want to continue connecting (yes/no)?
Status: REOPENED → RESOLVED
Closed: 10 years ago → 9 years ago
Resolution: --- → FIXED