Closed Bug 977341 Opened 10 years ago Closed 10 years ago

A lot of XP machines out of action

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task)

x86_64
Linux
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: armenzg, Unassigned)

References

Details

I've noticed an XP machines with this message:
"Terminate batch job (Y/N)?"
http://people.mozilla.org/~armenzg/sattap/58d4f9bc.png

Not sure if the rest will be having the same issue or not.

It seems that it happens *just* when we try to reboot and prevents the reboot (which I'm shocked).

There are more than 20 machines out of action:
https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slavetype.html?class=test&type=t-xp32-ix

slave rebooter is probably failing to reboot the machine since it cannot see buildbot stopping on the twistd.log.
There were 34 machines in this state.
slaverebooter is not taking care of them properly:
https://bugzilla.mozilla.org/show_bug.cgi?id=971861
See Also: → 971861
Depends on: 991236
- I rebooted a bunch of these machines a while ago.
- It looks like 991236 is fixed.
- There is only one machine in xp pool that is broken.

I am resolving this for the above reasons.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Well, it's "fixed" as long as there's another slaverebooter bug about the way it fails to actually reboot them, instead waiting forever on shutdowns that will never happen. I've just started manually rebooting every "broken" XP slave four or five times a day, as a substitute for either slaverebooter or buildduty doing so.
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.