Closed
Bug 1164441
Opened 9 years ago
Closed 9 years ago
nagios alert for free ips in amazon
Categories
(Infrastructure & Operations :: RelOps: General, task)
Infrastructure & Operations
RelOps: General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: catlee, Assigned: arich)
Details
(Whiteboard: [nagios])
Attachments
(2 files)
52 bytes,
text/x-github-pull-request
|
rail
:
review+
rail
:
checked-in+
|
Details | Review |
5.70 KB,
patch
|
dustin
:
review+
arich
:
checked-in+
|
Details | Diff | Splinter Review |
We've divided up our AWS VPCs into subnets, each subnet for a certain type of machines loosely corresponding to vlans (e.g. build, test, servers, etc.) If we have very few or no free IPs in a subnet, that means we can't create new machines there. We need a nagios check that alerts us if we're running low on available IPs per class of subnet, per region (or AZ).
Reporter | ||
Comment 1•9 years ago
|
||
Attachment #8606304 -
Flags: review?(rail)
Updated•9 years ago
|
Attachment #8606304 -
Flags: review?(rail)
Attachment #8606304 -
Flags: review+
Attachment #8606304 -
Flags: checked-in+
Reporter | ||
Comment 2•9 years ago
|
||
So we need a nagios check that runs: /builds/aws_manager/bin/python /builds/aws_manager/cloud-tools/scripts/aws_check_subnets.py -r us-east-1 -r us-west-2 -s test -s try -s build
Assignee | ||
Comment 3•9 years ago
|
||
This adds in a custom nagios check to call the script that catlee wrote. This script shoud run as the user buildduty for credential purposes. I've also modified the sudoers::custom manifest as well so that it takes three arguments, user, runas, and command. This gives us the flexibility to run commands as users other than root.
Assignee: relops → arich
Attachment #8626676 -
Flags: review?(dustin)
Updated•9 years ago
|
Attachment #8626676 -
Flags: review?(dustin) → review+
Assignee | ||
Comment 4•9 years ago
|
||
https://hg.mozilla.org/build/puppet/rev/db24b89f7d77 https://hg.mozilla.org/build/puppet/rev/f22f77171fc2 https://hg.mozilla.org/build/puppet/rev/a595ba7f9aff https://hg.mozilla.org/build/puppet/rev/07af46d74a08
Assignee | ||
Updated•9 years ago
|
Attachment #8626676 -
Flags: checked-in+
Assignee | ||
Comment 5•9 years ago
|
||
I added a nagios check for every 300 seconds, since this isn't something we're likely to hit with great rapidity unless something else is going wrong as well.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
You need to log in
before you can comment on or make changes to this bug.
Description
•