Closed Bug 1125218 Opened 9 years ago Closed 9 years ago

disk space nagios alerts are too aggressive for signing*.srv.releng.scl3.mozilla.com

Categories

(Infrastructure & Operations :: MOC: Service Requests, task)

x86
macOS
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: jlund, Assigned: vinh)

Details

We are currently getting warnings at < 20%

For this host, it should do a Warning at < 10% and Critical < 5%

https://nagios.mozilla.org/releng-scl3/cgi-bin/extinfo.cgi?type=2&host=signing4.srv.releng.scl3.mozilla.com&service=disk+-+%2F
Assignee: server-ops → nobody
Component: Server Operations → MOC: Service Requests
Product: mozilla.org → Infrastructure & Operations
QA Contact: shyam → lypulong
Component: MOC: Service Requests → Other
Product: Infrastructure & Operations → Release Engineering
QA Contact: lypulong → pmoore
Per chat with linda, this is indeed a MOC: Service Request, considering its a change needed to nagios-releng which is in infra puppet.
Component: Other → MOC: Service Requests
Product: Release Engineering → Infrastructure & Operations
QA Contact: pmoore → lypulong
(In reply to Jordan Lund (:jlund) from comment #0)
> https://nagios.mozilla.org/releng-scl3/cgi-bin/extinfo.
> cgi?type=2&host=signing4.srv.releng.scl3.mozilla.com&service=disk+-+%2F

There are 3 signing hosts: signing4, signing5, and signing6. They should all receive the new thresholds.
Summary: disk space nagios alerts are too aggressive for signing4.srv.releng.scl3.mozilla.com → disk space nagios alerts are too aggressive for signing*.srv.releng.scl3.mozilla.com
Assignee: nobody → vhua
I've bumped down the disk warning threshold for signing[456].srv.releng.

--- services.pp	(revision 100397)
+++ services.pp	(working copy)
@@ -79,6 +79,20 @@
             hostgroups => $::fqdn ? {
                 'nagios1.private.releng.scl3.mozilla.com' => [
                     'mac-signing-servers',
+                ],
+                default => [
+                ]
+            }
+        },
+        "check_disk_10_5_signing" => {
+            service_description => "disk - /",
+            notification_options => 'w,c,r,u',
+            normal_check_interval => 5,
+            notification_interval => 5,
+            contact_groups => 'build',
+            check_command => 'check_nrpe_disk!10%!5%!/',
+            hostgroups => $::fqdn ? {
+                'nagios1.private.releng.scl3.mozilla.com' => [
                     'signing-servers'
                 ],
                 default => [
vhua-19865:releng vhua$ svn commit -m "Bug 1125218 - signing*.srv.releng disk check"
Sending        releng/services.pp
Transmitting file data .
Committed revision 100399.
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
that's great. thank you!
You need to log in before you can comment on or make changes to this bug.