Closed Bug 1086934 Opened 10 years ago Closed 10 years ago

Production's commander_settings.py is missing treeherder-etl from CELERY_HOSTGROUP

Categories

(Tree Management :: Treeherder, defect, P1)

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: emorley, Assigned: fubar)

Details

[emorley@treeherderadm.private.scl3 treeherder-service]$ cat deployment/update/commander_settings.py | grep 'HOSTGROUP'

WEB_HOSTGROUP = 'treeherder-web'
CELERY_HOSTGROUP = 'treeherder-processors'


These hostgroups are used by the deploy script:
https://github.com/mozilla/treeherder-service/blob/f7fed3e46ebebc2dfc7dabf331057dbb48db8d7d/deployment/update/update.py#L65

The new treeherder-etl nodes are missing, so they won't get updated on a Chief deploy.
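
To illustrate why a hostgroup that is absent from the settings gets skipped, here is a minimal sketch. It does not use the real deploy script or the Commander API; the `INVENTORY` mapping, the host names, and both helper functions are hypothetical and exist only for this example.

```python
# Hypothetical inventory: hostgroup name -> hosts in that group.
# The host names are made up for illustration.
INVENTORY = {
    "treeherder-web": ["web1", "web2"],
    "treeherder-processors": ["proc1"],
    "treeherder-etl": ["etl1", "etl2"],
}


def hosts_reached(hostgroups, inventory=INVENTORY):
    """Return the hosts a deploy would touch for the given hostgroups."""
    reached = []
    for group in hostgroups:
        # A group that is not listed in the settings is never looked up,
        # so its hosts are silently left on the old code.
        reached.extend(inventory.get(group, []))
    return reached


def hosts_missed(hostgroups, inventory=INVENTORY):
    """Return the hosts in the inventory that the deploy would not touch."""
    reached = set(hosts_reached(hostgroups, inventory))
    return sorted(
        host
        for hosts in inventory.values()
        for host in hosts
        if host not in reached
    )
```

With only the web and processors groups configured, `hosts_missed` reports the etl hosts; adding `'treeherder-etl'` to the list makes it return an empty list.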
Once prod's version is updated, I'll update the example file in the repo:
https://github.com/mozilla/treeherder-service/blob/master/deployment/update/commander_settings.py-dist
Also, perhaps we should just check commander_settings.py into the repo, and not have a split example-real file workflow? There's nothing secret in the prod file that can't be checked into the repo.
Just remembered prod vs stage will need different values for DEPLOY_SCRIPT, so comment 2 won't work.

(Bah the delights of things that occur to you after pressing submit)
For whatever reason, that file isn't in puppet. I've updated it on disk to be:

CELERY_HOSTGROUP = ['treeherder-processors',
                    'treeherder-etl'
                   ]
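
Note that CELERY_HOSTGROUP changes here from a single string to a list. Whether the deploy tooling accepts both forms is not stated in this bug; the sketch below is a hypothetical helper (`normalize_hostgroups` is not part of the repo) showing one way a script could accept either form.

```python
def normalize_hostgroups(value):
    """Return a list of hostgroup names from a string or an iterable.

    Hypothetical helper: a bare string is treated as one hostgroup,
    anything else is assumed to be an iterable of names.
    """
    if isinstance(value, str):
        return [value]
    return list(value)
```

For example, `normalize_hostgroups('treeherder-processors')` gives a one-element list, while the two-element list above is returned unchanged as a new list.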

We may have yet more bumps when it comes to staging, too, but I won't know for certain until I start refactoring things and trying to puppetize them.
Thank you - I'll update the in-repo example file in another bug.
Assignee: nobody → klibby
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Blocks: 1087313
Hostgroups were changed again in bug 1112290 to support restarting workers on each group:

WEB_HOSTGROUP = 'treeherder-stage-web'
ETL_HOSTGROUP = 'treeherder-stage-etl'
LOG_HOSTGROUP = 'treeherder-stage-processors'
RABBIT_HOSTGROUP = 'treeherder-stage-rabbit'

Added commander_settings.py to both new and old puppet modules; old version still contains the old CELERY_HOSTGROUP (which is replaced by the three new ones above in staging).
(In reply to Kendall Libby [:fubar] from comment #6)
> Hostgroups were changed again in bug 1112290 to support restarting
> workers on each group:
> 
> WEB_HOSTGROUP = 'treeherder-stage-web'
> ETL_HOSTGROUP = 'treeherder-stage-etl'
> LOG_HOSTGROUP = 'treeherder-stage-processors'
> RABBIT_HOSTGROUP = 'treeherder-stage-rabbit'
> 
> added commander_settings.py to both new and old puppet modules; old version
> still contains the old CELERY_HOSTGROUP (which is replaced by the three new
> ones above in staging).

[emorley@treeherderadm.private.scl3 treeherder-service]$ cat deployment/update/commander_settings.py
...
WEB_HOSTGROUP = 'treeherder-web'
ETL_HOSTGROUP = 'treeherder-etl'
LOG_HOSTGROUP = 'treeherder-processors'
RABBIT_HOSTGROUP = 'treeherder-rabbit'
CELERY_HOSTGROUP = ['treeherder-processors',
                    'treeherder-etl',
                    'treeherder-rabbitmq',
                   ]
...

Seems like CELERY_HOSTGROUP is now unused and so can be removed from that file?
Once we've finalised the prod file I'll update the in-repo example accordingly :-)
Flags: needinfo?(klibby)
Blocks: 1117096
No longer blocks: 1117096
Yup, it's unused. Removed the CELERY_HOSTGROUP.
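
A quick way to spot leftover settings like this is to compare the names assigned in the settings file with the names the deploy script mentions. The following is only a rough sketch: `unused_settings` is a hypothetical helper, and it assumes settings are upper-case names assigned at the start of a line.

```python
import re


def unused_settings(settings_source, script_source):
    """Return setting names assigned in settings_source but never
    mentioned in script_source (a plain text search, not a real parse)."""
    # Upper-case names assigned at the start of a line, e.g. WEB_HOSTGROUP = ...
    names = re.findall(
        r"^([A-Z][A-Z0-9_]*)\s*=", settings_source, flags=re.MULTILINE
    )
    return [
        name
        for name in names
        if not re.search(r"\b%s\b" % re.escape(name), script_source)
    ]
```

Passing it the text of commander_settings.py and of the deploy script would list any names the script no longer references. Since it is a text search, a name that appears only in a comment still counts as used.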
Flags: needinfo?(klibby)
Thanks :-)