1177190 - git+http doesn't appear to honor keep alive settings with Centos 6

Reporter

Description

•

9 years ago

tl;dr: timeouts fetching from git.mozilla.org may be due to lack of HTTP keep alive support in libcurl

Over the last few days, there have been numerous reports of timeouts in TC jobs interacting with git.mozilla.org.

Due to excellent detective work by a combined TC, MOC, Dev Services & Releng Crew, the following events were noticed:
 - TC builder client had a "hung" git fetch for /external/caf/platform/external/libpng.git
 - TC builder client had a TCP socket in CLOSE_WAIT state
 - git1.dmz.scl3 did not have an associated connection
 - git1.dmz.scl3 did have some "client disconnect" messages, but unclear if related
 - zlb VIP did not have an associated connection
 - zlb does not log connection terminations

The socket in CLOSE_WAIT state triggered a check of keep alive configuration. Neither client nor server override the default setting of 5 seconds for git protocol connections.

However, while researching if there was a configuration setting for keepalive on git+HTTP, the following article http://git.661346.n2.nabble.com/PATCH-http-enable-keepalive-on-TCP-sockets-td7597589.html suggested that git+HTTP keepalives were only supported with libcurl version 7.25 and later.

Investigation of the TC builder client showed it is using centos6, which has version 7.16.7 of libcurl.

There are several options from here - that's what this bug is to coordinate.

Pete Moore [:pmoore][:pete]

Comment 2

•

9 years ago

Awesome detective work guys, and nice summary!

Ryan VanderMeulen [:RyanVM]

Comment 3

•

9 years ago

Do we understand why this only started to affect us so severely within the last week?

Hal Wine [:hwine] use NI!

Reporter

Comment 4

•

9 years ago

Moving bug - also occurring in Buildbot jobs, which also use Centos6 builders.

Component: TaskCluster → General Automation

Product: Testing → Release Engineering

QA Contact: catlee

Hal Wine [:hwine] use NI!

Reporter

Comment 5

•

9 years ago

Attached patch timeout.patch — Details — Splinter Review

WORKAROUND: change timeout to allow quicker fails while rest of problem investigated.

"10 min" picked as 33% higher than average time.

Attachment #8626311 - Flags: review?(catlee)

Hal Wine [:hwine] use NI!

Reporter

Comment 6

•

9 years ago

Comment on attachment 8626311 [details] [diff] [review]
timeout.patch

r+ from :catlee IRL (yay WW)

Attachment #8626311 - Flags: review?(catlee) → review+

Hal Wine [:hwine] use NI!

Reporter

Comment 7

•

9 years ago

Comment on attachment 8626311 [details] [diff] [review]
timeout.patch

https://hg.mozilla.org/build/mozharness/rev/f5d11e85e980

Attachment #8626311 - Flags: checked-in+

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=mozilla-central&job_id=1704198
repository: mozilla-central
start_time: 2015-06-25T16:45:40
who: philringnalda[at]gmail[dot]com
machine: bld-linux64-spot-043
buildname: b2g_mozilla-central_emulator-debug_dep
revision: bbc26cc168c7

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_m-cen_emu-d_dep-0000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_m-cen_emu-d_dep-0000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_m-cen_emu-d_dep-0000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_m-cen_emu-d_dep-0000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_m-cen_emu-d_dep-0000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=mozilla-central&job_id=1704180
repository: mozilla-central
start_time: 2015-06-25T16:42:56
who: philringnalda[at]gmail[dot]com
machine: unknown
revision: bbc26cc168c7

Ryan VanderMeulen [:RyanVM]

Comment 10

•

9 years ago

Failure logs with the newer 600s timeout:
https://treeherder.mozilla.org/logviewer.html#?job_id=11126605&repo=mozilla-inbound

Hal Wine [:hwine] use NI!

Reporter

Comment 11

•

9 years ago

Next step is to get someone from b2g build team to debug the "repo tool" output and/or add debugging output to it.

Until we know what specific command is failing, and how, we're stuck.

ni: mwu for help and/or a reference

Flags: needinfo?(mwu)

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219192
repository: b2g-inbound
start_time: 2015-06-29T05:55:25
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-171
buildname: b2g_b2g-inbound_emulator_dep
revision: c32116f21ebc

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219191
repository: b2g-inbound
start_time: 2015-06-29T05:55:23
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-210
buildname: b2g_b2g-inbound_emulator-debug_dep
revision: c32116f21ebc

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219280
repository: b2g-inbound
start_time: 2015-06-29T06:26:15
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-150
buildname: b2g_b2g-inbound_emulator_dep
revision: b0d7e5cb376e

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219281
repository: b2g-inbound
start_time: 2015-06-29T06:25:18
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-157
buildname: b2g_b2g-inbound_flame-kk_eng_dep
revision: b0d7e5cb376e

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219193
repository: b2g-inbound
start_time: 2015-06-29T05:55:25
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-106
buildname: b2g_b2g-inbound_flame-kk_eng_dep
revision: c32116f21ebc

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219064
repository: b2g-inbound
start_time: 2015-06-29T05:37:04
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-252
buildname: b2g_b2g-inbound_emulator-debug_dep
revision: 95e512cac569

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu-d_dep-000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219065
repository: b2g-inbound
start_time: 2015-06-29T05:37:34
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-251
buildname: b2g_b2g-inbound_emulator_dep
revision: 95e512cac569

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'emulator', '/builds/slave/b2g_b2g-in_emu_dep-00000000000/build/tmp_manifest/emulator.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219066
repository: b2g-inbound
start_time: 2015-06-29T05:37:10
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-226
buildname: b2g_b2g-inbound_flame-kk_eng_dep
revision: 95e512cac569

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng_dep-0000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219112
repository: b2g-inbound
start_time: 2015-06-29T05:43:42
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-174
buildname: b2g_b2g-inbound_nexus-5-l_periodic
revision: 95e512cac569

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_dep-0000000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_dep-0000000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_dep-0000000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_dep-0000000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_dep-0000000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219108
repository: b2g-inbound
start_time: 2015-06-29T05:43:41
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-1013
buildname: b2g_b2g-inbound_flame-kk_periodic
revision: 95e512cac569

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_dep-00000000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_dep-00000000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_dep-00000000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_dep-00000000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_dep-00000000/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219487
repository: b2g-inbound
start_time: 2015-06-29T06:57:36
who: tomcat[at]mozilla[dot]com
machine: tst-linux64-spot-535
buildname: Ubuntu ASAN VM 12.04 x64 b2g-inbound opt test mochitest-e10s-2
revision: 95e512cac569

1759 INFO TEST-UNEXPECTED-FAIL | dom/media/test/test_load_same_resource.html | Clone http://mochi.test:8888/tests/dom/media/test/dynamic_resource.sjs?key=30208124&res1=320x240.ogv&res2=short-video.ogv duration: 1.081179 expected: 0.266 - expected PASS
TEST-UNEXPECTED-FAIL | dom/media/webspeech/recognition/test/test_timeout.html | application terminated with exit code 1
Return code: 1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219111
repository: b2g-inbound
start_time: 2015-06-29T05:43:44
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-064
buildname: b2g_b2g-inbound_nexus-5-l_eng_periodic
revision: 95e512cac569

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_eng_dep-000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_eng_dep-000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_eng_dep-000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_eng_dep-000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'nexus-5-l', '/builds/slave/b2g_b2g-in_n5-l_eng_dep-000000/build/tmp_manifest/nexus-5-l.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219841
repository: b2g-inbound
start_time: 2015-06-29T06:53:24
who: tomcat[at]mozilla[dot]com
machine: panda-0407
buildname: Android 4.0 armv7 API 11+ b2g-inbound debug test jsreftest-1
revision: 95e512cac569

PROCESS-CRASH | http://10.26.131.21:30407/jsreftest/tests/jsreftest.html?test=ecma_3/Function/arguments-002.js | application crashed [@ nsQueryInterface::operator()]
Return code: 1
No tests run or test summary not found

Comment hidden (Legacy TBPL/Treeherder Robot)

log: https://treeherder.mozilla.org/logviewer.html#?repo=b2g-inbound&job_id=2219107
repository: b2g-inbound
start_time: 2015-06-29T05:43:44
who: tomcat[at]mozilla[dot]com
machine: bld-linux64-spot-072
buildname: b2g_b2g-inbound_flame-kk_eng-debug_periodic
revision: 95e512cac569

Return code: 1
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng-d_dep-00/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng-d_dep-00/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng-d_dep-00/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng-d_dep-00/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
Automation Error: mozprocess timed out after 600 seconds running ['./config.sh', '-q', 'flame-kk', '/builds/slave/b2g_b2g-in_flm-kk_eng-d_dep-00/build/tmp_manifest/flame-kk.xml']
timed out after 600 seconds of no output
Return code: -9
failed to run config.sh
Running post_fatal callback...
Exiting -1

Ryan VanderMeulen [:RyanVM]

Comment 26

•

9 years ago

This is arguably a tree-closing issue (or hide most B2G emulator/device image builds), so this needs attention. I'm not sure who's around right now, but a ~50+% failure rate isn't acceptable and needs attention from *someone* ASAP.

Flags: needinfo?(sdeckelmann)

Flags: needinfo?(jonas)

Flags: needinfo?(jocheng)

Flags: needinfo?(faramarz)

Flags: needinfo?(fabrice)

Michael Wu [:mwu]

Comment 27

•

9 years ago

No references here. We don't usually mess with git and repo. Sounds like something changed on the automation side and needs to be backed out. Alternately, you can experiment with upgrading git and/or libcurl.

Flags: needinfo?(mwu)

Selena Deckelmann :selenamarie :selena

Comment 28

•

9 years ago

:wcosta is working right now on upgrading libcurl to stop burning builds. The longer-term fix here is probably getting off CentOS6.

Flags: needinfo?(sdeckelmann)

Jonas Sicking (:sicking) No longer reading bugmail consistently

Updated

•

9 years ago

Flags: needinfo?(jonas)

[:fabrice] Fabrice Desré

Updated

•

9 years ago

Flags: needinfo?(fabrice)

Wander Lairson Costa

Assignee

Updated

•

9 years ago

Assignee: nobody → wcosta

Status: NEW → ASSIGNED

Wander Lairson Costa

Assignee

Comment 29

•

9 years ago

Attached file MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie — Details

Bug 1177190: Update libcurl in docker images. r=selena

libcurl shipped with CentOS 6 doesn't support keepalive. This is causing
builds to burn.

Wander Lairson Costa

Assignee

Updated

•

9 years ago

Attachment #8627515 - Flags: review?(sdeckelmann)

Wander Lairson Costa

Assignee

Comment 30

•

9 years ago

Comment on attachment 8627515 [details]
MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie

Bug 1177190: Update libcurl in docker images. r=selena

libcurl shipped with CentOS 6 doesn't support keepalive. This is causing
builds to burn.

Selena Deckelmann :selenamarie :selena

Updated

•

9 years ago

Attachment #8627515 - Flags: review?(sdeckelmann) → review+

Selena Deckelmann :selenamarie :selena

Comment 31

•

9 years ago

Comment on attachment 8627515 [details]
MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie

https://reviewboard.mozilla.org/r/12255/#review10733

Ship It!

Wander Lairson Costa

Assignee

Comment 32

•

9 years ago

https://hg.mozilla.org/integration/b2g-inbound/rev/bba8e8d63c37

Carsten Book [:Tomcat]

Comment 33

•

9 years ago

(In reply to Wander Lairson Costa [:wcosta] from comment #32)
> https://hg.mozilla.org/integration/b2g-inbound/rev/bba8e8d63c37

sorry had to back this out for perma failures like https://treeherder.mozilla.org/logviewer.html#?job_id=2225232&repo=b2g-inbound

Pulsebot

Comment 34

•

9 years ago

Backout:
https://hg.mozilla.org/integration/b2g-inbound/rev/a16f198045ae

Carsten Book [:Tomcat]

Updated

•

9 years ago

Flags: needinfo?(wcosta)

Wander Lairson Costa

Assignee

Updated

•

9 years ago

Depends on: 1178899

Wander Lairson Costa

Assignee

Comment 35

•

9 years ago

Emulator bustage was caused by Bug 1178899. Should be fixed now, I could run a successfully build:
https://tools.taskcluster.net/task-inspector/#a8ecyIG3QEuGN95D7BOXUg/2

Can we backout the backout?

Flags: needinfo?(wcosta) → needinfo?(cbook)

Wander Lairson Costa

Assignee

Comment 36

•

9 years ago

Just talked to selena, we are going to that in other way.

Flags: needinfo?(cbook)

Wander Lairson Costa

Assignee

Updated

•

9 years ago

Depends on: 1178997

Wander Lairson Costa

Assignee

Comment 37

•

9 years ago

Comment on attachment 8627515 [details]
MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie

Bug 1177190: Update libcurl in docker images. r=selenamarie

libcurl on CentOS 6 doesn't support keealive, so we upgrade it.
The approach we take to avoid breaking buildbot machines is to
grab libcurl from CentOS 7, build it on CentOS 6 and upload rpms
to S3.

Attachment #8627515 - Attachment description: MozReview Request: Bug 1177190: Update libcurl in docker images. r=selena → MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie

Attachment #8627515 - Flags: review+ → review?(sdeckelmann)

Wander Lairson Costa

Assignee

Updated

•

9 years ago

Attachment #8627515 - Flags: review?(sdeckelmann) → review?(dustin)

Wander Lairson Costa

Assignee

Comment 38

•

9 years ago

Comment on attachment 8627515 [details]
MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie

Bug 1177190: Update libcurl in docker images. r=selenamarie

libcurl on CentOS 6 doesn't support keealive, so we upgrade it.
The approach we take to avoid breaking buildbot machines is to
grab libcurl from CentOS 7, build it on CentOS 6 and upload rpms
to S3.

Wander Lairson Costa

Assignee

Comment 39

•

9 years ago

https://hg.mozilla.org/integration/b2g-inbound/rev/4bfe1c223646

Dustin J. Mitchell [:dustin] (he/him)

Comment 40

•

9 years ago

https://reviewboard.mozilla.org/r/12253/#review10881

::: testing/docker/b2g-build/Dockerfile:18
(Diff revision 2)
> +  cd -

You should be able to just 'yum install $url' which avoids loading the RPMs onto disk

Dustin J. Mitchell [:dustin] (he/him)

Comment 41

•

9 years ago

Comment on attachment 8627515 [details]
MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie

It'd be good to have some comments in there regarding why these aren't installed from a yum repo, too.

FWIW, there's another option to enforce keepalive for everything:
  http://www.tldp.org/HOWTO/html_single/TCP-Keepalive-HOWTO/#libkeepalive
why the linux kernel doesn't do this by default, I don't know.  Would the Internet collapse from an extra TCP round trip every 5 minutes?  The number of TCP connections that last that long is a vanishingly small portion of all TCP connections.  But I digress..

Attachment #8627515 - Flags: review?(dustin) → review+

Ryan VanderMeulen [:RyanVM]

Comment 42

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/4bfe1c223646

Status: ASSIGNED → RESOLVED

Closed: 9 years ago

status-firefox42: --- → fixed

Resolution: --- → FIXED

Josh Cheng [:josh]

Updated

•

9 years ago

Flags: needinfo?(jocheng)

Ryan VanderMeulen [:RyanVM]

Updated

•

9 years ago

Flags: needinfo?(faramarz)

Nobody; OK to take it and work on it

Updated

•

6 years ago

Component: General Automation → General

timeout.patch 9 years ago Hal Wine [:hwine] use NI! 1.64 KB, patch	hwine : review+ hwine : checked-in+	Details \| Diff \| Splinter Review
MozReview Request: Bug 1177190: Update libcurl in docker images. r=selenamarie 9 years ago Wander Lairson Costa 40 bytes, text/x-review-board-request	dustin : review+	Details