[go: nahoru, domu]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gha: ccruntime_e2e: Do **NOT** fail fast and add a timeout for the test execution #311

Conversation

fidencio
Copy link
Member

gha: ccruntime_e2e: Avoid fail-fast

Every now and then we'll face errors that may be different depending on
the distro being used, due to some distro changes or even some flakiness
in the tests.

Till everything is 100% stable, which is not the case yet, we better
allow the tests to finish even if one test fails, giving us a better
idea of the problem being a generic one or specific to flakiness / one
distro.


gha: ccruntime_e2e: Add a 30 minutes timeout

Today I've faced one test hanging for more than one hour, stuck, without
giving us any useful information, leaving the "cancel" option (which is
up to the user to do) as the only option to finish the test and re-start
it.

As a possible way to avoid wasting resources for too long, let's simply
add a timeout of 30 minutes in the execution of the e2e tests. This is
10 minutes more than what's been observed as the total time taken by
this step.


This is related to #309, but doesn't cover the tests split into reusable parts.

Every now and then we'll face errors that may be different depending on
the distro being used, due to some distro changes or even some flakiness
in the tests.

Till everything is 100% stable, which is not the case yet, we better
allow the tests to finish even if one test fails, giving us a better
idea of the problem being a generic one or specific to flakiness / one
distro.

Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
Copy link
Contributor
@ldoktor ldoktor left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Those are very useful, I only have concerns about the 30m timeout, would you be against increasing it to be safe when the runners are overloaded?

Today I've faced one test hanging for more than one hour, stuck, without
giving us any useful information, leaving the "cancel" option (which is
up to the user to do) as the only option to finish the test and re-start
it.

As a possible way to avoid wasting resources for too long, let's simply
add a timeout of 45 minutes in the execution of the e2e tests.  This is
10 minutes more than what's been observed as the total time taken by
this step.

Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
@fidencio fidencio force-pushed the topic/gha-do-not-fail-fast-and-add-timeout branch from 06b747c to 5c53644 Compare January 2, 2024 12:38
Copy link
Contributor
@ldoktor ldoktor left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Copy link
Member
@stevenhorsman stevenhorsman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks!

@fidencio fidencio merged commit 2039b23 into confidential-containers:main Jan 8, 2024
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants