core(driver): security errors are no longer a fatal or pageload error #8865

paulirish · 2019-05-05T18:59:57Z

Following the investigation and proposal outlined here: #6655 (comment)

I've tested with every badssl page and we never hang indefinitely on any of them. I did see a few PROTOCOL_TIMEOUT on the offlinePass though.

Also every badssl case where a security interstitial is shown hits our FAILED _DOC_REQ case and the localizedFailDescription that we show in the warning captures the same error code we got via securityState.insecureDescriptions. \o/

see also #8340

patrickhulce

Well that's awesome news!

LGTM

lighthouse-core/gather/driver.js

brendankenny

I really like the change, but not sure about the very end result. Does it align with "we shouldn't throw if we can technically finish loading the page" since we can't technically load the page? It feels like there should be non-zero exit code for this one, otherwise it's going to be very easy to miss in any programmatic environment.

brendankenny · 2019-05-06T07:09:38Z

lighthouse-cli/test/smokehouse/error-expectations.js

@@ -18,10 +18,10 @@ module.exports = [
    },
  },
  {
-    errorCode: 'INSECURE_DOCUMENT_REQUEST',
+    errorCode: undefined,


this should have other expectations then? And maybe a comment that this is ensuring there isn't an error (since it's in the error-expectations file :)

This should be fixed by followup to #8340, the comments I have there address the fact that none of these are currently fatal errors but IMO should be non-zero exit codes.

patrickhulce · 2019-05-21T20:08:34Z

What's our opinion on this one are we nuking it entirely like this PR does?

Or are we just going to move it to be non-fatal with a warning like followup to #8340 would?

brendankenny

so, this seems workable, but it has a few drawbacks currently:

this definitely yells that something went wrong, but it's just kind of impressively blown up instead of communicating well what happened. Consistent style for lh-warnings #7011 will help, but is it strange to be showing all this content when none of it is meaningful?
a whole bunch of logging from work that didn't need to be done (all the audits)
as @patrickhulce mentioned, a little unfortunate that most of the error messages are referencing the gatherer (Required TagsBlockingFirstPaint gatherer encountered an error: FAILED_DOCUMENT_REQUEST), but at least they get a good error code in there
also weird that we still take a trace and a devtools log for a page that isn't the page being targeted (this is why all the metrics fail with Something went wrong with recording the trace over your page load. Please run Lighthouse again. (NO_NAVSTART), since their artifacts (traces and devtoolsLogs) do exist)
as a result of how we currently put together runtimeError from the artifacts, NO_NAVSTART doesn't bubble up as a runtime error (even though it's marked as one), so e.g. the current badssl error smoke test doesn't get a runtimeError or an error exit code. That definitely shouldn't depend on what audits are being run.

Maybe this would work better if we added something more specific than the default "no artifact" failure mode for audits in this case

brendankenny · 2019-05-29T01:27:34Z

(I can help come up with actual solutions--not just a list of problems--tomorrow, I just had to leave for the day :)

stale

patrickhulce · 2019-05-30T20:50:52Z

My proposal for path forward:

Land this as-is.
Follow-up PR to make all runtimeErrors exit with a non-zero exit code as discussed here and in core(gather-runner): treat NO_FCP as a pageLoadError #8340. This should not impact any environment other than CLI. Other consumers should already be deciding whether to consume the LHR or not based on the existence of runtimeError.
Land a modified version of core(navigation): Add an option to ignore https errors during navigation #7574 to actually make HTTPS pages with bad certs loadable. This PR as-is doesn't fix the core(navigation): Add an option to ignore https errors during navigation #7574 concern because the pages still fail to load.
Land a rebased version of core: bail on gathering if we have a failure in the first pass #8866.
Land a version of core: bail on gathering if we have a failure in the first pass #8866 but for audits that fails them out with a specifc runtimeError error and skips the unnecessary logging.

Specific responses and rationale to @brendankenny's great points below:

but is it strange to be showing all this content when none of it is meaningful?

IMO, yes it's super strange. But that's the paradigm we've settled on elsewhere and I can understand the rationale for certain types of integrations that want some of the information we collect.

a whole bunch of logging from work that didn't need to be done

Fixed by step 5.

also weird that we still take a trace and a devtools log for a page that isn't the page being targeted

As I understand it this is actually the primary desire and reason for providing this mostly useless LHR and returning the runtimeError instead of throwing. So, feature not bug ;)

as a result of how we currently put together runtimeError from the artifacts, NO_NAVSTART doesn't bubble up as a runtime error (even though it's marked as one), so e.g. the current badssl error smoke test doesn't get a runtimeError or an error exit code. That definitely shouldn't depend on what audits are being run.

I think this is fixed by the combination of steps 2, 5, and 6.

brendankenny · 2019-05-30T22:25:50Z

Follow-up PR to make all runtimeErrors exit with a non-zero exit code as discussed here and in #8340. This should not impact any environment other than CLI. Other consumers should already be deciding whether to consume the LHR or not based on the existence of runtimeError.

I think with #8340 this is already done? (for all instances of runtimeError that actually become an lhr.runtimeError)

Land a modified version of #7574 to actually make HTTPS pages with bad certs loadable. This PR as-is doesn't fix the #7574 concern because the pages still fail to load.

with this PR landed, isn't --chrome-flags="--ignore-certificate-errors" sufficient? (see discussion from #6655 (comment) onwards)

steps 4 and 5 sound good

but is it strange to be showing all this content when none of it is meaningful?

IMO, yes it's super strange. But that's the paradigm we've settled on elsewhere and I can understand the rationale for certain types of integrations that want some of the information we collect.

well the strangeness feels bad :) https://googlechrome.github.io/lighthouse/viewer/?gist=b8c85fa12128c68f017e5074c4e50e2d doesn't seem like something we should be returning. lhr.runtimeError and the exit code are good to go in that case, but everything else is at best useless, at worst confusing.

Maybe we need the Lighthouse version of a server error page, a design reminiscent of the main report but not trying to render a bunch of stuff that makes no sense.

also weird that we still take a trace and a devtools log for a page that isn't the page being targeted

As I understand it this is actually the primary desire and reason for providing this mostly useless LHR and returning the runtimeError instead of throwing. So, feature not bug ;)

I agree in the case that --chrome-flags="--ignore-certificate-errors" is passed in, they get a net::ERR_CERT_DATE_INVALID error but since we can still load the page and test it we should. But without the flag, a trace and devtoolsLog for a page (the security interstitial) that's different than the page they were trying to test seems like a bug not a feature :)

It seems like @paulirish's "we shouldn't throw if we can technically finish loading the page" shouldn't apply if we're loading something but it isn't "the page".

So, suggested addenda to suggested plans:

step 5 += a Chrome interstitial detector that leads to a runtime error
removing some base artifacts in the face of an interstitial (e.g. trace and devtoolsLog of the interstitial)
a new error-state report. Maybe it keeps the ? ? ? ? ? header and Consistent style for lh-warnings #7011-ified warning but doesn't render categories or something

patrickhulce · 2019-05-30T22:42:37Z

I think with #8340 this is already done?

Almost but we're not quite there yet. I called it out in the PR description in #8340 but PAGE_HUNG and INSECURE_DOCUMENT_REQUEST both don't abide by this because of https://github.com/GoogleChrome/lighthouse/pull/8340/files#diff-42278a7ee772120215ea8cfe1b0cb1b1R74. INSECURE_DOCUMENT_REQUEST will take care of itself by landing this, so that just leaves PAGE_HUNG

with this PR landed, isn't --chrome-flags="--ignore-certificate-errors" sufficient?

Well depends on what you mean by sufficient, it'll still be subject to all the other problems you've called out and seems like a LH flag to do both for you sounds like a good idea :)

everything else is at best useless

The point as I understand it is that an integration can still take a look at the artifacts we managed to collect to get screenshots/request specifics and display a better debug screen. Maybe we should be building such a screen based on artifacts too, is part of your argument here?

step 5 += a Chrome interstitial detector that leads to a runtime error

👍

removing some base artifacts in the face of an interstitial (e.g. trace and devtoolsLog of the interstitial)

As I understand it, this seems like it destroys the only value we have in not throwing. Why don't we want to make this information available for debugging?

a new error-state report. Maybe it keeps the ? ? ? ? ? header and #7011-ified warning but doesn't render categories or something

👍

brendankenny · 2019-05-31T23:29:53Z

with this PR landed, isn't --chrome-flags="--ignore-certificate-errors" sufficient?

with this PR

lighthouse https://expired.badssl.com --view --chrome-flags="--ignore-certificate-errors"

really does work fine since Chrome can proceed to load the underlying page:

https://googlechrome.github.io/lighthouse/viewer/?gist=6797a0f98ab9c1a9f3f86abad2bea41a

I personally think we should have an audit that calls out the bad certificate even while everything else is green, but everyone else seems to think everything should be all green in this case since the user explicitly opted in to it, so the --ignore-certificate-errors case may be good to go after this lands.

… same data

patrickhulce · 2019-06-01T02:19:45Z

Wait I thought you were just saying it was the other screenshot and that was the main problem! Oh yes yes, still seems like a flag might be useful at some point in the future but not necessary anymore. 👍

Janpot · 2019-06-01T03:47:05Z

I personally think we should have an audit that calls out the bad certificate even while everything else is green, but everyone else seems to think everything should be all green in this case since the user explicitly opted in to it

IMO It makes sense to make this opt in. And when I do opt in, I'd expect lighthouse to behave very similar when a website has no certificate vs. a bad certificate. I opt in, not because I don't care about security, but because a bad certificate shouldn't block me from auditing every other aspect of my website's performance. If there's an audit that calls me out for not having https, then it should also call me out for having broken https.

patrickhulce · 2019-06-03T19:51:00Z

So where do we stand with this then. @brendankenny do you have specific requests for this PR or are all of your concerns with points other than number 1 in the plan?

brendankenny

We should land this, we just have some rough edges to clean up after it lands

cjolif · 2019-07-24T16:21:48Z

any idea on when a release will that fix will be made?

connorjclark · 2019-07-24T16:35:34Z

#9442 release is imminent

cjolif · 2019-07-24T18:47:01Z

Thanks!!

paulirish requested a review from patrickhulce as a code owner May 5, 2019 18:59

paulirish mentioned this pull request May 5, 2019

core: bail on gathering if we have a failure in the first pass #8866

Closed

paulirish requested a review from brendankenny as a code owner May 5, 2019 19:21

vercel bot deployed to staging May 5, 2019 19:21 View deployment

patrickhulce previously approved these changes May 6, 2019

View reviewed changes

connorjclark reviewed May 6, 2019

View reviewed changes

lighthouse-core/gather/driver.js Show resolved Hide resolved

brendankenny reviewed May 6, 2019

View reviewed changes

connorjclark mentioned this pull request May 7, 2019

Skip "production" audits when in development #3228

Closed

paulirish mentioned this pull request May 25, 2019

core(navigation): Add an option to ignore https errors during navigation #7574

Closed

brendankenny reviewed May 29, 2019

View reviewed changes

paulirish added 3 commits May 31, 2019 16:34

core(driver): security errors are no longer their own fatal issue

24b08e8

remove insecure watching entirely as localizedFailDescription has the…

1dc1d45

… same data

errors smoketest

cfb1fd6

brendankenny force-pushed the remove-insecure-fatalerror branch from 2a9b8d0 to cfb1fd6 Compare May 31, 2019 23:34

googlebot added the cla: yes label May 31, 2019

vercel bot deployed to staging May 31, 2019 23:34 View deployment

brendankenny approved these changes Jun 3, 2019

View reviewed changes

brendankenny merged commit 93387cd into master Jun 3, 2019

This was referenced Jun 3, 2019

Test lighthouse on website with invalid certificate fails #7292

Closed

docs: add instructions for testing with self-signed certificate #9112

Merged

patrickhulce mentioned this pull request Jun 4, 2019

core(gather-runner): convert PAGE_HUNG to non-fatal runtimeError #9121

Merged

brendankenny deleted the remove-insecure-fatalerror branch June 6, 2019 00:11

brendankenny mentioned this pull request Jun 6, 2019

tests(smokehouse): always assert lhr.runtimeError #9130

Merged

patrickhulce mentioned this pull request Jun 10, 2019

core(gather-runner): detect Chrome interstitials #9176

Merged

This was referenced Jun 13, 2019

core(gather-runner): don't save trace on pass with pageLoadError #9198

Merged

core(gather-runner): add PageLoadError base artifact #9236

Merged

brendankenny mentioned this pull request Jun 26, 2019

core(config): assert all audit requiredArtifacts will be gathered #9284

Merged

This was referenced Jul 12, 2019

Test site with invalid SSL certificate fails #9359

Closed

Give lighthouse an option to ignore certificate errors #559

Closed

snyk-bot mentioned this pull request Mar 21, 2020

[Snyk] Upgrade lighthouse from 5.1.0 to 5.6.0 godaddy/lighthouse4u#13

Merged

brendankenny mentioned this pull request Aug 24, 2021

core(fr): collect devtoolsLogs on pageLoadError #12980

Merged

brendankenny mentioned this pull request Jul 27, 2023

core: add DevtoolsLogError and TraceError artifacts #15311

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core(driver): security errors are no longer a fatal or pageload error #8865

core(driver): security errors are no longer a fatal or pageload error #8865

core(driver): security errors are no longer a fatal or pageload error #8865

core(driver): security errors are no longer a fatal or pageload error #8865

Conversation

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment