C++: Improve SARIF severity level reporting of extractor diagnostics #6830

henrymercer · 2021-10-07T17:44:10Z

The SARIF spec defines errors and warnings as follows:

"error": A serious problem was found. The condition encountered by the tool resulted in the analysis being halted or caused the results to be incorrect or incomplete.

"warning": A problem that is not considered serious was found. The condition encountered by the tool is such that it is uncertain whether a problem occurred, or is such that the analysis might be incomplete but the results that were generated are probably valid.

The goal of this PR is to report extraction errors that in most cases won't break the analysis in a significant way as warnings rather than errors. This helps set the right expectations when these messages appear in the diagnostic data output by the CodeQL Action and CLI.

MathiasVP · 2021-10-07T17:52:25Z

> 2. Does this need a changelog note?

I think so, yes.

aschackmull · 2021-10-08T07:34:07Z

I don't claim to fully understand all the moving parts here, but this looks like a serious buildup of tech debt. With this PR the distinction between errors and warnings becomes really blurred and confusing - extractors distinguish these two categories, but then they're suddenly mixed and matched in the QL. It feels like there are some underlying questions that need to be answered:

1. Do extractors emit certain types of errors that would be more reasonable as warnings? If so, then the changes should likely be in the extractors rather than ad-hoc translations in the QL.
2. Are all errors emitted by the extractors mostly harmless? And are they at the same time more serious than the current warnings? Then should we perhaps have a different severity level in-between? I.e. are the current set of severity levels sufficient?

henrymercer · 2021-10-08T10:37:06Z

Hi @aschackmull, thank you for your comment. I agree with your concerns. I think most of your comments are best addressed by language teams. However, on the following point:

> 2. Then should we perhaps have a different severity level in-between? I.e. are the current set of severity levels sufficient?

I'll note that it would be beneficial to stick with the severity levels that are part of the SARIF spec to avoid the need for consumers of our results to introduce custom support to process the severities of diagnostic messages.

The motivation for this change is somewhat of a stopgap solution to address customer concerns. I think the kind of changes that you're proposing to extractors make sense conceptually, but they need to be owned by the language teams, and they will take more time to implement.

A proposal that could address customer concerns quickly while addressing some of the problems you identified with the current state of the PR is to reduce the SARIF level of each extractor diagnostic. For example, extractor errors would be mapped to the warning SARIF level, and extractor warnings would be mapped to the note SARIF level. This perhaps gives more appropriate semantics to extractor diagnostics (extractor errors are often not serious problems that break the analysis in a significant way), while also preserving the distinction between errors and warnings.
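As a rough illustration of the level-reduction idea, here is a minimal Python sketch. The names `REDUCED_LEVEL` and `to_sarif_level` are hypothetical and are not part of any extractor or the CodeQL CLI; the sketch only assumes that each diagnostic carries a severity string.

```python
# Hypothetical sketch: none of these names exist in the extractors or the CLI.
# Each extractor severity is reported one SARIF level lower than its name suggests.
REDUCED_LEVEL = {
    "error": "warning",  # extractor errors rarely break the analysis
    "warning": "note",   # extractor warnings become purely informational
}


def to_sarif_level(extractor_severity: str) -> str:
    """Return the SARIF level to report for a given extractor severity."""
    # Severities without a mapping (for example "none") pass through unchanged.
    return REDUCED_LEVEL.get(extractor_severity, extractor_severity)


print(to_sarif_level("error"))    # warning
print(to_sarif_level("warning"))  # note
```

Because errors and warnings still map to two different levels, the distinction the extractors make is preserved.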

@aschackmull @yo-h @turbo @AlonaHlobina @adityasharad @calumgrant @jbj What are your thoughts?
@aschackmull's proposal seems to me the more sensible long-term solution. However, do language teams have the capacity to pick it up?

jbj · 2021-10-08T11:51:48Z

For @aschackmull's questions, I'm guessing this is highly language-specific. For C++, I'd say that the extractor errors reported by ExtractionErrors.ql should be seen as warnings from a user perspective, and so the proposed change looks good to me. Just yesterday we had a support escalation caused by a customer misunderstanding the severity.

We have another diagnostic query, FailedExtractorInvocations.ql, for the cases where the extractor aborts completely. It turns out this query has no severity column, and I think "error" would be appropriate for this query.

jbj · 2021-10-08T11:53:27Z

Today, the diagnostic summary for C/C++ looks like this:

| Severity |                           Message                            |
+----------+--------------------------------------------------------------+
| error    | (313 results for diagnostic "Extraction errors")             |
| none     | (84 results for diagnostic "Failed extractor invocations")   |
| none     | (2487 results for diagnostic "Successfully extracted files") |

I'd like the three severities to instead be warning, error, none (in order).

aschackmull · 2021-10-08T11:55:11Z

> For C++, I'd say that the extractor errors reported by ExtractionErrors.ql should be seen as warnings from a user perspective, and so the proposed change looks good to me

Then at the very least shouldn't some files/queries/predicates be renamed as well? Having a query named ExtractionErrors.ql emit warnings instead of errors seems quite confusing to me.

RasmusWL

From a user perspective I think it could make sense to treat extractor errors as warnings for Python. (Users can't really do anything about it if the problem is in our parser.) However, I would like to discuss this with the rest of the Python team, which we will do on Monday afternoon. Leaving a blocked review until then.

igfoo · 2021-10-08T12:23:47Z

Do you mean you want

| warning | (313 results for diagnostic "Extraction errors")             |

@jbj? I'm not sure how much having only 1, rather than 2, instances of "error" would have helped.

jbj · 2021-10-08T12:26:37Z

Together with renaming files, predicates and metadata as suggested in #6830 (comment), I think the table should become

| Severity |                           Message                            |
+----------+--------------------------------------------------------------+
| error    | (84 results for diagnostic "Failed extractor invocations")   |
| warning  | (313 results for diagnostic "Extraction warnings")           |
| none     | (2487 results for diagnostic "Successfully extracted files") |

Maybe none should become note, depending on how that's described in the SARIF spec.
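The ordering of the proposed table can be sketched in Python as follows. The counts and diagnostic names are taken from the table above; `SEVERITY_RANK` and `summary_lines` are hypothetical helpers for illustration, not how the CLI actually renders its summary.

```python
# Hypothetical sketch: list the most severe diagnostics first, as in the proposed table.
SEVERITY_RANK = {"error": 0, "warning": 1, "note": 2, "none": 3}

# (severity, diagnostic name, result count), copied from the proposed table.
ROWS = [
    ("warning", "Extraction warnings", 313),
    ("none", "Successfully extracted files", 2487),
    ("error", "Failed extractor invocations", 84),
]


def summary_lines(rows):
    """Return one formatted line per diagnostic, ordered by decreasing severity."""
    ordered = sorted(rows, key=lambda row: SEVERITY_RANK[row[0]])
    return [
        f'| {severity:<8} | ({count} results for diagnostic "{name}")'
        for severity, name, count in ordered
    ]


for line in summary_lines(ROWS):
    print(line)
```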

henrymercer · 2021-10-08T13:02:14Z

Copying from the SARIF spec:

"error": A serious problem was found. The condition encountered by the tool resulted in the analysis being halted or caused the results to be incorrect or incomplete.

"warning": A problem that is not considered serious was found. The condition encountered by the tool is such that it is uncertain whether a problem occurred, or is such that the analysis might be incomplete but the results that were generated are probably valid.

"note": The notification is purely informational. There is no required action.

"none": This is a trace notification (typically, debug output from the tool).

Given the quantity of results it produces, I would weakly suggest that the "Successfully extracted files" diagnostic seems like debug output and therefore should have severity none. I don't have a strong opinion against using note though.
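For reference, a minimal Python sketch of how one of these levels might appear on a SARIF notification object. The `descriptor`, `level` and `message` properties follow the SARIF 2.1.0 notification object; the diagnostic ID and message text are made up for illustration, and the exact shape the CLI emits is an assumption here.

```python
import json

# The four level values quoted from the SARIF spec above.
SARIF_LEVELS = {"error", "warning", "note", "none"}


def make_notification(diagnostic_id: str, level: str, text: str) -> dict:
    """Build a minimal SARIF-style notification (illustrative, not the CLI's code)."""
    if level not in SARIF_LEVELS:
        raise ValueError(f"not a SARIF level: {level!r}")
    return {
        "descriptor": {"id": diagnostic_id},
        "level": level,
        "message": {"text": text},
    }


# Hypothetical ID and message: a per-file "successfully extracted" trace entry.
notification = make_notification(
    "example/successfully-extracted-files", "none", "File successfully extracted."
)
print(json.dumps(notification, indent=2))
```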

AlonaHlobina · 2021-10-08T13:22:40Z

> Given the quantity of results it produces, I would weakly suggest that the "Successfully extracted files" diagnostic seems like debug output and therefore should have severity none. I don't have a strong opinion against using note though.

I tend to agree with @henrymercer. From the customer's perspective, it will be less confusing. We do not necessarily want them to fix these errors. In many cases, it is not even possible for customers to do something about them. Reducing the criticality of the message we send here will help to manage the expectations.

henrymercer · 2021-10-08T15:09:15Z

Thanks @jbj @aschackmull @RasmusWL @igfoo @AlonaHlobina for your input. There's a clear way forward for C++, so I'm going to retarget this PR to just C++.

For the other languages, it's great to see that the discussion has started on this. I'll hand over making any adjustments to the severity of the extractor diagnostics to you, and let this PR serve as an example for how we made these adjustments for one language.

henrymercer · 2021-10-08T16:53:02Z

I'm looking for some databases I can use to test the changes I've made to the C++ extractor diagnostics, since there don't appear to be any QL tests for these queries. @criemen I believe you implemented the bulk of these queries in #5414 — do you have any databases you used for testing that you could send me? Thanks!

adityasharad · 2021-10-08T17:17:27Z

> I'm looking for some databases I can use to test the changes I've made to the C++ extractor diagnostics, since there don't appear to be any QL tests for these queries. @criemen I believe you implemented the bulk of these queries in #5414; do you have any databases you used for testing that you could send me? Thanks!

Try a DCA run on the default repos there?

criemen · 2021-10-08T17:18:54Z

@henrymercer There are integration tests for these queries, I believe? I don't have any DBs handy, sorry.

henrymercer · 2021-10-08T17:21:00Z

I had just noticed the CI run and was about to comment — thanks! I'll look into this next week.

This PR no longer changes Python

henrymercer · 2021-10-11T18:59:04Z

Checks failure is unrelated.

jbj

Thanks! LGTM.

jbj · 2021-10-12T14:01:27Z

I'd like to merge this PR (and its corresponding internal PR), but I'm not allowed to click the button when Checks doesn't pass. I can't see what the error is. I'll try to re-run it.

henrymercer · 2021-10-12T14:01:32Z

@adityasharad (or another admin) please could you merge this along with the corresponding internal PR? I'm not able to due to the Checks failure, which is unrelated. Thanks!

henrymercer · 2021-10-12T14:04:33Z

@jbj It's a bug which occurs when we trigger the checks job via Qlucie so it uses a custom branch of our internal code. I've linked the internal issue with the bug report above. I don't think the rerun will help, but will be glad to be proven wrong :)

adityasharad

Internal PR has passed all checks.

henrymercer requested review from a team as code owners October 7, 2021 17:44

github-actions bot added C# C++ Java JS Python labels Oct 7, 2021

github-actions bot added the documentation label Oct 7, 2021

github deleted a comment from 05309667522 Oct 8, 2021

RasmusWL previously requested changes Oct 8, 2021

henrymercer marked this pull request as draft October 8, 2021 16:46

henrymercer force-pushed the henrymercer/report-extraction-errors-as-warnings branch from f600f56 to d5c8b50 Compare October 8, 2021 16:49

henrymercer changed the title ~~Report extraction errors as warnings~~ C++: Improve SARIF severity level reporting of extractor diagnostics Oct 8, 2021

henrymercer removed C# JS Java labels Oct 8, 2021

henrymercer removed the Python label Oct 8, 2021

henrymercer removed request for a team October 8, 2021 16:50

C++: Improve SARIF severity level reporting of extractor diagnostics

5b26d41

henrymercer force-pushed the henrymercer/report-extraction-errors-as-warnings branch from d5c8b50 to 5b26d41 Compare October 8, 2021 16:54

henrymercer added the "depends on internal PR" label (This PR should only be merged in sync with an internal Semmle PR) Oct 11, 2021

henrymercer marked this pull request as ready for review October 11, 2021 18:54

jbj approved these changes Oct 12, 2021

adityasharad approved these changes Oct 12, 2021

adityasharad merged commit a517a05 into main Oct 12, 2021

adityasharad deleted the henrymercer/report-extraction-errors-as-warnings branch October 12, 2021 16:59

RasmusWL mentioned this pull request Oct 20, 2021

Python: Improve SARIF severity level reporting of extractor diagnostics #6928

Merged

MathiasVP mentioned this pull request Mar 22, 2022

C++: Add internal ExtractionError query #8526

Merged
