Python: Modernize `py/mixed-tuple-returns` by tausbn · Pull Request #19136 · github/codeql

tausbn · 2025-03-27T15:34:48Z

Removes the dependence on points-to in favour of an approach based on (local) data-flow.

I first tried a version that used type tracking, as this more accurately mimics the behaviour of the old query. However, I soon discovered that there were many false positives in this setup. The main bad pattern I saw was a helper function somewhere deep inside the code that both receives and returns an argument that can be tuples with different sizes and origins. In this case, global flow produces something akin to a cartesian product of "n-tuples that flow into the function" and "m-tuples that flow into the function" where m < n.

To combat this, I decided to instead focus on only flow within a given function (and so local data-flow was sufficient).

Additionally, another class of false positives I saw was cases where the return type actually witnessed that the function in question could return tuples of varying sizes. In this case it seems reasonable to not flag these instances, since they are already (presumably) being checked by a type checker.

More generally, if you've annotated the return type of the function with anything (not just Tuple[...]), then there's probably little need to flag it.

Removes the dependence on points-to in favour of an approach based on (local) data-flow. I first tried a version that used type tracking, as this more accurately mimics the behaviour of the old query. However, I soon discovered that there were _many_ false positives in this setup. The main bad pattern I saw was a helper function somewhere deep inside the code that both receives and returns an argument that can be tuples with different sizes and origins. In this case, global flow produces something akin to a cartesian product of "n-tuples that flow into the function" and "m-tuples that flow into the function" where m < n. To combat this, I decided to instead focus on only flow _within_ a given function (and so local data-flow was sufficient). Additionally, another class of false positives I saw was cases where the return type actually witnessed that the function in question could return tuples of varying sizes. In this case it seems reasonable to not flag these instances, since they are already (presumably) being checked by a type checker. More generally, if you've annotated the return type of the function with anything (not just `Tuple[...]`), then there's probably little need to flag it.

As we're no longer tracking tuples across function boundaries, we lose the result that related to this setup (which, as the preceding commit explains, lead to a lot of false positives).

Copilot

Pull Request Overview

This PR modernizes the py/mixed-tuple-returns query by removing the dependency on points-to analysis and shifting to a local data-flow approach to reduce false positives.

Removed false positives by no longer flagging tuples passed as function arguments.
Enhanced handling of functions with annotated return types to avoid unnecessary warnings.

Files not reviewed (2)

python/ql/src/Functions/ReturnConsistentTupleSizes.ql: Language not supported
python/ql/test/query-tests/Functions/return_values/ReturnConsistentTupleSizes.expected: Language not supported

Tip: Leave feedback on Copilot's review comments with the 👎 and 👍 buttons to help improve review quality. Learn more

python/ql/src/change-notes/2025-03-27-modernize-mixed-tuple-returns-query.md

joefarebrother

Looks good 👍
Just one minor comment.

joefarebrother · 2025-03-28T14:43:36Z

python/ql/test/query-tests/Functions/return_values/ReturnConsistentTupleSizes.expected

@@ -1,2 +1 @@
 | functions_test.py:306:1:306:39 | Function returning_different_tuple_sizes | returning_different_tuple_sizes returns $@ and $@. | functions_test.py:308:16:308:18 | Tuple | tuple of size 2 | functions_test.py:310:16:310:20 | Tuple | tuple of size 3 |


I might like to put some comments in the tests explaining the expected results.
(in particular that this case no longer gives an alert)

Good idea. I have done so in 6674288.

Adds a comment explaining why we no longer flag the indirect tuple example. Also adds a test case which _would_ be flagged if not for the type annotation.

tausbn added 3 commits March 27, 2025 15:27

Python: Update test expectations

f601f4a

As we're no longer tracking tuples across function boundaries, we lose the result that related to this setup (which, as the preceding commit explains, lead to a lot of false positives).

Python: Add change note

980c7d8

github-actions bot added documentation Python labels Mar 27, 2025

tausbn marked this pull request as ready for review March 27, 2025 22:20

Copilot AI review requested due to automatic review settings March 27, 2025 22:20

tausbn requested a review from a team as a code owner March 27, 2025 22:20

Copilot AI reviewed Mar 27, 2025

View reviewed changes

python/ql/src/change-notes/2025-03-27-modernize-mixed-tuple-returns-query.md Outdated Show resolved Hide resolved

tausbn commented Mar 27, 2025

View reviewed changes

python/ql/src/change-notes/2025-03-27-modernize-mixed-tuple-returns-query.md Outdated Show resolved Hide resolved

Python: Fix grammar in change note

68668b8

joefarebrother previously approved these changes Mar 28, 2025

View reviewed changes

Python: Update test cases

6674288

Adds a comment explaining why we no longer flag the indirect tuple example. Also adds a test case which _would_ be flagged if not for the type annotation.

tausbn dismissed joefarebrother’s stale review via 6674288 March 28, 2025 15:12

tausbn requested a review from joefarebrother March 28, 2025 15:13

joefarebrother approved these changes Mar 28, 2025

View reviewed changes

tausbn merged commit aacdc70 into main Apr 1, 2025
15 checks passed

tausbn deleted the tausbn/python-modernise-mixed-tuple-returns-query branch April 1, 2025 15:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python: Modernize `py/mixed-tuple-returns`#19136

Python: Modernize `py/mixed-tuple-returns`#19136
tausbn merged 5 commits intomainfrom
tausbn/python-modernise-mixed-tuple-returns-query

tausbn commented Mar 27, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

joefarebrother left a comment

Uh oh!

joefarebrother Mar 28, 2025

Uh oh!

tausbn Mar 28, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -1,2 +1 @@
		\| functions_test.py:306:1:306:39 \| Function returning_different_tuple_sizes \| returning_different_tuple_sizes returns $@ and $@. \| functions_test.py:308:16:308:18 \| Tuple \| tuple of size 2 \| functions_test.py:310:16:310:20 \| Tuple \| tuple of size 3 \|

Conversation

tausbn commented Mar 27, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

joefarebrother left a comment

Choose a reason for hiding this comment

Uh oh!

joefarebrother Mar 28, 2025

Choose a reason for hiding this comment

Uh oh!

tausbn Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tausbn Mar 28, 2025 •

edited

Loading