[fix](test) Fix flaky adaptive scan profile test by BiteTheDDDDt · Pull Request #65117 · apache/doris

BiteTheDDDDt · 2026-07-01T11:18:45Z

Proposed changes

Stabilize query_profile/adaptive_pipeline_task_serial_read_on_limit by removing the fixed sleep before reading query profiles. The test now records the exact last_query_id() after each profiled query, waits for that profile row to reach Profile Completion State: COMPLETE, and then fetches the detailed profile before checking MaxScanConcurrency.

Root cause

The profile list and detailed BE profile are reported asynchronously after query results are returned. Under CI load, the old fixed wait could read /rest/v1/query_profile before the just-finished query appeared, causing intermittent Profile ID ... is not found failures.
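The replacement of the fixed sleep with polling can be illustrated with a minimal sketch. This is not the Groovy code from the PR: the function name wait_then_fetch_profile and the get_state / get_detail callables are hypothetical stand-ins for whatever the regression framework uses to read the profile list and the detailed profile. Only the COMPLETE state string comes from the PR description. A state of None models the case where the profile row is not yet visible.

```python
import time
from typing import Callable, Optional


def wait_then_fetch_profile(
    query_id: str,
    get_state: Callable[[str], Optional[str]],   # hypothetical: reads the profile list row
    get_detail: Callable[[str], str],            # hypothetical: reads the detailed profile
    timeout_s: float = 60.0,
    interval_s: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Poll until the profile row for query_id is COMPLETE, then fetch the detail.

    Any state other than "COMPLETE" (including None, meaning the row has not
    appeared yet) is treated as "not ready" and retried until the deadline.
    """
    deadline = clock() + timeout_s
    while get_state(query_id) != "COMPLETE":
        if clock() >= deadline:
            raise TimeoutError(
                f"profile {query_id} not COMPLETE after {timeout_s}s"
            )
        sleep(interval_s)
    # Only read the detailed profile once the row is known to be complete.
    return get_detail(query_id)


if __name__ == "__main__":
    # Fake backend: the row is missing for two polls, then becomes COMPLETE.
    states = iter([None, None, "COMPLETE"])
    detail = wait_then_fetch_profile(
        "example-query-id",
        get_state=lambda qid: next(states),
        get_detail=lambda qid: f"detailed profile for {qid}",
        sleep=lambda s: None,  # skip real waiting in the demo
    )
    print(detail)
```

Unlike a fixed sleep, the wait here is bounded by a deadline and ends as soon as the profile is ready, so a slow CI machine costs extra polls rather than a failed lookup.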

Validation

git diff --check
timeout 300 ./run-regression-test.sh --run -f regression-test/suites/query_profile/adaptive_pipeline_task_serial_read_on_limit.groovy

Result: Test 1 suites, failed 0 suites.

hello-stephen · 2026-07-01T11:18:51Z

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

What problem was fixed (it's best to include specific error reporting information). How it was fixed.
Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
What features were added. Why was this function added?
Which code was refactored and why was this part of the code refactored?
Which functions were optimized and what is the difference before and after the optimization?

BiteTheDDDDt · 2026-07-01T11:20:27Z

run buildall

BiteTheDDDDt · 2026-07-01T11:20:49Z

/review

github-actions

Automated review summary: I reviewed the single GitHub-changed file, regression-test/suites/query_profile/adaptive_pipeline_task_serial_read_on_limit.groovy, plus the relevant profile helper, last_query_id() implementation, session-variable forwarding, and BE scan concurrency counter paths. I did not find a substantiated issue that should block this PR.

Critical checkpoint conclusions:

Goal and test coverage: the PR targets a flaky query-profile lookup by capturing the exact last_query_id() and polling the matching profile until Profile Completion State: COMPLETE. That matches the existing profile polling pattern used by the regression framework/plugin helpers and preserves the intended serial/non-serial scan cases.
Scope and clarity: the change is narrowly scoped to the flaky regression test helper and call sites.
Concurrency/lifecycle: no production concurrency path is changed. The asynchronous profile lifecycle is handled more directly than the previous fixed sleep.
Configuration/session propagation: the existing test still covers the limit threshold, predicate case, and disabled enable_adaptive_pipeline_task_serial_read_on_limit case. The session variables remain normal query options forwarded to BE.
Compatibility/storage/transactions: not applicable; this is test-only and does not change protocol, storage format, transactions, or persisted state.
Parallel/special paths: I checked the BE MaxScanConcurrency counter semantics and the multi-BE topology concern. The remaining tolerant negative assertion predates this PR and is not a new regression from the changed lines.
Test artifacts/style: this assertion-style query-profile regression test does not need a .out file. Added patch lines have no trailing whitespace.

Validation: I verified the GitHub PR file list and patch hygiene statically. I could not rerun the regression test in this checkout because thirdparty/installed is missing; the PR description reports timeout 300 ./run-regression-test.sh --run -f regression-test/suites/query_profile/adaptive_pipeline_task_serial_read_on_limit.groovy passed.

User focus points: none were provided beyond the normal full-PR review.

Subagent conclusions: optimizer-rewrite found no candidates. tests-session-config proposed TSC-1 about the negative profile assertion; I dismissed it with code evidence as a pre-existing/topology-tolerant assertion rather than a PR-introduced bug. Convergence round 1 ended with both live subagents replying NO_NEW_VALUABLE_FINDINGS for the same ledger/comment set, with no inline comments proposed.

fix flaky adaptive scan profile test

4720147

BiteTheDDDDt marked this pull request as ready for review July 1, 2026 11:19

github-actions Bot reviewed Jul 1, 2026

Mryange approved these changes Jul 2, 2026

BiteTheDDDDt merged commit 5b86349 into apache:master Jul 2, 2026
35 checks passed

BiteTheDDDDt merged 1 commit into apache:master from BiteTheDDDDt:codex/fix-adaptive-profile-flake

4 participants
