e2e tests: limit name_len_slow to 3, split e2e tests from others #842

skshetry · 2025-01-21T04:04:27Z

The test_query_e2e takes almost ~8mins to run (whole CI job takes 11 mins). The name_len_slow script is the main culprit, since it sleeps for 1 sec in each udf function and that mapper is run in a single process parallel mode.

474.21s call     tests/test_query_e2e.py::test_query_e2e@tmpfile

This commit adds a limit of 3 files to the name_len_slow script, which is enough, since it's only running a single process.
(We immediately interrupt the running process after seeing "UDF Processing Started" gets printed).

This also split tests into two: one for the e2e tests and one for the rest, so that these things are more obvious in the future.

cloudflare-workers-and-pages · 2025-01-21T04:05:34Z

Deploying datachain-documentation with Cloudflare Pages

Latest commit:	`c019173`
Status:	✅ Deploy successful!
Preview URL:	https://0c5f2ae9.datachain-documentation.pages.dev
Branch Preview URL:	https://name-len-slow.datachain-documentation.pages.dev

View logs

codecov · 2025-01-21T04:34:41Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 87.42%. Comparing base (1b5a585) to head (c019173).
Report is 1 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #842   +/-   ##
=======================================
  Coverage   87.42%   87.42%           
=======================================
  Files         128      128           
  Lines       11429    11429           
  Branches     1559     1559           
=======================================
  Hits         9992     9992           
  Misses       1042     1042           
  Partials      395      395

Flag	Coverage Δ
datachain	`87.36% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

[The `test_query_e2e` takes almost ~8mins to run][1] (whole CI job takes 11 mins). The `name_len_slow` script is the main culprit, since it sleeps for 1 sec in each udf function and that mapper is run in a single process parallel mode. ``` 474.21s call tests/test_query_e2e.py::test_query_e2e@tmpfile ``` This commit adds a limit of 3 files to the name_len_slow script, which is enough, since it's only running a single process. (We immediately interrupt the running process after seeing "UDF Processing Started" gets printed). This also split tests into two: one for the e2e tests and one for the rest, so that these things are more obvious in the future. [1]: https://github.com/iterative/datachain/actions/runs/12879531971/job/35907168617#step:8:82

mattseddon · 2025-01-21T05:29:12Z

.github/workflows/tests.yml

+        shell: bash
+
+      - name: Run E2E tests
+        run: nox -s tests-${{ matrix.pyv }} -- -m "e2e" $DISABLE_REMOTES_ARG


[Q] Do we get some benefit from splitting this out?

Will the tests be distributed amongst different workers?

[Q] Do we get some benefit from splitting this out?

I hope that this will make these issues obvious by splitting them.

Will the tests be distributed amongst different workers?

Yes, it runs in the same way as the above test run. PTAL at https://github.com/iterative/datachain/actions/runs/12880225335/job/35909694647#step:9:17 where this workflow is actually run (we use pull_request_target which is running workflow from the default branch).

mattseddon · 2025-01-22T10:13:06Z

did this change break the coverage reports?

…#842) [The `test_query_e2e` takes almost ~8mins to run][1] (whole CI job takes 11 mins). The `name_len_slow` script is the main culprit, since it sleeps for 1 sec in each udf function and that mapper is run in a single process parallel mode. ``` 474.21s call tests/test_query_e2e.py::test_query_e2e@tmpfile ``` This commit adds a limit of 3 files to the name_len_slow script, which is enough, since it's only running a single process. (We immediately interrupt the running process after seeing "UDF Processing Started" gets printed). This also split tests into two: one for the e2e tests and one for the rest, so that these things are more obvious in the future. [1]: https://github.com/iterative/datachain/actions/runs/12879531971/job/35907168617#step:8:82

skshetry temporarily deployed to internal January 21, 2025 04:04 — with GitHub Actions Inactive

skshetry changed the title ~~e2e tests: limit name_len_slow to 3, split e2e tests from other tests~~ e2e tests: limit name_len_slow to 3, split e2e tests from others Jan 21, 2025

skshetry temporarily deployed to internal January 21, 2025 04:08 — with GitHub Actions Inactive

skshetry force-pushed the name-len-slow branch from 877d974 to 445dad7 Compare January 21, 2025 04:30

skshetry temporarily deployed to internal January 21, 2025 04:30 — with GitHub Actions Inactive

skshetry force-pushed the name-len-slow branch from 445dad7 to c019173 Compare January 21, 2025 04:38

skshetry temporarily deployed to internal January 21, 2025 04:38 — with GitHub Actions Inactive

skshetry temporarily deployed to internal January 21, 2025 04:39 — with GitHub Actions Inactive

mattseddon approved these changes Jan 21, 2025

View reviewed changes

mattseddon reviewed Jan 21, 2025

View reviewed changes

skshetry merged commit 7d0913e into main Jan 21, 2025
65 checks passed

skshetry deleted the name-len-slow branch January 21, 2025 06:30

mattseddon mentioned this pull request Jan 22, 2025

append e2e tests coverage instead of overwriting #851

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

e2e tests: limit name_len_slow to 3, split e2e tests from others #842

e2e tests: limit name_len_slow to 3, split e2e tests from others #842

skshetry commented Jan 21, 2025

cloudflare-workers-and-pages bot commented Jan 21, 2025 •

edited

Loading

codecov bot commented Jan 21, 2025 •

edited

Loading

mattseddon Jan 21, 2025

mattseddon Jan 21, 2025

skshetry Jan 21, 2025

mattseddon commented Jan 22, 2025

e2e tests: limit name_len_slow to 3, split e2e tests from others #842

e2e tests: limit name_len_slow to 3, split e2e tests from others #842

Conversation

skshetry commented Jan 21, 2025

cloudflare-workers-and-pages bot commented Jan 21, 2025 • edited Loading

Deploying datachain-documentation with Cloudflare Pages

codecov bot commented Jan 21, 2025 • edited Loading

Codecov Report

mattseddon Jan 21, 2025

Choose a reason for hiding this comment

mattseddon Jan 21, 2025

Choose a reason for hiding this comment

skshetry Jan 21, 2025

Choose a reason for hiding this comment

mattseddon commented Jan 22, 2025

cloudflare-workers-and-pages bot commented Jan 21, 2025 •

edited

Loading

codecov bot commented Jan 21, 2025 •

edited

Loading