Replies: 1 comment 1 reply
I'm going to go back to the current executor and see if I can try starting a system once when it's spawned. This won't allow as many systems to run as this approach does, but it might get a lot of the gains.
I was doing some experiments using a shared struct for controlling component archetype access in the executor. It ended up being a little slower than the current method. I'm making this discussion thread to record my results in case something in it becomes useful in the future.
The majority of the changes are in this file.
Hypothesis
My expectation going in was that this approach would have overhead from cloning the shared access struct and from the extra logic needed for coordination, but that we might be able to make up for it by starting systems while the `prepare_systems` function is still running. We can see in this trace that systems are not allowed to start until after all the systems have been prepared for running. If we could start systems during this window, we could improve parallelism and speed, as long as the extra coordination overhead was not more than the time saved by running during prepare.
Approach
The basic idea was to use channels and a shared conflict-access struct to coordinate between tasks when a system is allowed to run, rather than going through a central executor.
The parallel executor in bevy is responsible for not running a system if the currently running systems conflict with that system's read/write access. This is done through an access-conflict fixed bit set. This approach needs to be able to share the bit set between threads, so it puts the bit set behind an `Arc<Mutex<>>`. The mutex is necessary over something like an atomic because rebuilding the access when a system is removed from the pool of running systems requires looping over all the running systems, and systems cannot be allowed to read the access while it is in this intermediate invalid state. A minimal sketch of this shared conflict check is shown below.
Besides the shared access, systems need to coordinate when they're allowed to start based on their `before` and `after` dependencies. In main this is done by counting how many dependencies have run. In this approach we use channels, and each system waits for all of its dependencies to send a message saying they have run. This gets a little tricky when we're starting systems during prepare: if the receiving channel is not cloned before the finish is sent, it will not see the finish message. So we delay sending the finish until the sending channel has seen its expected number of dependants. This required another channel so as to avoid a tight loop; it only needs to check this number when a new system is spawned.
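To make the shared-access idea concrete, here is a minimal sketch of a conflict check behind an `Arc<Mutex<>>`. It is a stand-in under stated assumptions: it uses a plain `u64` mask instead of Bevy's fixed bit set, ignores the read/write distinction, and the names (`SharedAccess`, `try_start`, `rebuild`) are illustrative rather than the ones in the branch.

```rust
use std::sync::{Arc, Mutex};

// Hypothetical shared conflict tracker: bit i set means some running system
// is currently using component access i.
#[derive(Default, Clone)]
struct SharedAccess {
    active: Arc<Mutex<u64>>,
}

impl SharedAccess {
    // Try to reserve this system's access. Returns true (and records the
    // access) only if it doesn't conflict with the currently running systems.
    fn try_start(&self, system_mask: u64) -> bool {
        let mut active = self.active.lock().unwrap();
        if *active & system_mask == 0 {
            *active |= system_mask;
            true
        } else {
            false
        }
    }

    // Rebuild the access from the systems that are still running. Holding the
    // mutex keeps other tasks from observing the half-rebuilt state, which is
    // why a plain atomic is not enough.
    fn rebuild(&self, still_running: &[u64]) {
        let mut active = self.active.lock().unwrap();
        *active = still_running.iter().fold(0, |acc, mask| acc | mask);
    }
}

fn main() {
    let access = SharedAccess::default();
    let system_a = 0b0011; // uses accesses 0 and 1
    let system_b = 0b0010; // uses access 1, so it conflicts with system_a
    assert!(access.try_start(system_a));
    assert!(!access.try_start(system_b)); // blocked while system_a runs
    access.rebuild(&[]); // system_a finished; nothing left running
    assert!(access.try_start(system_b));
}
```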
Results
The changes do work, at least: we can see that systems do run during the `prepare_systems` function. When running Tracy on the many cubes example, I was seeing very similar frame times between main and my branch, but without Tracy I would see a performance loss from 54 fps to 52 fps. So it's in the same ballpark, but not quite the same.
I suspect that a lot of the lower performance is due to the cloning of the new channels and the shared access struct. This shows up in `prepare_systems` for the first stage taking 900 us vs 600 us on my machine. We make up for some of this by starting systems earlier, but it seems likely that we don't make up all of it. There is likely some overhead from the coordination too, but it's hard to say how much. That extra overhead would matter most in long chains of systems, which can limit how multithreaded things can be.
Overall a fun experiment, but probably not one I'm going to take further. There might be some more possible performance improvements, but I ran out of ones that would be relatively easy to do and felt like clear wins. Also, as the benchmarks below show, the contrived benches have some significant performance regressions, which suggests that coordinating through a mutex causes significant overhead when there are a lot of conflicting systems.
Ideas for Improvement
Some random ideas I had that I didn't pursue.
We could create a new type of channel that clones the messages currently in the channel when the channel is cloned. This would allow one of the channels to be removed, since dependants cloned later would still see that the system had finished (a sketch of this property follows below). Did this and saw a small improvement.
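For illustration, here is a small sketch of the property such a channel would provide: a handle cloned after a dependency has already reported finishing still observes that message. This is a stand-in built on a shared counter and a `Condvar` rather than an actual channel, and the names (`FinishSignal`, `send_finished`, `wait_for`) are hypothetical, not the branch's API.

```rust
use std::sync::{Arc, Condvar, Mutex};

// Hypothetical finish signal: the count of "finished" messages lives in shared
// state, so clones made after a send still observe it.
#[derive(Clone, Default)]
struct FinishSignal {
    inner: Arc<(Mutex<usize>, Condvar)>,
}

impl FinishSignal {
    // A dependency reports that it has finished running.
    fn send_finished(&self) {
        let (count, cvar) = &*self.inner;
        *count.lock().unwrap() += 1;
        cvar.notify_all();
    }

    // Block until at least `expected` dependencies have reported finishing.
    fn wait_for(&self, expected: usize) {
        let (count, cvar) = &*self.inner;
        let mut finished = count.lock().unwrap();
        while *finished < expected {
            finished = cvar.wait(finished).unwrap();
        }
    }
}

fn main() {
    let signal = FinishSignal::default();
    // The dependency finishes *before* the dependant clones its handle.
    signal.send_finished();
    // The late clone still sees the earlier message, which is the property
    // the custom channel was meant to provide.
    let late_clone = signal.clone();
    late_clone.wait_for(1); // returns immediately
}
```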
Notes
Appendix: Benchmarks
Tracy Frame Times
The screencaps here show the new branch as faster than main, but on my machine the two would flip back and forth a bit depending on how fast my computer wanted to run. I would say they were very similar when Tracy was enabled.
Empty Systems
As expected, the empty-system benchmarks have hilariously bad regressions percentage-wise. These might not have mattered if we had gained enough time with real systems.
Busy Systems
We see some small improvements and regressions in these tests. The changes
here are small enough that they could just be noise.
Contrived
Seeing some significant regressions here. The conflicts are probably causing
extra overhead as the futures need to be repolled.