improve one_size_heap_list: use rwlock to speedup the allocation/free #6370

JiakunYan · 2023-10-16T16:35:48Z

The original implementation uses an HPX mutex to ensure thread safety. This PR replaces it with a pthread rwlock, which greatly improves its performance. Multiple allocation/free calls can proceed simultaneously under the read lock, and the pthread read lock only costs an atomic fetch-and-add in most cases.

Need to discuss: directly using pthread rwlock in HPX codebase may not be a good idea, but implementing the whole pthread rwlock algorithm in HPX can also be cumbersome.

StellarBot · 2023-10-16T16:40:14Z

Can one of the admins verify this patch?

codacy-production · 2023-10-16T19:42:51Z

Coverage summary from Codacy

See diff coverage on Codacy

Coverage variation	Diff coverage
-0.05%	91.27%

Coverage variation details

	Coverable lines	Covered lines	Coverage
Common ancestor commit (`5ffdfbe`)	193075	164320	85.11%
Head commit (`49e0553`)	189172 (-3903)	160902 (-3418)	85.06% (-0.05%)

Coverage variation is the difference between the coverage for the head and common ancestor commits of the pull request branch: <coverage of head commit> - <coverage of common ancestor commit>

Diff coverage details

	Coverable lines	Covered lines	Diff coverage
Pull request (#6370)	355	324	91.27%

Diff coverage is the percentage of lines that are covered by tests out of the coverable lines that the pull request added or modified: <covered lines added or modified>/<coverable lines added or modified> * 100%

See your quality gate settings Change summary preferences

hkaiser · 2023-10-17T17:16:20Z

@JiakunYan I think we can improve the implementation of our shared_mutex by placing its state data into a single atomic and start locking things internally only if really needed; i.e. replace

hpx/libs/core/synchronization/include/hpx/synchronization/shared_mutex.hpp

Lines 29 to 35 in 103a7b8

    
           struct state_data 
        
           { 
        
               unsigned shared_count; 
        
               bool exclusive; 
        
               bool upgrade; 
        
               bool exclusive_waiting_blocked; 
        
           };

JiakunYan · 2023-11-08T04:28:30Z

@hkaiser I did some brief benchmarking using Octotiger on Rostam. The performance of the new shared lock is pretty good! (around 5% improvement for a specific octotiger configuration)

hkaiser · 2023-12-03T20:37:09Z

@JiakunYan this looks good now and the test errors are unrelated. Can this be merged now?

JiakunYan · 2023-12-03T20:39:23Z

@JiakunYan this looks good now and the test errors are unrelated. Can this be merged now?

Sure!

hkaiser · 2023-12-03T20:57:31Z

@JiakunYan this looks good now and the test errors are unrelated. Can this be merged now?

Sure!

Could you fix the conflicts, please?

JiakunYan · 2023-12-03T22:06:56Z

@JiakunYan this looks good now and the test errors are unrelated. Can this be merged now?

Sure!

Could you fix the conflicts, please?

I tried rebasing this PR to the current master branch. However, git is behaving somehow weirdly. It kept missing to apply some changes in the Trying to streamline shared_mutex. Could you try rebasing it on your end and see what will happen?

hkaiser · 2023-12-03T23:01:02Z

@JiakunYan this looks good now and the test errors are unrelated. Can this be merged now?

Sure!

Could you fix the conflicts, please?

I tried rebasing this PR to the current master branch. However, git is behaving somehow weirdly. It kept missing to apply some changes in the Trying to streamline shared_mutex. Could you try rebasing it on your end and see what will happen?

If I rebase my branch, then you will not be able to merge it to your branch because of conflicts. I'll resolve the conflicts manually and merge your work manually as well. Thanks!

hkaiser added category: AGAS type: enhancement category: LCOs type: compatibility issue labels Oct 17, 2023

JiakunYan marked this pull request as ready for review November 8, 2023 04:24

JiakunYan requested a review from hkaiser as a code owner November 8, 2023 04:24

JiakunYan force-pushed the improve-heap branch from dbe436f to c51afc0 Compare November 10, 2023 16:18

JiakunYan and others added 5 commits November 27, 2023 17:26

improve one_size_heap_list: use rwlock to speedup the allocation/free

ec41d8f

Trying to streamline shared_mutex

d20c278

Using shared_mutex for one_size_heap

b84fda4

fix unused variable error

7830828

fix a false negative assertion in wrapper_heap

0f41e91

JiakunYan force-pushed the improve-heap branch from c51afc0 to 0f41e91 Compare November 27, 2023 23:26

Fixing testing issues

49e0553

JiakunYan force-pushed the improve-heap branch from d813e5f to 49e0553 Compare December 2, 2023 21:26

hkaiser merged commit 2c3216f into STEllAR-GROUP:master Dec 3, 2023
4 checks passed

hkaiser added this to the 1.10.0 milestone Jan 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve one_size_heap_list: use rwlock to speedup the allocation/free #6370

improve one_size_heap_list: use rwlock to speedup the allocation/free #6370

JiakunYan commented Oct 16, 2023 •

edited

Loading

StellarBot commented Oct 16, 2023

codacy-production bot commented Oct 16, 2023 •

edited

Loading

hkaiser commented Oct 17, 2023 •

edited

Loading

JiakunYan commented Nov 8, 2023

hkaiser commented Dec 3, 2023

JiakunYan commented Dec 3, 2023

hkaiser commented Dec 3, 2023

JiakunYan commented Dec 3, 2023 •

edited

Loading

hkaiser commented Dec 3, 2023

improve one_size_heap_list: use rwlock to speedup the allocation/free #6370

improve one_size_heap_list: use rwlock to speedup the allocation/free #6370

Conversation

JiakunYan commented Oct 16, 2023 • edited Loading

StellarBot commented Oct 16, 2023

codacy-production bot commented Oct 16, 2023 • edited Loading

Coverage summary from Codacy

See diff coverage on Codacy

See your quality gate settings Change summary preferences

hkaiser commented Oct 17, 2023 • edited Loading

JiakunYan commented Nov 8, 2023

hkaiser commented Dec 3, 2023

JiakunYan commented Dec 3, 2023

hkaiser commented Dec 3, 2023

JiakunYan commented Dec 3, 2023 • edited Loading

hkaiser commented Dec 3, 2023

JiakunYan commented Oct 16, 2023 •

edited

Loading

codacy-production bot commented Oct 16, 2023 •

edited

Loading

hkaiser commented Oct 17, 2023 •

edited

Loading

JiakunYan commented Dec 3, 2023 •

edited

Loading