Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Collection list refresh: Don't update fetched homesets #1222

Open
wants to merge 10 commits into
base: main-ose
Choose a base branch
from

Conversation

sunkup
Copy link
Member

@sunkup sunkup commented Jan 6, 2025

Purpose

At collection list refresh DAVx5 updates homesets when detected a second time. This leads to homesets detected as personal at first encounter (directly within the user principal), wrongly being overwritten as non-personal homesets (when found in a group principal afterwards).

This PR alters CollectionListRefresher.discCoverHomesets() to remember homesets already saved to database during the same refresh and won't overwrite/update them. They might only be updated at a second refresh if changed on server.

Short description

  • add a set of http urls alreadyFetched property to CollectionListRefresher
  • before saving/updating a homeset check wether it is in the set / has already been fetched
  • after saving/updating a homeset, add it to the alreadyFetched set
  • test the new behaviour

Checklist

  • The PR has a proper title, description and label.
  • I have self-reviewed the PR.
  • I have added documentation to complex functions and functions that can be used by other modules.
  • I have added reasonable tests or consciously decided to not add tests.

@sunkup sunkup added the bug Something isn't working label Jan 6, 2025
@sunkup sunkup self-assigned this Jan 6, 2025
@sunkup sunkup requested a review from ArnyminerZ January 7, 2025 10:17
@sunkup sunkup marked this pull request as ready for review January 7, 2025 10:17
@rfc2822
Copy link
Member

rfc2822 commented Jan 7, 2025

I'm still not sure that I understand the difference between "queried" and "saved". However maybe this example illustrates it:

  • We query a principal and find N homesets. So we mark one URL (the principal) as "queried", but all the N homesets as "saved".
  • We query another related URL and find again M homesets. So we mark the related URL as "queried", but all the M homesets as "saved". The homesets from the step before won't be added again, because they're already "saved". So "saved" applies to the operation of saving, while query makes sure that we don't query twice. Which are different things.

Is that correct?

But shouldn't the "saved" then apply to everything we find/save? There are other calls to homeSetRepository.insertOrUpdateByUrl() which are not checked against "alreadySaved". I wonder if we even should add/check the collections to "alreadySaved", too?

I understand that the change fixes our current issue. But I wonder if one would still understand the code / what it does without knowing about the current issue. However this should be the aim.


Another thing: we get the service and thus servcice.type as argument. Can we take the

val homeSetClass: Class<out HrefListProperty>
val properties: Array<Property.Name>
when (service.type) {
    Service.TYPE_CARDDAV -> {
        homeSetClass = AddressbookHomeSet::class.java
//

block out of the discoverHomesets and make it properties of the object? It's not need to evaluate it more than once and I think it makes the discoverHomesets even more difficult to understand.

@sunkup
Copy link
Member Author

sunkup commented Jan 8, 2025

Is that correct?

That's exactly right. I renamed alreadyFetched to alreadySaved to make it more clear. alreadySaved should hold the URLs of home sets which have been saved to the database already. alreadyQueried should hold the URLs of principals wich we have made propfinds on and don't want to send a propfind to (query) again.

But shouldn't the "saved" then apply to everything we find/save? There are other calls to homeSetRepository.insertOrUpdateByUrl() which are not checked against "alreadySaved". I wonder if we even should add/check the collections to "alreadySaved", too?

I don't think so. I think by that argument already"Queried" would then have to apply to (remember) everything we query too - which we don't, since it does not really make sense to do that.

I can see however that both alreadyQueried and alreadySaved are only used in discoverHomesets() - so do not necessarily need to be properties accessible by other methods in CollectionListRefresher. I think alreadyQueried was made a property because discoverHomesets() is recursive, but we could simply add them both as parameters to the function. That makes their usage clearer and the class less confusing.

Furthermore I will rename alreadySaved to alreadySavedHomeSets, since it will only store home sets and alreadyQueried should only contain URLs of queried principals so that will be alreadyQueriedPrincipals. This should make it clear that other queries and "saves" are not held by these sets of URLs.

See the changes in 1f45c60


Can we take the [...] block out of the discoverHomesets and make it properties of the object?

I guess we could, but by aforemented reasoning I would leave it in the discoverHomesets method. Semantically it belongs there and that is also the only method where it's used. Or what we could do is to create another discoverHomesets method which takes care of the "setup" and then starts the actual recursion.

See the changes in 7247691

@rfc2822 rfc2822 force-pushed the 847-homesets-are-not-personal-when-they-are-detected-two-times-1x-personal-1x-non-personal branch from 7247691 to 205ca22 Compare January 9, 2025 11:32
@rfc2822 rfc2822 force-pushed the 847-homesets-are-not-personal-when-they-are-detected-two-times-1x-personal-1x-non-personal branch from 5c06481 to 7c4538d Compare January 9, 2025 11:45
Copy link
Member

@rfc2822 rfc2822 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! It's more more clear to me especially because alreadyQueriedPrincipals and alreadySavedHomeSets have speaking names and are defined in method context (not in the class context).

Didn't have a look at the tests but as they're green I hope they're fine.


Also extracted homeSetClass and homeSetPropertyNames from the method to the class. While doing so I found that the home-set properties are not used in refreshHomesetsAndTheirCollections:

DavResource(httpClient, homeSetUrl).propfind(1, *RefreshCollectionsWorker.DAV_COLLECTION_PROPERTIES) { response, relation ->

Shouldn't we use homeSetPropertyNames here, too, instead of *RefreshCollectionsWorker.DAV_COLLECTION_PROPERTIES?

However a bit later we also check whether the response is a usable collection. So we should actually query all properties (homeSetPropertyNames + DAV_COLLECTION_PROPERTIES), right?

Or maybe it's wise to put all relevant properties into DAV_COLLECTION_PROPERTIES and always query everything?

Also both DAV_COLLECTION_PROPERTIES and DAV_PRINCIPAL_PROPERTIES are defined in RefreshCollectionsWorker, but only used in CollectionListRefresher. Should probably moved.

What do you think?

@sunkup
Copy link
Member Author

sunkup commented Jan 9, 2025

Shouldn't we use homeSetPropertyNames here, too, instead of *RefreshCollectionsWorker.DAV_COLLECTION_PROPERTIES?

However a bit later we also check whether the response is a usable collection. So we should actually query all properties (homeSetPropertyNames + DAV_COLLECTION_PROPERTIES), right?
Or maybe it's wise to put all relevant properties into DAV_COLLECTION_PROPERTIES and always query everything?

Since we query for collections we will have to use the collection properties in the propfind. We don't need to combine them. I don't see where that would be beneficial to us besides having a single property list.

It's an interesting idea to query all properties at once, however it is of little advantage for us and may put unecessary additional strain on some servers? I would be surprised if there was no RFC standard defining whether we are allowed to do that in our propfinds. Seems like allprop existed but is now deprecated.

Also both DAV_COLLECTION_PROPERTIES and DAV_PRINCIPAL_PROPERTIES are defined in RefreshCollectionsWorker, but only used in CollectionListRefresher. Should probably moved.

True. I did that now, renamed them both with camelcase and made all properties private.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Homesets are not "personal" when they are detected two times (1x personal, 1x non-personal)
3 participants