You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
indexing the 4MB haystack takes ~1400ms. after that, giving it a quick try with the term "twilight", the results are missing or mis-ranking many exact matches? or perhaps i'm just holding it wrong 😅 ?
Hi @leeoniya, thank you very much for your input and for adding this project to the demo / benchmark. Your work is really nice! I have added a comment to your commit with minor suggestions.
It looks like we have chosen very different implementations. The top 10 results are very similar, except that the term twilit shows up on rank 8 on the right hand side. The other matches shown on the left hand side are indeed ranked very low in my implementation. E.g. the query twilight matches Twilight Phenomena: The Lodgers of House 13 Collector's Edition only with a quality of 0.13.
The reason for this is that the quality increases linearly with the number of common n-grams and the word twilight is only a small part of the full term. The best use case for my library are terms that have 1-3 words, e.g. names of places or persons. For terms with more words your library is clearly doing better for exact matches!
I will think about how to improve the library in this respect. Thank you again for your input!
hey @m31coding!
great to see someone else working on a better fuzzy search :) i found this project on HN.
i've added it to the uFuzzy demo/bench in leeoniya/uFuzzy@5ebe1ba.
indexing the 4MB haystack takes ~1400ms. after that, giving it a quick try with the term "twilight", the results are missing or mis-ranking many exact matches? or perhaps i'm just holding it wrong 😅 ?
cheers! 🍻
https://leeoniya.github.io/uFuzzy/demos/compare.html?libs=uFuzzy,fuzzy-search&search=twilight
The text was updated successfully, but these errors were encountered: