Generate_msa --start-query for custom file #52
Dear EvoDiff authors,
Thank you for making the EvoDiff code available and for showing the many facets of single-sequence and MSA generation.
I had a closer look at generating MSAs from the query sequence and made some adjustments that I would like to contribute.
Changes
I kept the code as close to the original as possible, apart from using pathlib.Path in more places. The defaults are unchanged and should still produce the same output.
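For readers unfamiliar with the pattern, here is a minimal sketch of the kind of pathlib.Path usage meant above. It is not the code from this PR: the function names `resolve_output_dir` and `build_output_file`, the run-name argument, and the `.a3m` suffix are hypothetical and chosen only for illustration.

```python
from pathlib import Path


def resolve_output_dir(base_dir: str, run_name: str) -> Path:
    """Join path components with '/' instead of string concatenation.

    Hypothetical helper: creates the directory (and any parents) if needed.
    """
    out_dir = Path(base_dir) / run_name
    out_dir.mkdir(parents=True, exist_ok=True)
    return out_dir


def build_output_file(out_dir: Path, stem: str, suffix: str = ".a3m") -> Path:
    """Build an output file path inside out_dir (suffix is an assumption)."""
    return out_dir / f"{stem}{suffix}"
```

Accepting a plain string and converting it to a Path at the boundary lets existing command-line defaults stay as they are while the rest of the code works with Path objects.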
Tests
I ran it on my system with the default parameters for --start-query, changing only the input, output and batch size because I have not downloaded the whole OpenFold database. Every run produced an output, and I could follow the individual diffusion steps through to a final MSA. As expected, the result fell short of those reported in the pre-print, whose experiments the models were built for and which received far more effort.
Please let me know if you have any feedback or if further changes are needed :)