-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WGS-whole-region-deleted branch error #495
Comments
Hi @mbosm, Thanks for using CRISPResso! And sorry to hear that you are having trouble. I have merged the master branch into the Also, do you know if the latest release v2.3.1 still excludes the larger deletions? Thanks, |
Cole, I tried the https://github.com/edilytics/CRISPResso2/tree/wgs-whole-region-deleted-v2.3.1 branch as you suggested, and it threw the same error. I tried master-branch v2.3.1 a couple days ago and it worked fine, but did not incorporate the larger deletions. This is from the v2.2.14, but shows the trend... I have five potential cut sites, all near each other on the genome. One 1000bp amplicon covers all of them. The third one, target_3290, is the cas12 guide in use on this sample. Because all five sites are on the same amplicon, if all reads are being processed, there should be approximately 40,000 reads at each site. Roughly a quarter are being thrown out. |
Thanks for trying the other branch, would you mind rerunning with the |
|
If it helps at all, I don't really know my way around python, but I looked at the code where it was erroring...
...and looked at what was being processed for my reads. Here's an example:
...and what it looks like is that the soft-clipping on the edges of the reads (where minimap2, the aligner I use, leaves off the barcodes and sequencing adapters) are throwing off the line, because the first and last base in the read is 'none', hence the inability to compare an integer to a non-integer. As an experiment, I took my aligned BAM file of reads, and ran it through a utility that cuts off all the soft clipping. This allowed the program to go past this point, because the first and last array items in positions were integers. However, it crashed again later in the run, with the following error:
...which is outside my area of knowledge. |
Thanks for the additional information and sorry for the delay in responding! As for the new error after you trim the soft-clipped reads, I have pushed a fix that will hopefully resolve it. Would you mind pulling the latest version from https://github.com/edilytics/CRISPResso2/tree/wgs-whole-region-deleted-v2.3.1 and see if that works? Thanks, |
Hello,
I was attempting to use the WGS-whole-region-deleted branch of of CRISPResso2 because I have some cas12-edit nanopore amplicon reads which are 1000bp in length and have frequent 150bp deletions, which are being excluded from the master branch of CRISPResso2
I created a new conda environment, downloaded the source code, and set it up with setup.py, according to the command in build.sh.
While it builds fine, when I run CRISPRessoWGS on the same pre-made bam files / bed files / fasta files that work with the master branch of CRISPResso2, I get the following error:
Looking at the analyzed regions folder, it has constructed .bam and .bam.bai files of appropriate size, but the .fastq.gz files are empty, 40 bytes.
I'm wondering if I did something wrong during the installation of the branch, or if I need to make some change to my input files. master-branch CRISPResso2 that works is version 2.2.14, while the WGS-whole-region-deleted branch reports 2.2.13, so I'd expect most input files and settings to be the same.
The text was updated successfully, but these errors were encountered: