Comments (13)
Thanks @alexomics. Very helpful. Just ~1,000 reads were sequenced without being unblocked; that's close to what I'd expect for 1.5% of the genome. I minimapped these reads to the genome and saw about 4x enrichment on chr12, so looks like it worked fine! I'll run a bigger test later today to verify ....
from readfish.
Hi - can you post your toml file here so we can have a look?
from readfish.
Could you give an example of the format that your targets are in? For entire contigs/chromosomes you can specify just the target name; for targets with coordinates you need to specify as contig,start,stop,strand
. See the target formats in of TOML.md
.
There is also a validator script that will let you know if your targets are recognised.
from readfish.
Hi Matt, Sorry for posting that prematurely. TOML is as follows: The validate script tells me all is OK.
[caller_settings]
config_name = "dna_r9.4.1_450bps_fast"
host = "127.0.0.1"
port = 5556
[conditions]
reference = "/home/kevbrick/genomes/hg38/Minimap2Index/genome.mmi"
[conditions.0]
name = "select_chr_12_start"
control = false
min_chunks = 0
max_chunks = 12
targets = ["chr12,1,50000000,+","chr12,1,50000000,-"]
single_on = "stop_receiving"
multi_on = "stop_receiving"
single_off = "unblock"
multi_off = "unblock"
no_seq = "proceed"
no_map = "proceed"
from readfish.
What is the output from ru_validate
?
from readfish.
ru_validate output:
😻 Looking good!
Generating experiment description - please be patient!
This experiment has 1 region on the flowcell
Using reference: /home/kevbrick/genomes/hg38/Minimap2Index/genome.mmi
Region 'select_chr_12_start' (control=False) has 1 target of which 1
are in the reference. Reads will be unblocked when classed as
single_off or multi_off; sequenced when classed as single_on or
multi_on; and polled for more data when classed as no_map or no_seq.
from readfish.
I see @alexomics is on - I'll let him take this one!
But can you confirm - are you running with our bulk file here?
from readfish.
OK, great. Yep. I'm running with your bulk fast5 file.
from readfish.
You are attempting to select 1.5% of the genome. How long are you running this test for? Are you seeing all reads being unblocked; are some reads being sequenced (no unblock sent)?
from readfish.
Yeah. I realise that a better test case would be to specify more of the genome. I ran this for ~2hr and get about 93,000 reads. How do I check if the reads are being unblocked / sequenced?
from readfish.
There are a couple of log files that you can check. The first is the chunk_log.log
(unless changed) and will be in the same directory read until was run from. You can run the following command to see how many of each mode was seen throughout the run. You are looking for single_on
or multi_on
which are the modes for when reads matched your range.
cut -f8 chunk_log.log | sort | uniq -c
Or in the run output directory (generated by MinKNOW) there will be a file called unblocked_read_ids.txt
this is where all the read ids that read until sent are logged. You can search for all read ids that are in the sequencing summary that an unblock signal was not sent for:
grep -vFf unblocked_read_ids.txt sequencing_summary.txt
This last command might be very intensive/take a long time depending on the size of the sequencing summary or the unblocked read ids files.
from readfish.
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.
from readfish.
This issue was closed because there has been no response for 5 days after becoming stale.
from readfish.
Related Issues (20)
- Readfish and boss-runs in 2024 HOT 3
- readfish with just depletion? HOT 4
- Not being able to finish the tutorial/test HOT 6
- Provide additional TOML files which provide examples of host depletion and other use cases for readfish.
- location of readfish output HOT 4
- toml_live handling issue
- Depletion setting and unexpected behavior HOT 2
- Question about Readfish basecalling performance for messy samples HOT 6
- Playback issue HOT 14
- Problem communicating with basecaller after MinKnow update HOT 10
- Detect MinKNOW version and warn. HOT 1
- Circular Binary Segmentation HOT 3
- Having only one region in toml caused issue HOT 13
- chore: update github workflow actions HOT 1
- Handshake failed with fatal error SSL_ERROR_SSL: error:1000007d:SSL routines:OPENSSL_internal:CERTIFICATE_VERIFY_FAILED HOT 4
- comparison between MinKNOW and Readfish HOT 4
- Error with flongle flow cell splitting HOT 5
- chore: update FAQ
- next release candidate checklist
- High count of no-barcode that map to my targets HOT 15
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from readfish.