from juicer.
Here are a couple of them, with their FASTQ file pairs.
-bash-4.1$ gzip -cd X_1_AHJNHNAFXX.1009_NEXTSEQ-2017-04-25.fq.gz | head -n 30
@NS500348:170:HJNHNAFXX:1:11101:16680:1036 1:N:0:1
CTCTANAGAGAAGCATTCTCAGAAGCTTCATTGGGATGTTTCAATTGAAGTCACAGTGTTGAACAGTCCCTTTCA
+
AAAAA#EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAAEEE
@NS500348:170:HJNHNAFXX:1:11101:13757:1036 1:N:0:1
TCCCTNGCTTGTTTAAGCAAAATCAACCTTGTGCCATTCTGCACGACAACAGCTCTGCCCAGCAGTCACCAATCA
+
AAAAA#EEEEEEEEEEEEEEEEEEEEEAEAEEEEEEEEAEEEEEEEEAEEAEEEEEA<EEEEEEEEEEEEEEAEE
@NS500348:170:HJNHNAFXX:1:11101:5644:1036 1:N:0:1
CCTCANGACGTAGCACCCTCCCAGCATATTGGTGTCTATTGGGTTAGCTCACCTAAGCTTTGAGTATCCAGTTTT
+
AAAAA#EEEEEEEEAEEEEEEEEEEEEEEEEEEEAEEEEEEEEEEEEEEEEEEEEEEEEAEEEEEEEE<EEEEEE
@NS500348:170:HJNHNAFXX:1:11101:16309:1036 1:N:0:1
ATAACNCTCCTTGCACACAAATCTCCAGCTCAGAGTTTTGTTCCCAGGGAAGTCATCCTATGACACCTGCTATCA
+
AAAAA#EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEEAEEEE6<EEAEEEAAEEEEEAAEEEE
@NS500348:170:HJNHNAFXX:1:11101:15987:1036 1:N:0:1
GAGTANGTGACTTTAATTTTTCAGCTGTGCCAACAGAAAAAGTACTTCAACTAAGAATTTTTAAAAATTATTTCA
+
AAAAA#EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
@NS500348:170:HJNHNAFXX:1:11101:12977:1036 1:N:0:1
GCGAANTGTAATGAGCACAAGAGCAAAGCAGATGTTTTAAAGAAACACTTTATAAACCTTTCCTCAACTTATCTC
+
AAAAA#EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE6EEEEEEEEEEEEEEEEEEEEEAEEEEEEEA
@NS500348:170:HJNHNAFXX:1:11101:16344:1036 1:N:0:1
TTAATNAATAATATGTCTCTGTTTGCTTCGGTAGCTCTTCTGTCTCTTATACACATCTCCGAGCCCACGAGACCG
+
AAAAA#EEEAEAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE6EAEEEEEAAAEE</EEEEEE/AE
@NS500348:170:HJNHNAFXX:1:11101:19625:1036 1:N:0:1
TAGCANACACGAGAGAGATGTGAACTATGGAAAGGCCAGCAGGGACCATGGACTGTCTCTTATACACATCTCCGA
-bash-4.1$ gzip -cd X_2_AHJNHNAFXX.1009_NEXTSEQ-2017-04-25.fq.gz | head -n 30
@NS500348:170:HJNHNAFXX:1:11101:16680:1036 2:N:0:1
NNNNNNNNNGNGNNCNNNANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
#########E#E##E###E########################################################
@NS500348:170:HJNHNAFXX:1:11101:13757:1036 2:N:0:1
NNNNNNNNNGNCNNCNNNANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
#########E#A##E###/########################################################
@NS500348:170:HJNHNAFXX:1:11101:5644:1036 2:N:0:1
NNNNNNNNNTNTNNANNNANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
#########E#E##E###E########################################################
@NS500348:170:HJNHNAFXX:1:11101:16309:1036 2:N:0:1
NNNNNNNNNCNANNCNNNTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
#########E#E##E###E########################################################
@NS500348:170:HJNHNAFXX:1:11101:15987:1036 2:N:0:1
NNNNNNNNNTNCNNGNNNTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
#########E#E##E###E########################################################
@NS500348:170:HJNHNAFXX:1:11101:12977:1036 2:N:0:1
NNNNNNNNNANANNANNNCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
#########E#E##E###E########################################################
@NS500348:170:HJNHNAFXX:1:11101:16344:1036 2:N:0:1
NNNNNNNNNCNANNCNNNCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
#########E#E##E###A########################################################
@NS500348:170:HJNHNAFXX:1:11101:19625:1036 2:N:0:1
NNNNNNNNNCNTNNTNNNCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
-bash-4.1$ gzip -cd X_1_AHJY2LAFXX.1005_NEXTSEQ-2017-04-18.fq.gz | head -n 30
@NB501402:27:HJY2LAFXX:1:11101:12561:1061 1:N:0:1
GNAATTGCCTGAAGGTATAGACTTAGAAATTTAACATTTAAAAACATTTTCTTCATTTTCTAAGCTAGCTTTTTT
+
A#AAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEAEEEEEE
@NB501402:27:HJY2LAFXX:1:11101:16276:1061 1:N:0:1
GNGTAAGACCACAAAAGCACAGGAAACAAAAGCAAAATAGACAAATGGTATTATATCATGCTAAAAAGCTAGCTT
+
A#AAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEEEEEEEEEE
@NB501402:27:HJY2LAFXX:1:11101:19495:1061 1:N:0:1
GNGTTCTAGGGCAAGCCAAATCTTTCCAGATCAACAATGACAACTAGATCCATTAAGACACTGGCCTGTTTAAGT
+
A#AAAEEEEEEEEEEEEEEEEEEEEEEEEEEEE/EAEEE/AEEE/EEEEAEEEEEAE/EAEE/A<E<E/EEEEEE
@NB501402:27:HJY2LAFXX:1:11101:17883:1061 1:N:0:1
CNCATAGACAGCTCCTGAATCAATGACTCCAGCCTGGCTCAAGCTAGCTTCCTCTGGCCTCCTCCCTCATTAAGG
+
A#AAAAEEEEEEEEEEEEEEEEEAEAEEEEAEEEEEEEEEEEEEEEEEEEEEEEEEE<EEEEAEEEAEE//EE/E
@NB501402:27:HJY2LAFXX:1:11101:5852:1061 1:N:0:1
GNAACCTTCTGTGTGCCAGAATGTCAGGATAAGGGGGTCACGGTCTCTTCTTTCTCTATCCCCTGTCTCTTATAC
+
A#AAAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEEAEEEEEEEEEEEE
@NB501402:27:HJY2LAFXX:1:11101:9026:1061 1:N:0:1
GNGCAGGTAGAGGAGACCAGACAGTCCCAGGCTCAGTGGTAGAAGAGTCACCCAGGGCTACTCCAGCCCCTTCTC
+
A#AAAEEEEEAEEEEAEEEEEEEEEEEE/EEEEEEEEEEEEEEEEEEEE/EEE/EEEEAEEAEEAEEEEE<EEEA
@NB501402:27:HJY2LAFXX:1:11101:4690:1061 1:N:0:1
CNGTATCACTAGAGTTAAAATTATGAGCAACAGAAAACTGGTTCTCCATATCCTGGATGAGGGGCAGGGTTAGGG
+
A#AAAEEEEEEEEEEE6EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE/EEEEAEEEA
@NB501402:27:HJY2LAFXX:1:11101:10845:1061 1:N:0:1
CNATATGGAGGCAAACTTGAAAAACAAAAACTCAGTTTTGTTAAAATATGTGAGGAGGCAGCTTTAGGCCTGTCT
-bash-4.1$ gzip -cd X_2_AHJY2LAFXX.1005_NEXTSEQ-2017-04-18.fq.gz | head -n 30
@NB501402:27:HJY2LAFXX:1:11101:12561:1061 2:N:0:1
NNNNNNNAATTAAAATATATAAAAATTATTAAAAAATTAAAAAAAATTAAAATAATAATATAAGAAAAAATATTA
+
#######/E//////////////////A//A6///A//////E//////A/A//////////<//AA6A6/////
@NB501402:27:HJY2LAFXX:1:11101:16276:1061 2:N:0:1
NNNNNNNGGGTTTTGCTTATTTGAAAACTGTAAAAGGTACTCCAGTCTCCCCACATGGCAGCTTCAAGGCCCTGA
+
#######EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEEEEEEEEEEEEEEEEEEEA
@NB501402:27:HJY2LAFXX:1:11101:19495:1061 2:N:0:1
NNNNNNNAAGGAACTCCTTTAGAAGGAACTCCTTGACTAGAACAACTTAAACAGGCCAGTGTCTTAATGGATCTA
+
#######EEEEEEEEEEEEEEEEEEEEEEEAEEEEEEEEAAEEEEEEEEEEEEEA<AEEEEEEE/EEE/EEEE<E
@NB501402:27:HJY2LAFXX:1:11101:17883:1061 2:N:0:1
NNNNNNNACCTGCAGCTCGTCCCTTCTCCTTAATGAGGGAGGAGGCCAGAGGAAGCTAGCTTGAGCCAGGCTGGA
+
#######EEEEEEEEEEEEEEEEEEE/EEE/EEEEEEEEEEEEEE<EE/EEEAEEEAEEEEEEAAE6AEAEEEE/
@NB501402:27:HJY2LAFXX:1:11101:5852:1061 2:N:0:1
NNNNNNNAGAAAGAAGAGACCGTGACCCCCTTATCCTGACATTCTGGCACACAGAAGGTTCCCTGTCTCTTATAC
+
#######AEEEEEEEEEEEEEEEE/EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEA
@NB501402:27:HJY2LAFXX:1:11101:9026:1061 2:N:0:1
NNNNNNNCAGGGTGAGGCTGGGCAAGCGTGCTTCTGAGAAGGGGCTGGAGTAGCCCTGGGTGACTCTTCTACCAC
+
#######EEEEEEEEEEEEEEEEEAEEEEEEEEEEEEEEEEEEEEEEEEEEAEEAEEEEEEEAE<E/<E</EE/E
@NB501402:27:HJY2LAFXX:1:11101:4690:1061 2:N:0:1
NNNNNNNATATTAGTCAAGCTAGCTTATACCACCCATCCCTAACCCTGCCCCTCATCCAGGATATGGAGAACCAG
+
#######EEEEEEEEEE/EEEEEEEEEE/EEAEEEEEEEEE/6EEEEEEEEEE<E<EEEEEEAE/EEEE<//EAE
@NB501402:27:HJY2LAFXX:1:11101:10845:1061 2:N:0:1
NNNNNNNGCTGCCTCCTCACATATTTTAACAAAACTGAGTTTTTGTTTTTCAAGTTTGCCTCCATATAGCTGTCT
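The difference between the two runs above can be put into numbers by measuring the fraction of uncalled bases (N) per read in each file. The following is a minimal Python sketch, assuming standard four-line FASTQ records; the function names and the `max_reads` cutoff are illustrative and are not part of Juicer.

```python
import gzip
import sys
from itertools import islice


def read_fastq(lines):
    """Yield (header, sequence, quality) tuples from four-line FASTQ records."""
    it = iter(lines)
    while True:
        record = [line.rstrip("\n") for line in islice(it, 4)]
        if len(record) < 4:  # end of input, or a truncated final record
            return
        header, seq, _plus, qual = record
        yield header, seq, qual


def n_fraction(seq):
    """Fraction of bases in seq that are uncalled (N)."""
    return seq.upper().count("N") / len(seq) if seq else 0.0


def mean_n_fraction(lines, max_reads=10000):
    """Mean N fraction over the first max_reads records (assumed sample size)."""
    fractions = [
        n_fraction(seq) for _, seq, _ in islice(read_fastq(lines), max_reads)
    ]
    return sum(fractions) / len(fractions) if fractions else 0.0


if __name__ == "__main__":
    # Pass one or more gzipped FASTQ paths on the command line.
    for path in sys.argv[1:]:
        with gzip.open(path, "rt") as handle:
            print(path, f"{mean_n_fraction(handle):.3f}")
```

Run on both mates of a pair, a read 2 file like the first one shown above should report a value close to 1, while a healthy file should report a value close to 0.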
What else can I give you that might help? What else can I look into to discover the cause of this issue?
I'll give that a go and rerun. I have been looking through the documentation, and it's not clear where exactly the chimeric ambiguous reads end up. If something experimental is indeed causing this, it would be worthwhile to examine those reads and verify that everything is spaced as we expect and has the content we expect. Are they in abnormal.sam or in collisions.txt?
Fair enough. One more question. I'm trying to compare these to "normal paired" reads. What are the meanings of the columns in merged_nodups.txt so that I know what I'm looking at?
Thanks! This fixed things:
Sequenced Read Pairs: 159,166,775
Normal Paired: 145,519,585 (91.43%)
Chimeric Paired: 8,484,360 (5.33%)
Chimeric Ambiguous: 1,934,090 (1.22%)
Unmapped: 3,228,740 (2.03%)
Ligation Motif Present: 31,202,559 (19.60%)
Alignable (Normal+Chimeric Paired): 154,003,945 (96.76%)
Intra-fragment Reads: 78,985,214 (49.62% / 69.37%)
Below MAPQ Threshold: 7,553,693 (4.75% / 6.63%)
Hi-C Contacts: 27,315,825 (17.16% / 23.99%)
Ligation Motif Present: 18,987,324 (11.93% / 16.68%)
3' Bias (Long Range): 96% - 4%
Pair Type %(L-I-O-R): 25% - 25% - 25% - 25%
Inter-chromosomal: 3,237,353 (2.03% / 2.84%)
Intra-chromosomal: 24,078,472 (15.13% / 21.15%)
Short Range (<20Kb): 12,927,319 (8.12% / 11.35%)
Long Range (>20Kb): 11,151,153 (7.01% / 9.79%)
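As a sanity check, the counts above can be verified for internal consistency. The following is a small Python sketch using only the numbers reported in this output; the dictionary keys and helper names are made up for illustration, and only the percentages taken against the sequenced read pairs are recomputed, since the second denominator is not shown here.

```python
# Counts copied from the statistics above.
stats = {
    "sequenced": 159_166_775,
    "normal_paired": 145_519_585,
    "chimeric_paired": 8_484_360,
    "chimeric_ambiguous": 1_934_090,
    "unmapped": 3_228_740,
    "alignable": 154_003_945,
    "hic_contacts": 27_315_825,
    "inter_chromosomal": 3_237_353,
    "intra_chromosomal": 24_078_472,
    "short_range": 12_927_319,
    "long_range": 11_151_153,
}


def pct(part, whole):
    """Percentage of whole, rounded to two decimals as in the report."""
    return round(100 * part / whole, 2)


def failed_checks(s):
    """Return the names of the additive relationships that do not hold."""
    checks = {
        "read categories sum to sequenced": (
            s["normal_paired"] + s["chimeric_paired"]
            + s["chimeric_ambiguous"] + s["unmapped"] == s["sequenced"]
        ),
        "alignable = normal + chimeric paired": (
            s["normal_paired"] + s["chimeric_paired"] == s["alignable"]
        ),
        "contacts = inter + intra": (
            s["inter_chromosomal"] + s["intra_chromosomal"] == s["hic_contacts"]
        ),
        "intra = short + long": (
            s["short_range"] + s["long_range"] == s["intra_chromosomal"]
        ),
    }
    return [name for name, ok in checks.items() if not ok]


if __name__ == "__main__":
    print("failed checks:", failed_checks(stats))
    for key in ("normal_paired", "chimeric_paired", "unmapped", "alignable"):
        print(key, pct(stats[key], stats["sequenced"]))
```

All four relationships hold for the numbers in this report, and the recomputed percentages match the first value on each corresponding line.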