Basic Statistics
| Measure | Value |
|---|---|
| Filename | Nf14.r_1.fq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 20755188 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 50 |
| %GC | 24 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCGCTAATCTCGTATGC | 562175 | 2.7085998931929693 | TruSeq Adapter, Index 16 (97% over 36bp) |
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCGCTAACCTCGTATGC | 35789 | 0.17243399577975396 | TruSeq Adapter, Index 16 (97% over 36bp) |
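
The Percentage column above appears to be each sequence's Count divided by Total Sequences from Basic Statistics, times 100. The following is a minimal sketch for checking that relationship and for locating where the two reported adapter sequences differ. The helper names `percentage_of_total` and `mismatch_positions` are hypothetical and are not part of FastQC; the constants are copied from the tables in this report.

```python
# Values copied from this report's tables (not recomputed from the FASTQ file).
TOTAL_SEQUENCES = 20755188

SEQ_A = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCGCTAATCTCGTATGC"
SEQ_B = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACGCGCTAACCTCGTATGC"

COUNTS = {SEQ_A: 562175, SEQ_B: 35789}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed definition of the column)."""
    return 100.0 * count / total


def mismatch_positions(a, b):
    """Return the 1-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    for seq, count in COUNTS.items():
        print(f"{seq}\t{count}\t{percentage_of_total(count):.4f}%")
    print("Mismatch positions:", mismatch_positions(SEQ_A, SEQ_B))
```

Under that assumed definition, the computed percentages agree with the reported values, and the two sequences differ at a single base (position 41), which is consistent with both rows matching the same adapter source.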
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GCTCGGA | 140 | 0.0 | 44.04218 | 1 |
| TCCCTAT | 20 | 7.8596314E-4 | 43.99995 | 44 |
| GCGCAAA | 20 | 7.8602845E-4 | 43.99921 | 34 |
| CGCTCAC | 20 | 7.8602845E-4 | 43.99921 | 35 |
| CCCGTCC | 40 | 8.3255145E-9 | 43.999 | 26 |
| CCAGACA | 25 | 4.4460714E-5 | 43.998997 | 26 |
| CGTCCCG | 40 | 8.3255145E-9 | 43.99889 | 28 |
| CCGATCT | 20 | 7.860657E-4 | 43.998787 | 9 |
| AGCCCAC | 80 | 0.0 | 43.998787 | 10 |
| CCCACGT | 75 | 0.0 | 43.998787 | 12 |
| GCACCAC | 20 | 7.860657E-4 | 43.998787 | 11 |
| CACTCGT | 30 | 2.5302943E-6 | 43.998787 | 40 |
| AAGCCCG | 20 | 7.860657E-4 | 43.998787 | 39 |
| GAGCCCA | 85 | 0.0 | 43.998787 | 9 |
| GAGCCAC | 75 | 0.0 | 43.998787 | 9 |
| GAGCACC | 135 | 0.0 | 43.998787 | 9 |
| CACCCGT | 70 | 0.0 | 43.998783 | 12 |
| GATCGGA | 84480 | 0.0 | 43.7372 | 1 |
| CCCGCGC | 1170 | 0.0 | 43.623146 | 31 |
| TCCCGCG | 1135 | 0.0 | 43.611553 | 30 |