Sun 29 May 2011
dataset_100.dat
Summary
Basic Statistics
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per base GC content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Kmer Content
Basic Statistics
Measure | Value |
---|---|
Filename | dataset_100.dat |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 5000 |
Sequence length | 54 |
%GC | 43 |
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGGAAGAGCGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAAT | 7 | 0.13999999999999999 | Illumina Paired End PCR Primer 2 (96% over 33bp) |
CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTTCAGCAGGAATGCCGAGAC | 7 | 0.13999999999999999 | Illumina Paired End PCR Primer 2 (100% over 35bp) |
Kmer Content
Sequence | Count | Obs/Exp Overall | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CCCCC | 1110 | 9.557241 | 23.193876 | 44 |
AAAAA | 2130 | 4.1300144 | 5.698024 | 7 |
CTCCC | 530 | 3.5749238 | 11.56272 | 11 |
CCTCC | 510 | 3.440021 | 11.56272 | 14 |
CTGGG | 465 | 3.1575248 | 8.314476 | 45 |
TGGAA | 785 | 2.929684 | 6.397646 | 17 |
GGGGG | 335 | 2.9167063 | 10.660779 | 6 |
GAATG | 775 | 2.8923633 | 10.053445 | 20 |
CCCAG | 450 | 2.8820548 | 7.8420706 | 22 |
GTGTG | 520 | 2.7661672 | 6.513526 | 23 |
TGTGT | 660 | 2.744307 | 6.109576 | 27 |
GGGAG | 425 | 2.7401958 | 6.315732 | 6 |
ATGGA | 730 | 2.7244196 | 6.397646 | 20 |
GCTGG | 395 | 2.6821985 | 6.651581 | 16 |
GGAGG | 410 | 2.643483 | 7.894665 | 7 |
GGAAT | 700 | 2.612457 | 7.3115954 | 19 |
GCAGG | 400 | 2.5732677 | 14.17877 | 16 |
CCTGG | 370 | 2.5068476 | 6.6367774 | 7 |
CAGCC | 380 | 2.4337354 | 6.273657 | 4 |
TGGGA | 480 | 2.419065 | 7.40506 | 20 |
AATGG | 645 | 2.4071925 | 5.4836965 | 18 |
GCCTC | 355 | 2.3998654 | 6.622006 | 2 |
CCCTC | 355 | 2.394524 | 8.259085 | 13 |
CCAGC | 370 | 2.3696895 | 6.2736564 | 14 |
GGAAG | 495 | 2.3634295 | 14.031037 | 2 |
GGCTG | 345 | 2.3426795 | 9.97737 | 24 |
CAGGA | 490 | 2.3343499 | 13.99981 | 17 |
CCAGG | 360 | 2.3107867 | 6.2876506 | 11 |
CTGTG | 435 | 2.3088553 | 9.09864 | 44 |
GCCTG | 340 | 2.3035896 | 8.295971 | 5 |
TCCCA | 460 | 2.3028264 | 6.1297736 | 9 |
CTGCC | 340 | 2.2984629 | 8.277508 | 10 |
CAGCA | 480 | 2.2816207 | 10.47649 | 14 |
TTCCT | 545 | 2.256056 | 7.0961456 | 1 |
CTCCT | 415 | 2.1929073 | 6.4701333 | 15 |
TCCCC | 325 | 2.1921701 | 6.6072683 | 28 |
CAGGC | 335 | 2.1503155 | 6.2876506 | 14 |
CCCCA | 335 | 2.1407545 | 6.259694 | 20 |
TGGTG | 400 | 2.127821 | 6.513526 | 9 |
CCCCT | 315 | 2.1247187 | 6.607268 | 12 |
GGCAG | 330 | 2.122946 | 6.301676 | 16 |
CTCTG | 400 | 2.1183603 | 5.187652 | 16 |
TTCCA | 540 | 2.1177678 | 6.7228537 | 28 |
CAGAG | 440 | 2.0961509 | 5.833255 | 28 |
CCCTG | 310 | 2.095657 | 6.622006 | 24 |
AGGGA | 435 | 2.0769534 | 5.846266 | 1 |
CTCAG | 410 | 2.0570977 | 6.143447 | 24 |
CTTCC | 385 | 2.034384 | 6.4701333 | 17 |
AGCAG | 425 | 2.0246909 | 10.499857 | 15 |
AGGAA | 570 | 2.015381 | 10.390455 | 18 |
TCCTC | 380 | 2.0079634 | 5.1761065 | 1 |
TCAGC | 400 | 2.0069244 | 12.286894 | 44 |
AGAGC | 420 | 2.0008712 | 10.499858 | 5 |
TCCTG | 375 | 1.9859626 | 6.4845653 | 8 |
TCTCC | 375 | 1.9815428 | 5.1761065 | 8 |
CACCC | 310 | 1.9809967 | 6.259694 | 9 |
TGTGG | 370 | 1.9682345 | 5.210821 | 4 |
GATGG | 390 | 1.9654902 | 6.1708827 | 1 |
TGGCT | 370 | 1.9638538 | 9.098641 | 45 |
TCCCT | 370 | 1.9551222 | 5.1761065 | 37 |
CAGGG | 300 | 1.9299511 | 6.3016763 | 21 |
AAGAG | 545 | 1.9269869 | 9.524584 | 4 |
CAGAA | 545 | 1.9226985 | 6.0476103 | 6 |
CTGAG | 380 | 1.9108309 | 6.15715 | 11 |
TCTCT | 460 | 1.9041945 | 5.068676 | 8 |
GAGGC | 295 | 1.897785 | 6.301676 | 22 |
CCCAC | 295 | 1.885142 | 6.259694 | 4 |
CCTGC | 275 | 1.8590509 | 9.93301 | 6 |
CCTGT | 350 | 1.8535651 | 6.4845653 | 6 |
TTTCC | 445 | 1.8421009 | 5.0686755 | 14 |
TGCCT | 345 | 1.8270856 | 5.187652 | 12 |
CACAC | 380 | 1.8022629 | 5.8073177 | 17 |
CCTCT | 340 | 1.7965988 | 7.7641597 | 2 |
TTCCC | 340 | 1.7965986 | 6.470133 | 18 |
GAAGA | 500 | 1.767878 | 7.792842 | 26 |
CTGGC | 260 | 1.7615684 | 6.636777 | 8 |
TTCAG | 440 | 1.7294377 | 9.6255 | 12 |
CTCTC | 325 | 1.717337 | 5.1761065 | 3 |
GTGGT | 320 | 1.7022567 | 5.2108207 | 24 |
GGGTG | 250 | 1.7013805 | 8.333022 | 28 |
TGCTT | 410 | 1.7010024 | 5.0799813 | 4 |
CTTTG | 405 | 1.6802584 | 5.0799813 | 2 |
AGAAG | 475 | 1.6794841 | 5.1952276 | 22 |
ACTCC | 330 | 1.6520276 | 6.1297736 | 10 |
CTGCT | 310 | 1.6417291 | 6.4845653 | 3 |
GCCCC | 190 | 1.6395733 | 6.339712 | 37 |
AGTGG | 325 | 1.6379085 | 6.1708827 | 3 |
TGGCA | 325 | 1.6342632 | 6.1571493 | 16 |
GGCTC | 240 | 1.626063 | 6.636776 | 25 |
ACACA | 460 | 1.6192161 | 5.1721287 | 14 |
GCTTT | 390 | 1.6180266 | 5.0799813 | 5 |
TTCTG | 390 | 1.6180266 | 5.0799813 | 3 |
TCCTT | 390 | 1.6144258 | 6.0824113 | 16 |
CCCAA | 335 | 1.588837 | 5.8073177 | 10 |
GCACA | 330 | 1.5686142 | 5.8202715 | 18 |
GTTCA | 395 | 1.5525635 | 7.700401 | 11 |
TCTTC | 375 | 1.5523325 | 5.068676 | 4 |
AGGCA | 325 | 1.5482932 | 5.8332543 | 6 |
GGGGA | 240 | 1.5474048 | 6.3157325 | 7 |
CCTTC | 290 | 1.5323931 | 6.4701333 | 17 |
ATGCC | 305 | 1.5302798 | 12.286893 | 22 |
GAAGG | 320 | 1.5278738 | 5.8462653 | 8 |
GGGAT | 295 | 1.4867171 | 7.40506 | 21 |
TGTGC | 280 | 1.4861598 | 5.199224 | 7 |
AACAG | 420 | 1.4817125 | 5.1836653 | 10 |
TCTTT | 455 | 1.4755235 | 5.5590916 | 1 |
TTGGC | 275 | 1.4596211 | 5.199223 | 15 |
GAGAT | 385 | 1.4368514 | 6.397646 | 27 |
TTTGG | 345 | 1.4345239 | 6.1095753 | 3 |
GGTTC | 270 | 1.4330826 | 10.398446 | 10 |
CCTTG | 265 | 1.4034137 | 5.187652 | 18 |
CCCTT | 265 | 1.4002901 | 5.176106 | 14 |
CTGTT | 335 | 1.3898433 | 5.0799813 | 10 |
CCTTT | 335 | 1.3867503 | 5.068676 | 8 |
GGCAC | 215 | 1.3800533 | 6.2876506 | 17 |
TTGAA | 465 | 1.3564935 | 6.4295163 | 7 |
CTTGG | 255 | 1.3534669 | 5.1992235 | 18 |
GTGTT | 325 | 1.3513632 | 5.091313 | 28 |
GGCCC | 155 | 1.3405302 | 6.353853 | 2 |
TGGTT | 320 | 1.330573 | 5.091313 | 25 |
TTCTT | 410 | 1.3295923 | 7.941558 | 4 |
GGGGT | 195 | 1.327077 | 8.333022 | 45 |
AAGGA | 375 | 1.3259085 | 5.1952276 | 44 |
CAGAT | 350 | 1.3033215 | 5.471493 | 10 |
CTGGT | 245 | 1.3003898 | 5.1992235 | 12 |
TGTCC | 245 | 1.2974956 | 5.187652 | 21 |
GCTCC | 190 | 1.2844352 | 8.277508 | 26 |
GACAG | 265 | 1.2624544 | 5.833254 | 27 |
GAGCG | 195 | 1.2544682 | 12.603353 | 6 |
TTGGG | 235 | 1.2500948 | 6.5135255 | 6 |
AATGC | 335 | 1.2474647 | 10.031068 | 21 |
GGCTT | 235 | 1.2473125 | 5.199223 | 4 |
TTTGC | 295 | 1.223892 | 5.0799813 | 9 |
GGTGT | 230 | 1.223497 | 5.21082 | 27 |
GATCG | 235 | 1.181698 | 6.1571493 | 45 |
AGCTC | 225 | 1.1288949 | 6.1434464 | 25 |
ACCCT | 225 | 1.1263824 | 6.129773 | 10 |
CGGAA | 235 | 1.119535 | 9.333206 | 1 |
CCCAT | 220 | 1.1013516 | 6.129773 | 21 |
TCTTG | 265 | 1.0994283 | 5.0799813 | 21 |
ATGCA | 295 | 1.0985136 | 5.4714913 | 37 |
GCTTG | 205 | 1.0880812 | 5.199223 | 2 |
AGCGG | 165 | 1.061473 | 12.603352 | 7 |
AACCA | 300 | 1.0560105 | 5.1721287 | 22 |
AACCT | 280 | 1.0403365 | 5.4593143 | 13 |
CCGAG | 160 | 1.0270164 | 12.575301 | 25 |
TTGCC | 190 | 1.0062209 | 5.1876516 | 3 |
ACTCA | 265 | 0.9846043 | 5.4593153 | 11 |
CTCAT | 250 | 0.98044825 | 5.7624474 | 4 |
GTTCC | 185 | 0.9797416 | 6.4845653 | 27 |
GAACC | 205 | 0.97444224 | 5.820272 | 25 |
GGCGG | 110 | 0.9555928 | 10.637051 | 44 |
GCGGT | 130 | 0.8827489 | 13.303162 | 8 |
GCGGG | 100 | 0.86872077 | 6.3822303 | 45 |
TGCCG | 125 | 0.846908 | 14.93275 | 23 |
AGGCG | 120 | 0.7719804 | 6.3016763 | 15 |
CGGTT | 140 | 0.74307984 | 10.398446 | 9 |
GCCGA | 115 | 0.738168 | 12.575301 | 24 |
CGAGA | 130 | 0.61931723 | 9.333206 | 26 |
CGGGA | 85 | 0.5468194 | 6.301676 | 24 |
CACGG | 85 | 0.54560244 | 6.2876506 | 37 |