Computational Biology

(BIOSC 1540)

Jan 16, 2025

Lecture 02B

DNA sequencing

Methodology

Announcements

  • Assignment P01A is due tomorrow (Jan 17) by 11:59 pm
  • Assignment P01B will be released tomorrow (Jan 17) at 11:59 pm
  • CByte 01 will be released tomorrow (Jan 17) at 11:59 pm
  • Quiz 01 is in two weeks (Jan 28) and will cover lectures 02A to 03B

After today, you should have a better understanding of

Sequencing data formats

Databases

Previous lecture: Experimental side of DNA sequencing

1. Isolate and purify DNA

2. Prepare DNA library

3. Sequence

Where do we put all this data?

And how do we store it?

Genomic sequencing data is archived and stored in public databases

Centralized databases make genomic sequencing data accessible for research and analysis

These repositories support collaboration and transparency in science

Just imagine if the only copy of a genome sequence was stored on Bob's 10-year-old external hard drive

One sequencing run can be 100s of GB

National Center for Biotechnology Information (NCBI) is a key source of sequencing data

SRA is what we will use the most

Deposited data are given unique accession numbers

An accession number is a unique identifier assigned to a specific dataset, sequence, sample, or experiment in a public database

Experiment

SRX______ or ERX______

Run

SRR______ or ERR______

Run

SRR______ or ERR______

Experiment

SRX______ or ERX______

Run

SRR______ or ERR______

Run

SRR______ or ERR______

Experiment

SRX______ or ERX______

Run

SRR______ or ERR______

Run

SRR______ or ERR______

Project

SRP______ or ERP______ or PRJ______

Example experiment

Run ID: ERX384810 — Unique identifier for the experiment

Platform: Illumina HiSeq 2000 — Sequencing instrument used

A spot is a unit of data generated during high-throughput sequencing. It typically represents a single sequencing read or, in paired-end sequencing, a pair of reads from the sequencing instrument.

A base refers to a nucleotide (A, T, G, or C) in the sequence data. The total number of bases represents the cumulative length of all reads in a sequencing run.

Example run

After today, you should have a better understanding of

Sequencing data formats

FASTA files

FASTA files store nucleotide or protein sequences in a simple format

>ERR1197981.1/1
>crab_anapl ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN)
MDITIHNPLIRRPLFSWLAPSRIFDQIFGEHLQESELLPASPSLSPFLMR
SPIFRMPSWLETGLSEMRLEKDKFSVNLDVKHFSPEELKVKVLGDMVEIH
GKHEERQDEHGFIAREFNRKYRIPADVDPLTITSSLSLDGVLTVSAPRKQ
SDVPERSIPITREEKPAIAGAQRK

DNA

Protein

Consists of a header line starting with > followed by a descriptive identifier

Contains one or more lines of nucleotide

AGTTTGATCCAAAGCATGAGTGTTTACAATGTTTGAATACCTTATACAGT
TCTTATACATACTTTATAAATTATTTCCCAAGCTGTTTTGATACACACAC

or protein sequences

FASTA files help us use Python to analyze DNA sequences

Calculating Sequence Length

DNA_SEQ = "GAGCAATTTCGAGGATCGCTTGTTGGTATTACTCGGGCTTTTC"
seq_len = len(DNA_SEQ)  # 43

DNA_SEQ: The DNA sequence is stored as a string in Python

The len() function returns the number of characters in the string

Each character in the string corresponds to a nucleotide

In this context, it calculates the total number of nucleotides in the sequence

Counting Nucleotides with a Function

def count_nucleotides(seq):
    nucleotide_counts = {"A": 0, "T": 0, "G": 0, "C": 0}

    for nucleotide in seq:
        if nucleotide in nucleotide_counts:
            nucleotide_counts[nucleotide] += 1

    return nucleotide_counts

Functions in Python allow us to organize reusable blocks of code for specific tasks.

A dictionary is initialized with keys for the nucleotides (A, T, G, C) and values set to 0.

  • This structure allows for efficient updating of counts.

The updated dictionary with nucleotide counts is returned as the function's output.

count_nucleotides(seq) takes a string that represents a DNA sequence as input

Breaking Down the For Loop

def count_nucleotides(seq):
    nucleotide_counts = {"A": 0, "T": 0, "G": 0, "C": 0}

    for nucleotide in seq:
        if nucleotide in nucleotide_counts:
            nucleotide_counts[nucleotide] += 1

    return nucleotide_counts

Strings in Python are iterable, meaning you can process one character at a time using a for loop.

  • Here, the loop processes each nucleotide in the DNA sequence.

Conditional Check:

  • if nucleotide in nucleotide_counts ensures that only valid nucleotides (A, T, G, C) are processed.
  • Prevents errors from unexpected characters like N or other symbols.

Updating Dictionary Values:

  • nucleotide_counts[nucleotide] += 1 increments the count for the current nucleotide.
  • Each nucleotide's count is stored as the dictionary's value, which is updated dynamically during the loop.

Results and Their Significance

def count_nucleotides(seq):
    nucleotide_counts = {"A": 0, "T": 0, "G": 0, "C": 0}

    for nucleotide in seq:
        if nucleotide in nucleotide_counts:
            nucleotide_counts[nucleotide] += 1

    return nucleotide_counts
  
nuc_counts = count_nucleotides(DNA_SEQ)
# {'A': 7, 'T': 16, 'G': 12, 'C': 8}

Output: The dictionary shows the counts of each nucleotide in the sequence.

What is GC Content?

GC content is the percentage of guanine (G) and cytosine (C) nucleotides in a DNA sequence

Biological Relevance:

  • High GC content indicates stronger hydrogen bonding, leading to more stable DNA.
  • GC content variation is useful for species identification, genome analysis, and detecting contamination.

Python Function to Calculate GC Content

def get_gc_content(seq):
    nuc_counts = count_nucleotides(seq)
    gc_content = (nuc_counts["G"] + nuc_counts["C"]) / len(seq) * 100
    return gc_content

get_gc_content(seq) calculates the percentage of G and C nucleotides in a given sequence.

The function calls count_nucleotides(seq) to get a dictionary of nucleotide counts, demonstrating code reuse.

Percentage of G and C nucleotides are then calculated and returned

Breaking Down the GC Content Calculation

def get_gc_content(seq):
    nuc_counts = count_nucleotides(seq)
    gc_content = (nuc_counts["G"] + nuc_counts["C"]) / len(seq) * 100
    return gc_content

The nuc_counts dictionary contains the counts for each nucleotide.

  • Access specific counts using keys: nuc_counts["G"] and nuc_counts["C"].
  • Divide by len(seq) to calculate the proportion of G and C bases in the sequence.
  • Multiply by 100 to convert the proportion to a percentage.

Interpreting the Output

def get_gc_content(seq):
    nuc_counts = count_nucleotides(seq)
    gc_content = (nuc_counts["G"] + nuc_counts["C"]) / len(seq) * 100
    return gc_content
  
seq_gc_content = get_gc_content(DNA_SEQ)
# 46.51162790697674

The calculated GC content for the sequence DNA_SEQ is approximately 46.5%.

After today, you should have a better understanding of

Sequencing data formats

FASTQ files

Sequencing methods can introduce potential errors

Lagging and Leading Synthesis

Normal sequencing by synthesis (i.e., Illumina) has each cluster strand in sync to amplify signal

Errors can occur when synthesis gets out of sync

Lagging synthesis by failure to remove blocking fluorophore. 

Leading synthesis by addition of dNTP instead of ddNTP

Clean

Noisy

Densely packed flow cells can also cause signal overlap

Fluorescent signals from neighboring clusters can overlap

Signal-cross talk degrades quality

FASTQ files store sequencing reads and their quality scores

Combines sequence data (as in FASTA) with quality scores for each nucleotide.

Organized in a four-line structure for each read:

  1. Header: Contains the read identifier, starting with @.
  2. Sequence: The nucleotide sequence.
  3. Separator: A + line (optionally repeating the header).
  4. Quality Scores: Encoded in ASCII characters.
@HWI-M01876:76:000000000-AF16W:1:1101:10853:1000 1:N:0:CGTGACAGAT
NTGTACTTCATCCGAAACTCGTGCTCATCTCTGCTCAGATCGGAAGAGCACACGTCTGAACTCCAG
+
#8ABCFGGGFCEDCFGGGGGGGFFCGEFGGGGGGFGGGGGGGGDEFGGGGGGGGGGGGGGGGGFFF
@HWI-M01876:76:000000000-AF16W:1:1101:16471:1000 1:N:0:CGTGAACTTG
NTTCCAGATATTCGATGCATGTGCCGCTCCTGTCGGAGATCGGAAGAGCACACGTCTGAACTCCAG
+HWI-M01876:76:000000000-AF16W:1:1101:16471:1000 1:N:0:CGTGAACTTG
y8BCCGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGEGGGGFGGGGGGGGGGGGGGGG

Quality scores in FASTQ files represent sequencing confidence

Quality scores are associated with each base and indicate the probability of a sequencing error.

Encoded using the Phred quality score system:

  • Higher scores = Higher confidence = Lower error probability.
Dec  Char                           Dec  Char     Dec  Char     Dec  Char
---------                           ---------     ---------     ----------
  0  NUL (null)                      32  SPACE     64  @         96  `
  1  SOH (start of heading)          33  !         65  A         97  a
  2  STX (start of text)             34  "         66  B         98  b
  3  ETX (end of text)               35  #         67  C         99  c
  4  EOT (end of transmission)       36  $         68  D        100  d
  5  ENQ (enquiry)                   37  %         69  E        101  e
  6  ACK (acknowledge)               38  &         70  F        102  f
  7  BEL (bell)                      39  '         71  G        103  g
  8  BS  (backspace)                 40  (         72  H        104  h
  9  TAB (horizontal tab)            41  )         73  I        105  i
 10  LF  (NL line feed, new line)    42  *         74  J        106  j
 11  VT  (vertical tab)              43  +         75  K        107  k
 12  FF  (NP form feed, new page)    44  ,         76  L        108  l
 13  CR  (carriage return)           45  -         77  M        109  m
 14  SO  (shift out)                 46  .         78  N        110  n
 15  SI  (shift in)                  47  /         79  O        111  o
 16  DLE (data link escape)          48  0         80  P        112  p
 17  DC1 (device control 1)          49  1         81  Q        113  q
 18  DC2 (device control 2)          50  2         82  R        114  r
 19  DC3 (device control 3)          51  3         83  S        115  s
 20  DC4 (device control 4)          52  4         84  T        116  t
 21  NAK (negative acknowledge)      53  5         85  U        117  u
 22  SYN (synchronous idle)          54  6         86  V        118  v
 23  ETB (end of trans. block)       55  7         87  W        119  w
 24  CAN (cancel)                    56  8         88  X        120  x
 25  EM  (end of medium)             57  9         89  Y        121  y
 26  SUB (substitute)                58  :         90  Z        122  z
 27  ESC (escape)                    59  ;         91  [        123  {
 28  FS  (file separator)            60  <         92  \        124  |
 29  GS  (group separator)           61  =         93  ]        125  }
 30  RS  (record separator)          62  >         94  ^        126  ~
 31  US  (unit separator)            63  ?         95  _        127  DEL

Lowest quality:       !

Highest quality:     ~

Why subtract 33 from ASCII values in FASTQ?

Dec  Char                           Dec  Char     Dec  Char     Dec  Char
---------                           ---------     ---------     ----------
  0  NUL (null)                      32  SPACE     64  @         96  `
  1  SOH (start of heading)          33  !         65  A         97  a
  2  STX (start of text)             34  "         66  B         98  b
  3  ETX (end of text)               35  #         67  C         99  c
  4  EOT (end of transmission)       36  $         68  D        100  d
  5  ENQ (enquiry)                   37  %         69  E        101  e
  6  ACK (acknowledge)               38  &         70  F        102  f
  7  BEL (bell)                      39  '         71  G        103  g
  8  BS  (backspace)                 40  (         72  H        104  h
  9  TAB (horizontal tab)            41  )         73  I        105  i
 10  LF  (NL line feed, new line)    42  *         74  J        106  j
 11  VT  (vertical tab)              43  +         75  K        107  k
 12  FF  (NP form feed, new page)    44  ,         76  L        108  l
 13  CR  (carriage return)           45  -         77  M        109  m
 14  SO  (shift out)                 46  .         78  N        110  n
 15  SI  (shift in)                  47  /         79  O        111  o
 16  DLE (data link escape)          48  0         80  P        112  p
 17  DC1 (device control 1)          49  1         81  Q        113  q
 18  DC2 (device control 2)          50  2         82  R        114  r
 19  DC3 (device control 3)          51  3         83  S        115  s
 20  DC4 (device control 4)          52  4         84  T        116  t
 21  NAK (negative acknowledge)      53  5         85  U        117  u
 22  SYN (synchronous idle)          54  6         86  V        118  v
 23  ETB (end of trans. block)       55  7         87  W        119  w
 24  CAN (cancel)                    56  8         88  X        120  x
 25  EM  (end of medium)             57  9         89  Y        121  y
 26  SUB (substitute)                58  :         90  Z        122  z
 27  ESC (escape)                    59  ;         91  [        123  {
 28  FS  (file separator)            60  <         92  \        124  |
 29  GS  (group separator)           61  =         93  ]        125  }
 30  RS  (record separator)          62  >         94  ^        126  ~
 31  US  (unit separator)            63  ?         95  _        127  DEL

We are not able to see values below 33 on our screen, making it harder to work with

Why use ASCII encoding for quality scores?

We need to store millions upon millions of floats (e.g., 0.92829) per nucleotide

ASCII characters require ~1/4 the memory, making them more memory efficient

One million float32 values are about 3.8 MB

Seems small, but one E. coli genome is ~5 million base pairs, and our DNA library will have multiple copies of sequences

ASCII characters map directly to numerical quality scores

Dec  Char                           Dec  Char     Dec  Char     Dec  Char
---------                           ---------     ---------     ----------
  0  NUL (null)                      32  SPACE     64  @         96  `
  1  SOH (start of heading)          33  !         65  A         97  a
  2  STX (start of text)             34  "         66  B         98  b
  3  ETX (end of text)               35  #         67  C         99  c
  4  EOT (end of transmission)       36  $         68  D        100  d
  5  ENQ (enquiry)                   37  %         69  E        101  e
  6  ACK (acknowledge)               38  &         70  F        102  f
  7  BEL (bell)                      39  '         71  G        103  g
  8  BS  (backspace)                 40  (         72  H        104  h
  9  TAB (horizontal tab)            41  )         73  I        105  i
 10  LF  (NL line feed, new line)    42  *         74  J        106  j
 11  VT  (vertical tab)              43  +         75  K        107  k
 12  FF  (NP form feed, new page)    44  ,         76  L        108  l
 13  CR  (carriage return)           45  -         77  M        109  m
 14  SO  (shift out)                 46  .         78  N        110  n
 15  SI  (shift in)                  47  /         79  O        111  o
 16  DLE (data link escape)          48  0         80  P        112  p
 17  DC1 (device control 1)          49  1         81  Q        113  q
 18  DC2 (device control 2)          50  2         82  R        114  r
 19  DC3 (device control 3)          51  3         83  S        115  s
 20  DC4 (device control 4)          52  4         84  T        116  t
 21  NAK (negative acknowledge)      53  5         85  U        117  u
 22  SYN (synchronous idle)          54  6         86  V        118  v
 23  ETB (end of trans. block)       55  7         87  W        119  w
 24  CAN (cancel)                    56  8         88  X        120  x
 25  EM  (end of medium)             57  9         89  Y        121  y
 26  SUB (substitute)                58  :         90  Z        122  z
 27  ESC (escape)                    59  ;         91  [        123  {
 28  FS  (file separator)            60  <         92  \        124  |
 29  GS  (group separator)           61  =         93  ]        125  }
 30  RS  (record separator)          62  >         94  ^        126  ~
 31  US  (unit separator)            63  ?         95  _        127  DEL

Phred quality (Q) is the integer associated with the ASCII symbol

P=10(3333)/10=1.0P = 10^{-(33-33)/10} = 1.0
P = 10^{-(33-33)/10} = 1.0

Probability that an error occured

==
=
PP
P
10Q/1010^{-Q/10}
10^{-Q/10}
P=10(6333)/100.001P = 10^{-(63-33)/10} \approx 0.001
P = 10^{-(63-33)/10} \approx 0.001

!

Examples

?

Phred quality scores help identify reliable bases

Phred scores relate directly to confidence:

  • Q=20Q = 20Q=20: 99% accuracy (1 in 100 chance of error).
  • Q=30Q = 30Q=30: 99.9% accuracy (1 in 1,000 chance of error).
  • Q=40Q = 40Q=40: 99.99% accuracy (1 in 10,000 chance of error).

Bases with low Phred scores may need filtering to improve data quality

Sequencing runs store millions of FASTQ entries

@SRR14933407.1/1
TGTAGGTGTTACAATCTTGCCAGAAATCATGATGAAAAATATCAGCAAAGAACAATTTGAGTTTGAAAAAGTAGAAATTGATAATGAACCGCTGATTCGTTCGACATTTATGAGTTATGATCCGAGTATGTTGCAATTACCACAAGTTGATTCATTTGTAAATCTTATGACGAGCTTTGTTGAAGAACCAAAGGCGTAGTCTTAGACTAATTTAAGGTTAGTATTTAATTTTAAAGATCGGAAGAGCACAC
+
DDDDDIIIHIIIIIIHIIIIHIIIGIIIIIIIIIIHHHIIIHIIIIIIHIHHIGFIIIIIIHHIIIIIIIIHHIIIIE<DFHHIIIHHIEHCHIIIIIIIHHHHIIHHIIIIFIIIIIIIIIIIIIIIIIIHIIIIIIIIIHIIIIHIIHIIHHIIIIIIIIIIHIIHHIIGIIGIHIHHHHGIIIIIIIIHIHIIIIIIIGIIHHIIIHIHHG@GHGCGHHHHIIIHHIIH?HGCEGH?EHHIHEHIFH@
@SRR14933407.2/1
CTAGAAATATTGATTTATTGCATGTATAATGTTAAAAGTGCCCTTTTATAACGCTTACATATAAAAGCTTATTTAGGGAGAGGGATATTCAACAAGGGGGATTTGAAAATGATAGAACTTAATGCAATTACAACATTATGTTTAGCCTGTATACTTTACTTACTTGGTAAAGCTATCGTTAATCACGTTAATTTTTTAAAACGCATTTGTATACCAGCACCAGTGATTGGTGGCTTAATCTTTGCTATTTT
+
DDDDAIIHIIHIHIHIIIIIIIIIIIIIIIIIIFGHIIIIIIFIIIIIIIIIIIIIIHIGHHIGHIIIEHIIIHIHHIDHHHGIIIIIIIIHIHHHFHHHCEHHIIIIGHCHHHHIIIIIIIIIHIIIIIIIIIHIHIIIIHIHIIIIIIHIHHHIIHIHIIIIIIHIIIIIIIHHHIIHIGHHHIIIIIIIIHIIIHIIIIIHHGIHIIFHHGEHIIHEHIIIIHICHHHHC??HHAGEEHHHEHHIIII
@SRR14933407.3/1
NTTTGTTCTACCAGGAATTGGTGGTTTTTCATGAATATGCTTTGATACTTCTCCAATTCCAACGACAGATTGATTTTTCGTTCGATTATAAAAAATAATATTGTCGCCTTCTTCTAACTGAGTATAAAAATGATAACCATTACGTTTAATACCGTTGTACGTGTGCGTATAAATCGTATATTGGTTTCCAGGTTCAAATTCTTCAGTTTCAGCTAAAAAGAAATAACGTGGTATCTTAATTTCGCCTTTAC
+
#<<DDEEHIIGHICCECHHHHIIIIIIE?GEGCCHIIEFHIIECEEHGHHIHIHHHHCHIEFFEHHHDCHII?HHHHIEHHIHIGHHHIIHFFHD1GHEHGF1FHEHII=HIIHHCH?FHEHHHFHIIGFHGHHHIIIIGHHHHEHHIFEHHIIEF=E?FE@EHHEH0EHHIHEHDH@HHHEE/<CGFCGC?DEHIIHHIIIGC.F@GH?FEHH@FGFFEEFHHHEHHH.GA@CEHIIGHHIGGH-BA7A.
@SRR14933407.4/1
ATGAAAAATTAGCGCAAGCGAAACAAGATGCAATAGCAAATCTAGATACGTTGCGTGACTTAAATCAACCACAACGTGATGCGTTACGTAATCAAATCAATCAAGCACAAGCGTTAGCTACAGTTGAACAAACTAAACAAAATGCACAAAATGTTAATACAGCAATGAGTAACTTGAAACAAGGTATTGCAAATAAAGATACCGTCAAAGCAAGTGAAAACTATCACGATGCTGATGCCGATAAGCAAACA
+
DDBD<<FHE@GHICHIIIGIF<CHIGFEHHHHHH@HHFH?GGHHGHG@HIIH@HDHH0EHEGHFHIIHIGDEGDGHHHHECHHDGCHEDHGHEFHFHHHHHCH@DFEH1F?EEEEHHHHHHHID@HCCFHGHIIIHHHHHHFHE?GH?HGHIHHIICEHEHEEHIHFHHHIIHHHHIHHEEG/C<GE.;DHHHGEC@.C@HHEEBE.ABB.B.BHHEEEG@FHFHEEB7BBEE.6AAGEDE57@GEHH@EC
@SRR14933407.5/1
AAAATAAAACAAATGCTCACATAGTAGATTCATTTTATATCGTTAACTATTAACGTTTAAATTGATATGTCCCTATGTGAGTTTTTATTTTACAAATTTATACTATTTTAATCTCTGACTAAAGTTTTACATCGCTATCAGTATTATGTATGATTTATTTTAACAACTATAAATAAGCTTGAATTTTGTAACTAGTAAGTGTATAACAGTAATGAATACGAATGTTAGATATTCAATAAACTTAAAAGGGG
+
DDDDDHIHIIIIIIIIIIIIIIHHHIIIIIIIIIIHIIIHHIIIIIHIIIIIIIIDHGIHHIGHIIIIGIIIIIIHFHCHIGHIIIIIIIHHIIHIIIIHIGHIIIIIIIIGIIIHIGIIIIIIFIHHHIIIHIIIIIIIIIIIIIIICHHIIIIIHIIIEHHIIIIIGIIGHHHHHHHHEHIIIGHHIIIHGHHIIHHIIIHHHIIHIIGHHHEEHIIFIGIICEF/<GF@GHCHGHIHII?EEHIIIGE
@SRR14933407.6/1
AGATAAATATGATTACTTAACAGGCTTAGGTAATGTAAAAGAATTTTATAGACATTTAAATGAAATTTCACGAAAAGCTGAAAAAGAACATCAAAGTATCGCATTATTATTAATCGATATCGATGGATTTAAAGATGTCAATGATACCTACTCACACAAATCAGGTGATGCTGTATTAAAACAAATGTCTCAATTACTTAAAAACTATGTGCCGAATCAATTTAAAATTTTTAGAAATGGTGGCGAAGAGT
+
DDDDDIIIIIIHIIIIIIIIIIGHHHHIHIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIHIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIDHIIIHIIIIIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIIIGHHIIHIIIIIIIIIIIIHIIIIIIIIIIIHHIIIIIHHEGHIIIIIIIHIHIIIIIHIHHIGHIHIEHIIIGIIHHIIIIIIIII.
@SRR14933407.7/1
GGCACTTTATTGTAAGGTGTTTACAACCACGAAAACACATTTAAGTATTGCCAATATGAGCGATAACTATCTTATACTAACTGAGCACGTATAGAGAAAATTATTCTTTAAGACCTATTGCACGTAAATTAGACCGATCCGCATCTACTATCTTACGTGAAATTAGCAGAAATAATGTAAATAAGTTATATCAATCTGAAACGGCTCAAAAGAATTATGTAACTACAAGAAAACTTTGTGGTCATCCTACT
+
DDDDDIIGIIIHIIIIIIIIIIIIIIIIIGIIIIIIIIIIIIIIIIIIIIGIIEHIIHHHIIIIIIIIIIIGIIIIIIHIIIIIIIIIICEHHHIIIIIIIDHIIIIIIIIIIIIIIIIIIIIHIIIIIIIGFHIIHIIIIIIIIIIIIIGIGIIIIIIIIGIIHHHIIIIIIIIIIHHHHHIIIIHHIIIIIIIIHIGGHIIIIGHFHIIHHHIFHHHEHIIIIIIIIIIIHHHIIIHHDHFAFHII?G?
@SRR14933407.8/1
ATTAATAATCATTAACTTTTGCGAACCTTCTTTTAAATAAATAATGCTTCCTATTGTTTCCATTTATTTTCCTCCTTTGTCACCATCTCGATTTCGATATTTCTTCGCATTTCCAAGTCAATCATCTCTTCTTCCGCTTTACGAATGTCATGTTGAATTTGCTGTTGCTGCTCGTATAACGCTTCCGTTGAACGTCGTTTCGCAAAATGAAATTCATCGCGGAACTGTTCCATACGACTAGCCATATTGTT
+
0<DDDHIEE?GEEHHHE1DHDF?HH/CF1C1FFHIH@CHHHGHCDE1D1GCEHHHFH?HGE1E<F<C<<GECFHHIHGHIHHCCE@@CGC1C1EGCCEHHHCHDHHCHDGHFCCH11<<1@<1<G@CGCE?DG@G00EHHH?C1@GCF??G@DCC1<DEH1CHCGH1CH?@G10<CHH0CG?E=0F/<F/CCEH@C-CGHHCC.:-.DEE@..;;C@.9.?,,-7G.;FA..66..8-B5..8.9.BE7.;
@SRR14933407.9/1
TATTAGTAGCAGTGCTTGCGGATTTAGTAACTATGATTTTATTAACGGTCTATGGTGCAATCAATGGTCAAGGCGGCAGTACAATATGGTTAATAGGTATATTAGTTGTTTTCACAGCAATTTCATATATTTTAGGTGTTCAATTTAAAAGAATGTCATTTTTACAAAAATTGATGGATGGTACGACGCAAATCGGTATTCGTGCGGTGTTTGCATTAATAATATTATTAGTAGCCCTAGCAGAGGGAGTT
+
@D?DDHH1GHEHIGEGHE@FHD/C@G?HHIIIIGF<GGHHFHFEHIIIEHHEEEHHIGIHGIICHIIIIIEGHIIGGHIIIIIGHE@GEHHHHHHIIHHGIEHIIHIIIHHIIHGHHHHHHIHIIHIIHHHHIIIIHHIIHHHHIHIGIEEHHHIIIIIIIIIIIIEFEHICGEHEEEFH?GHIGHHDHIIIHIH=HHIIIE@EGC<.EAGA.AHHHIIIEFHHHI?@@BFHHHHIHFHCFEHGECHHC9.
@SRR14933407.10/1
GTAATTTTCTAGCTTCATCCATCGATAATTCAATGACTTTTTTAGCTCTAGGTGTATAATGCAATGTACCAACATGATCTTGACCATGTCCGATTAATTTTTCAACTTCTTCAATTACTTTATCTTCAGTGATATTAAAACTTTCTAATACTTTTGCAGCAATTCCTTCAGGTTCTTTCATTAACCCCAATAATAGGTGTTCTGTTCCTATATTTGAATGATTTAAACGAATTGCTTCTTCTTGTGCATGG
+
DDDBDGEEHGHE@FHCCHH@HCGHCFHI@EH?HHE1<<FEGHII??GHEHEHI@GHHHHEHEHHGEHCHFHHHH@@FHHIF?EEEHIHH1CEH=HHIIIHHG@HHGI?1CG?GHF1<CGFG1CG1FCGHEGEHCHHFEGHF@?GFEEFHFHIHHHH1FG@GFH?C@CEEHIF@G?HHIII1DH?D@G0CGCH<FG?<FFEHCH///<DGHHIGHHI?/</<C/F/FH@/D=GCC.8F.D..C7DDD.:.9.
@SRR14933407.11/1
TAACTGAAGGTATGGTATTAGCTATTGAACCGTTTATCTCATCAAATGCATCATTCGTTACAGAAGGTAAAAATGAATGGGCTTTTGAAACGAGCGATAAAAGTTTTGTTGCTCAAATTGAGCATACAGTTATCGTGACTAAGAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAAGAAGAAAGAAGCAATGCCAGAGGAACAAAC
+
DDBDAFHHHIFHFGHGCHIGHIIIIIIIIHHHHIHHIIIIGIGHIHHFIIIIIIHIIIIGIIHFHIIHIIHHHHHIHHHEHIEHIHHHEHHHHIIHHIIIIIIIIGHGHHGHHIIHHHIEHHIIIIIIHIIIIIEHIIIIHHIIGHIIIHHHIIHHHHIIIGHIIHFHHIIIHHHHHIIHHEEHIICFHHHIIIGHHH8CC.:;.::@FHHHHI<EH?E7,--;6.8.;.-.-8-----8-858.8--...
@SRR14933407.12/1
ACCAGCTGCATTAGAGGCAATTACAACCGAGAACGGGTTAATAGTTGAAAAGGTACTACCGACAGAGCTGGCAAGGAATATGGCACCAACAGAAACGATAGAATCGTATCCTAACGCTATAAATATAGGAACTAAAATCGGATAGAATGCGACAGCTTCTTCTTCAATACCACATAACGTCCCGCCGATAATCATTAGTATTGATACAAACACAATTAGCATAAATTCATGCCCTTTAGTTTTCTTTGTTA
+
DB@DD1FG?HHHGIIIIIIHIHIIIIHIIHIIHHIIIHHIHIGHHEHHHEHHIEHIIHI?EDEHIHHIIHHIHHIIGHHHCFHGF@EHIHHFHEHHI=GH@FHHIEHHIIIIIIIIIIIHIHGHHIIHH1DHHIIIIIIIIIIGIIGEHIIIIIHGIIGHGHIIIHIIIIIIIIIHHIIIHIIIIGH<EDEBGH?FHGBHEHICEHC@HHEHHHFHHGIFHH8F6AHHG@GHIIHHHHEHHEEHEHHIGG7
@SRR14933407.13/1
TTTGTCGCCATGGCATTTCTCTTTGATTTTCATCAAACCAGTGTATCAAATTTTCTTTAAAACTAGACTGCTGATACATTTATAAAACCCTTTCCTCACCAAAATTAATTGTCTTTACTCATAATGTTTTTATTGTACATTAAAATCATGGTTAGTATGTAAGTTAATTTAGTTATTTGCGAAATTGGATTATAATAGTATATATAATATTATGAAATGAGTGAACTGATATGGACACTGCAACACATATC
+
@DDD@HEE0EEHHGGHHHEHIGHHG<<<CFHHIE1DGHE@HHHHECFEFFHIIIHEHHHHIIHIIIHIIIIIFCHHIHHHEIIIIIIE=GEHIHHHHHIIIFHHIHFDHEFHHHIIHHEHECH?GHEHHHEEHHHEFHHHGG1GHGFHECGCHCGEHFEHIC1<CFCHEHEEEEGGHH1D?<EHHHHGIHCGHHHHI?FCEHHFGHHHHHHHIIF@F?GH?FEHE//DGGHCC/D?EEE<?CD.7C7CCCD
@SRR14933407.14/1
ATAAAAAATAAATATGATAAAACAGTTTGCGTTAGTAGTCATGTTAAAATACAAACAAATGTAACTCACATTTAATTTGTCATAATGGGAATGTGCGTTTAAAATAGATTTGTTCAAAGGAAAGTGGAGGTGCAATTTTGGCCAAGAAAAAAGTGATTTTTGAATGTATGGCTTGTGGTTATCAATCTCCTAAATGGATGGGGAAATGTCCTAATTGTGGCGCTTGGAATCAAATGGAGGAAATTGTTGAA
+
DDDDDIGIIHHIIIHIIIIIHIIHIIIIIIIIIIHIIIIIIHHFGHIIIIIHIHIIIIIIHIIEHIIIIIIHIHIIIIIIIIIIHIIIHHHIIIIIIIIHHEHHIIIGHHIIIHIIEGHIIHIFHHHHHHHHIIIIIHHICHHHCHHIIIIHHIHIIIIIIIIIIIIIIHHHIIHHHGHIHIIIHIIHIIIIIGHGG//GHHIIEDHHHHFFHHH.?HHHHIIHFHHIIIIHH8@EHHHEHFIHIIIEHHH
@SRR14933407.15/1
GTTCGAATCCCGTCTTCTGCTCCATTATTTTGCCGGGGTGGCGGAACTGGCAGACGCACAGGACTTAAAATCCTGCGGTGAGAGATCACCGTACCGGTTCGATTCCGGTCCTCGGCACCATTTTAGCGCCCGTAGCTCAATTGGACAGAGCGTTTGACTACGGATCAAGAGGTTATGGGTTCGACTCCTATCGGGCGCGCCATTTTTAAATTAATTGAATAACGGGAAGTAACTCAGCTTGGTAGAGCACT
+
D<D@D<FG@CFE1DFEHH?C1FEC1C1<<<F<1CC/?C<</1/C/E/1D1D10<FHHC?HFE@?@FCGHI1FEECEH/D/E?E1@F1FHHHH101<<ECED11<<C1/<CD1<EHHDHGGFEHHIHHIHHHDHDGGCC?FHIHI?00=DF??D@DE?DDHFECEHDFEEG@DBHHECE?::B?E::8BA?8/EDCHH<?H<-../.;B.;FH.AF@@?FHHCGE,.579AF.88-8GAGHID?FHFHA.B9
@SRR14933407.16/1
CTAATAATGATTGTTAAAGTGGTCGTCATACCTATAAATGGCAGTGTTCGATATTTAAACTGAATACCATAAGAAATAATTGCGACACCAACCGGGAACATCCAAGTGACCAACAATGTCGTCTTAATCATATCATCTGATACCGGTAGCAACACATGTACTAACAATCCCGCAACTAATGCTAATCCATAATGTAAACATAAATATTTAATAGTAGCAGGTATATACTTTCTTTCCAGAGTAAAATTCAA
+
D@DBDHHIFGHEEF11DCEFC1<GC/C<GFHHIHIIEGFE?GEHICG1DGCEHGFHHHIHEHHIIHCHCCGHHIIIEHHHIIF=HHIDHHEFHCHI?EHFDDFHHIIHHHIGHIH?GHHEGGHIHIIHIIIIIIIIIIGHIIIIHIIIIIHIIIIGIIIIIGEHHIIIHHFIHIIHIIIICHHHIIIIIHIGHIHHIIIHIIHHEFFAAGCHHHIIICHIFGHIHG?FHHHEHHH@A.BGHIIIIIIECC@
@SRR14933407.17/1
NAAGGCAACATGGTAGAAACTTCTAAAGCCATATTCTTTAGATTATATGAGTTTATGTAAATTATTTAACGATAATAGCAAATTTTCGGCATTTTTTCAATAACTGCTTAGGTAAACTTTTAATAGTTTTATAAATGTACAGAGACGTTTAATGCGAATGAAGACTTATAACTTAATAATGTAAGGGTTTTACTTTATAATAAGAAGGGTAAAAGATTGAATAAACAAATTTTTAAAGGGGCGACAACCAT
+
#<DDDIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHHIIIIIIIIIIIIIIIIIIHIIHHIHIIIIHIIIIIIIIIIIHIIHIIIIIIIIIIIIIIGFHHHIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIFIIHIIIIIIIIIIIIHIGIIIIIIIIIIIIIIIIIIIGIHHHHIIIIIIIHIIIIIHEHIHHHGHIHGHHIHHHHHIHHIHFIGEHIIIHEEDDHIIIIIII@
@SRR14933407.18/1
AATACTGATATCATTAAAAATGATGTTCGTATTCAAAAAGTTATCGATTTATTAAGAGATAACGCAAAGTTCGTTGAAGGAACTAAAGAAGATTAATCTTCATTAAATATTAAATTACAAAAATGAGTAGCAGATGCATAGCTTATGTATCTGCTACTATTCTTTAAGCAAAAAGTTTGTATGTTAATATGTTGCATTGTAATATCCAATCTAGTATAGTCTTTAACGAATAGGGGTGTAAAAAGAATGTT
+
DDDDDIIIHIIHHIIHIIIIHIIIIIHHIHHIGIIHIIIIHHHHIIIIIIIIIIIIIIGHIIHIIIIIIHIIIHHHHHIHIIIIHHHIIIIHHIIIIIIIIDHIHIIIIFIIIIGIIIIIIIIIIIIIIIIIGIHIIIIHIIIIIIIIHHIIIIIFFHIIHHIIIIIGIIHHHHIHHIHIIHIHIIIIIIIIGHEHIHHIIIGHIIIHHHHIIHHHFHHIIGHHIIHIIIEHHHHDEH,DA9B.GEHFHEH
@SRR14933407.19/1
CTCTACTTGCAATCGCATCACTGATGAAATTCAACAACTCTACTTTCTCTTGTTGTAACTTTTCATTGAAATGAATAATAACTGTTCTTTCATCGTCAACAGGACGCTCTTTGTAAATCCATTCTAATTCAACTAAATTATTATACGTTCTAGTACGCTTATACGGTTTAACTTCAACAAATCTGTCCATTTCTTTAAGCGTCATAGAACCTTTTTGCCATAAAGTTAGTAAAATTAAAATTTCTTCTCTA
+
DDDDDIIIIIIIIIIIIIIIIIHHHHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIIIIIHIHHHIHIIHIIIIIHIIHIHIIIHIIIIHIIIIGIIIHIIIIIIIIIIHIIIHIIIIIIHIIIIIIIHHIIIIIIIIIIIIIIHHIIIIIIHEHIIIHHHHIIIIIIIIIDFHIIIIHHHHHIIGHHHHHIIIIIIGHHHHIHIHHIIIIHIIIGIIIIIHEH.
@SRR14933407.20/1
NCAATTTTATGCATTACCAACCCTCCCGATCATGACATTCTTATTCTTCTTTAAATATAGTATACAATGTCACATTTAATTTAAAAAGTTCATATCAAGAAAGTAAATTGGCTGTAATAAAATTTTAATATACGACTTCTTTCTTCACTTATTAAGGCGAAATTTTATCTCAAATCATGTGCGCTATTTCAAATTGAATAATGCCACTGTCTCAACATGTGTTGTTTGTGGAAACATATCTACCGGTGTTA
+
#<<DDHHIHIHIIIGHIIIIIIIIIIIGHIIHIHIIIIIIIIIIGGIIIGHHIHIIIIIHIIIIIIGHHHHIIIHIIIGHFHIIIHHGFHEHIIIIIIIIIF@HHHHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHGHIIIHEHCGHIHEHHHIIHIIIEHHHIHEHHHEHIIHIIIIHIIIHHFFHHIIHIIHHHCHIIICHHHHHH?HH@GHHEGHFHHFHIH?GHIIIHIG5?EH.
@SRR14933407.21/1
NAACTTTGATAAAGATGTATATAGAAATAACACTGGATCACTGATTGAAACATATCAAATATTTTTAAACAAATTGGAGGATTTAAAATAATGAAAACAATTGAACTACATATCACATTACAACCACAAGTATTAGATACGCAAGGACAAACGCTTACTCGAGCTGTACATGATTTAGGTTATGCACAAGTGAATGATATTCGTGAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCT
+
#<<@DHIIGHIIIHHIHHHHGIIIHHHHHHHIHIIIIIIIIIIGIIIIIIIIIIIIIIHIHHIIIIIIIIIIHHHIIIIIHHIIFHHIIIIHIIIHIIIIIHHIIIIHIIIIGHHIIIIIIIHIIIIIIIGHIIIIIIIIIIIHHIIIIIGIIIHHIIIIHIIGHIHIIIIIHCEHHHEHHHIIIIIIIIIIHIIIIIIHHIGHHIIIIIIIHDEHIIHIHIIHIIIIHHHIEHIHE@GHDHDHHHEH7@.
@SRR14933407.22/1
TATCAGGATCAAAAGCACCTTCTTGAATAGCAGCTGCTAACCCATATGATTTAAATGTTGATCCAGGCTCATATGTGTTTTGATATAGATCATTTGCCCACTTTTTACCAAAATCTTTACCAGTTTCAGGATTAAATGTTGGTCGCTGACTGTATGCTAAAATTTCTCCAGTTTTGGCATCCATGACAACCGCAAATAAATCTTTCGGCTGGTATCTTTCAACCATGCCATCTAAAGCCTCTTCAACAAAT
+
ABDDDIE?DHHIIIHHIHEHHIHIHGIIFCHCHHIHEHGHHHIHIIHIIIIIIIIIHHE@FHEGHHGFGHFHHHHIIHHIHIHFHGIIHIIGHIIHFHHIIFHEHHIHIHFHHHIHIHECHCHHIIIIIIGHIIHIIHHC?HEHIIIIIIIIIGHIIHIIHIICCHHHIHGHIHIHHIHH0D0@@CF?HHGHIIDHFH<CCGFEHHI-C/-DFHCGFEGHEIHCCHEHHFG@.BFHEEEC..8BGHH?GH.
@SRR14933407.23/1
TCAATCAAAAAGGGGAAATTGCAGGGGTAGCACAACGTGAGTTTAAGCAATATTTTCCACAATCAGGTTGGGTTGAACATGATGCAAATGAAATTTGGACATCTGTGTTAGCTGTAATGACGGAAGTAATTAATGAAAATGATGTTAGAGCTGATCAAATTGCAGGTATCGGTATTACAAACCAACGTGAAACAACGGTTGTTTGGGACAAACATACTGGCCGCCCAATTTATCACGCAATTGTTTGGCAA
+
DDDDDIIIIIHIIIIIHIIIIGIIIIIIIIIIIIIIIIIIIHIIHGIIIIIIIIIIIIIIIIIIIIIF?GHIIGHHHIIIHIIIIIIIIHIIEHIIIIIIIFHHIIIIIIIIHIIIIHIIIIIIIIIIIIIIHIIIIIHIIIIIIIIIIIIIIIGIIHIIIIIIIIIIIHIIHHHIIIIIIIGIHIIHHIIHHIIIIIIIIIIIIIIIIIGIIIIGHIIHIIIIIIIEHIIIIGIHIII<HHIIIIIDHHH
@SRR14933407.24/1
NCAAATTTATCAGGTGCAAATGACGTAGTAGCACATGAGTTAACACATGGCGTGACACAAGAAACGGCGAATTTAGAGTATAAAGATCAATCTGGTGCGTTAAATGAAAGCTTTTCAGATGTTTTTGGATACTTTGTAGATGATGAGGATTTCTTGATGGGTGAAGATGTTTACACACCAGGAAAAGAGGGAGATGCTTTACGAAGCATGTCAAACCCAGAACAATTTGGTCAACCATCTCATATGAAAGA
+
#<DDDIIIIIHIIGIIIIIIIIIIIIIIIIII<<<DFHHGHGHHIIIIIIIIIHIIIIIIIIIIIIIIIIIIHHIGIHHGHIIIIIIIIIIIIIHIIIIIICGHHIHIIIIIIIIHIIIIIHHEHHIIHIIIIHGHIIIIIIIIIIIIHIIIIIIIHIHIIG=HHHIIHHHHIHIGHIHHIHEHHIIIHIGHIIIHIHGHHHEIIIIHHIHIGGHHGHHHHIIIIIIGIGHIHIHAEHGHIIIIFHHHICB
@SRR14933407.25/1
TAAATAACTGAAGCATGTTCGGTTTTAAAATGAGATTCAGCAATAATTTTCAATGTTTCTAATTTATTTCTTGCATCACCGTATGTGGTACTTTCTGATAGAACACCTTCTAAAATTTGTTGAACTCGATAATCTAAAAGTTTTAAGTCTTTATTGATGCATTGTTCGACACACTCTTCTTTGGTTAATGTGATTTGTTCCATAGTGTTCTCCTATTAAGATGTTTGTTTTTCTCCTAAAAACTTATTAAC
+
DDDDDIIHHIHIIIIHIIIIIICHCHGHIIIIIH@@HHIIIIIGIHIIHIHHHIHHIIIIGIHHIIGIHIIIIIHIIHIIFHIIIIIIIIIIIHIIIIIIIGHIHHIIIIIIIIIIIIIIIEGHEHIHEHIIIGHHGHHIIHHIIIGHIIIEHHIIGHEHIHHIIIHHHHHHI=1FHHHGIEEHHIIGFHH0ECHHHHGHEEHHEFHHH@?HHEEHHHFH@FHH/FHGHIIIEHHEHHID.7DHFHHGIE?
@SRR14933407.26/1
GAAATATTCGTCCAAATGTCGTTGTAGAAACAGATCGATTCGAATCAGCAGTTGGATTTGTTCATCTTGGCTTAGGTTACGCTATCATTCCGAGATTTTATTACCAATCATTTCACACGTCCAATTTAGAATATAAAAAAATTCGTCCAAACTTATGCCGAAAAATTTATATCAATTATCATAAAAAGCGTAAACACTCCGAACAAGTACATACATTCGTACAACAATGCCAGATCGGAAGAGCACACGTC
+
DDDDDHIIHHHHIIIHHIIIHIIIIHIFHHHHIIIIIHIIIIIHHGHHIHHIGHGEHGHIHIIIIIIGHHHIHIIHIIIIHIIHHIIGHHHHHGHIHIIIIDGHFHIHHHHHHHIIGIIIIIIIIIIIEHIIIIHHHHIIHIIIEGHHIIIIHIIHCEHHIHHHHEHHIHIIHIHIEHHHHIHEHHIHHIIIIHIIIIGHHIGIIHHHIIIGHEHGHGIIFEHHIGHIEGHHH@CHHHDHEEEHE@GHDH?
@SRR14933407.27/1
TAATTGTGTAGAAATATGAATTTCACTAAATGTTAATAACTTTGTGACGTTTTAGTTAACAGACTAATAAAAATTTGAAAATACTATATATAGTGGTATAACGTAATGAGTAGACACAATATATAGGAAGAAGGGGTAAAATGAATCAAATCGAAGAAGCATTAACGGGTTTGATTTCTAAAGATCCTGCTATTGTTAACGAAAATGCCAACAAAGATAGTGATACATTTTCAACAATGAGAGATTTAACA
+
DDDCDHHFHIHHIFHIIIIIIIIHHHHGGHHIIIIHHIFHIEHIEGHHHGHIEHHDHHHHGHHIHIHFHHIIIIEF@EHHHHHHIIIGHHICEHHHHHHIIGHHHHIIIIFHCHFHHHEHIHHHIGIIHHHHFGCECHEHHFHHEEHIIIGIIIGHHHIIHIIHHHICEHEHHC?@@HHHIHCGHHIH@HIIHH0DEGFH=EHH.GEHCEEHH/D@E/FHCGHEHHFHHEHIHIHEHHGE?GHHIHHGIIH
@SRR14933407.28/1
NACGTAGTATTGCGTTACAATTAGCAGAAGAAGGATATAATGTAGCAGTAAACTATGCAGGCAGCAAAGAGAAAGCTGAAGCAGTAGTCGAAGAAATCAAAGCTAAAGGTGTTGACAGTTTTGCGATTCAAGCAAATGTTGCCGATGCTGATGAAGTTAAAGCAATGATTAAAGAAGTAGTTAGCCAATTTGGTTCTTTAGATGTTTTAGTAAATAATGCAGGTATTACTCGCGATAATTTATTAATGCGT
+
#<DDDHHIIIIIIIHIIIIIIIHIIIHIIIIIIIIHIIIIIIHIIIIIIIIIIGHIIIHHIIIIHIHIIIIIIIIIHHIIHIII1FHHIIIIIIIIIIIIHFHHHHHIIHHIFIHIIIIIHIIGIIIIIIIIIIIHHIIIIIIIIIIIIIIIIIIIIGHIIIIIIIIIIIIIIIIIHCHHIEHIIIHIIGIIIHHIIIIIIICHIIIIIHGHHHHIEHHIIIIIIIHCHCHHI<?EHIIIFGHGHGEHIHB
@SRR14933407.29/1
TCCAATTACAACCTGGTGATATTATTATTACAAAAGGCCCTGTCATGTGGGGATTTTTTGGTCATTGTAGTATCGCGATTGATGATAAAACGATTTTGCAAATTGAGGGGCCTGGCGACAAACCAACTACACAATCCTTCGAATCTTTTAAATATAATTATGCAAGTGGCAAAAATGATTGGATGAAAGTTTATCGTTGCAGCTATCCTGGTGCAGGTAAAAAAGCGGCTGACTGGGTTAAGAAAAACTAT
+
ADDCBHIIIGIIIHHIHIGEHIIIHIIIIIHIIIIIIIHHIIIIIIIHHICH@EHIIICHIHHIHEHIIIIHHIIHICHIIHIHIHIGHIIIIFFHHIIIE1CGHCHHHIIIIIIIHIIIIIIIIIHHIIHHIEHIIIIIIIIIIIIGIIIHGGHIIIIIIEHHHIIIIII?HHIIIIIIHHHHIFGHHDGHIIIIIHCHHIIGHHGHFH?HGH?HHHHHIIIIHIHC<ECEHCGHGAGHEHGIIHEHEH.
@SRR14933407.30/1
ACAATTTTTTGTGTGAAGAGGAAGGAGTCTGTACCATGAATACGATAAGAAATAGTATTTGTTTAACGATAATAACTATGGTATTGTGTGGATTTTTATTCCCGTTAGCTATCACGTTAATTGGTCAAATATTTTTATCAACAAGCAAATGGTAGTTTAATTACGTATGATAATCGTATAGTTGGATCAAAATTGATTAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGC
+
DDDCDIIIHIIEFGHIIHHHHHIHGHHHIIIIIHIIIHIIIIIHIEGHIIIIIIIIIGEHIIHIIIIIIHHIIIIIHGEHHHHIIIHEFHHHHIHHIIIIHHHIHHEEHIIIIIII1FHHIIIIHHIIHGIIHIHIIHHIHHIHHEHIIHIHIHIIIIIHHIHIIFIFHHIHHGHHIHIGHII0FCHEHEFGHEHEF@GHHIFHHIGHHHHHIIIIIIHHIGHCGHHH@@BFHHHEHIHICCEHEEHHH@F
@SRR14933407.31/1
TTTTCATCTTATACTATGTGATTTTCCGTGTAGTAATCCAAGTATTTAACTTGAATACGATTGGTAGAGGTGAAAATGAATTAGTTGATCCAACAGTTGTAAAAGATAATATTGCTCCTGGTGAAAATGATATTAAACAAAGTAAATATCATAATCATGCTATACAAATATTAGAAGGTTTAGGTGGTCAAGAGAATATTGTTAATTTAACCAATTGTGCAACAAGGTTACGTCTAGAGTTAAAAGACACA
+
DDDDDIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIEHHIHHIIIIHHIIIIIIIIIHIHIIHIIIIIIIIIIIGHIFHHIHHIIIHHIHIIIIIIIIIIHIIIIIIIIIIIGHHIHHGHIHHIIHHHHHHEEHFHHH?HHIHHHHHHHHIEHHHHIII?HHHGHHHHIHIGEHH?FIGI?GHHIHIHHFHHHHEHHIIIIIGHIE@CC@/D@FGHIHEHHHCFHH.
@SRR14933407.32/1
TGCATTAGAACATTTAGATAATCAGCAACTACACGAATATTTAGCTCAATTTGACGAGGCTTCTGCCAAAAATATTCATCCAAACAATCGACAGAGAGTATTGCGCGCTATCGAATACTATTTTAAAACAAAAAAACTTTTAAGTAATCGTAAGAAAGTGCAACAATTTACTGAAAATTATGATACATTATTAATAGGGATTGAAATGTCGCGTAAAACATTATATTCAAGAATAAATAAACGTGTTGATA
+
DDDDDIIIIIIIIIIIIIIIHIIIIIIIGIIIIIIHIIIIIIIIHIHIIIIHIIIHIIIIIIIIIIIIHHIIIIIIIIIIIIIIIHIHIIHIIIIIIIIIIHHIIGIIIIIIIIHHIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIHIIIIIIIIIHIIIIIIIIIIIHIIIIIIIIIHIIIIIIGHHIIIIHIIIICHIIIIGIIHIEHIIIIIHHIHIHIIIIIIIIGGAGHIIIIIIIHHHHHIIHI.
@SRR14933407.33/1
CTGCTTTAGCAATGAGTATGATAAAGTTTGGCATGGATTCTTATGGACACTCACAATTACCGAGTGATGGCTTAAAACGTTTAAATCGTGTTGTTGAAAAGAATATTAATCAAAATATGTTCGTCACAATGTTTTATGGTTTATATGAAGAAATGAACCATTTATTATATTGTAGTTCAGCTGGTCATGAGCCTGGATATATTTATCGCGCTGAAAAAGAAGAGTTTGAAGAAATTTCAGTTAGAGGTAGA
+
DDDDDIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIHIIIIIIIIIIHIIIIIIHGHIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIHIIIHIIIIIIIIIIHIHIIHHIHHIIIHIIIIIIIIIIIIIIIIIIIGIHIIIIIIIIIIIIGHIIIIIIIIGHHIHIHGHIIIIIIIIGHHIIIIIIIFHHEHIIHIHIHIIHHH?
@SRR14933407.34/1
TGCTATAATTTTGAATTAGAAGTAATTGCGAAAAATGAAAATAACGATGTCGTTGGACACGTTTTATTAATTGAAGTAGAAATGAATAGTGATGATAAGACGTATTATGGTTTGGCGATTGCTTCTTTATCAGTCAACCCTGAATTGCGTGGACAAAAATTAGGTCGTGGCTTGGTTCAAGCAGTAGAAGAGCGTGCAAAAGCGCAAGAGTATAGTACGGTTGTTGTAGACCATTGTTTTGACTACTTTGA
+
DDDDDIIIIIIIIIIIIHIIIIHIIIIIIIIGIIIIIIIIIIIIIIIIIIIEHIIIIHIIHH??EHIIIIIIIHHHHIIGIHIIIIIIIHICHHHIIEHHH<DHIIIIIIIIIHIIIEHGIHHIIIIGIIIIIIHHHGHHIIIIIIIHHIIIIHHFHIIIIIGGHIHIIIHHIIIHHHIIIIHIIIIIHIIIIHIEHIHIEHIIIIGHCHHHHHHHHHIIIDHIIIIGHHHHG?HHF.GH?EHHHCFHII.
@SRR14933407.35/1
TACACCACCAGGCATTAAAGTTTCAGCAACCTTCATTGCTTCTTCTGACTTCGTATATCTCATAAAGTAAATTCCTCCTATAGTTTATGGAAAATCATACATATAAAACCTTATTTATCTAAATAGCGACAAATGTCCTTTGCAAAATACGTAATAATCATATCAGCACCTGCACGTTTCATTGAAACCATTTGTTCCATAACGACACGTTCTTCATCAATCCAACCATTTTGTGCCGCTGCTTTAGTCAT
+
DDDDDIIIIIGIIIIIIIIHIIIHIIIIIIIIIIGIIIIIIIIIHIIHIIIIIIIIIIIIIIIIHHIIIIIIIIIIIIIIIIIIIIIIIIIIIHHIHIIIIFHIIHHIIIIIIIIIIIIIIIIIIHHHIIIIIGIHHIIIIHIHHIIIIIIIIIHIGHHFHHIIIHHIIIIIIIFHHIHIHIIIIIIIIIIIIIIIIIHIHFHHHHIHHHGEHHEHHIIIHCHHIIIIIHIIIHHIHII,6CHIHGCHIGE
@SRR14933407.36/1
GACTAAGAATAATTGTTCTTGAAGTCTTTTCTTTAAATGATGTTCATTATATGAAGCTTCTAACAAGTGATTAACTGTTGTCGCAGCGTATATATTTAAGTATGTATTAAACCAAGCTTTAGTTGCGACATCTCTAATTTGAACAACATCTTTTTCAGTTGCTTGTCTTACCTTGAACATGACTTTCTCCCCTTATTAACAAGTTTTAATAACGGCATTATACCACAACTTGCTCAATACTTAATAAACAA
+
BADDCIIIIIHIIIIIIIIIHIIIIIIHIIHIIIIIGICHHHHHHIIIIIIIIIIHIGEHIHIIIIIIIHIIIIIIIIIIIIHGIIIIIIIIIIGIIIIII<FHHIIIIIIIIIHIIIIIIIIIIIIIIIIIIIIHIIIIGIIIIIIIIIIIIIIIHHIIIGIIGIIIIIIIIIIIIHHIHIHHHHHIIFH=GHHIHIIIGGHFHHHHIIIIIIDHDHIHHII@HHHHHIHCEGHIIGHHIHIHHHHHIIH
@SRR14933407.37/1
AAAATTGGGAAGATAGTTGTACGACTATCTATGAATAATTCTCTGTTTTCAGGTGGGAAAACTGTTAATACATTGTCGATATCAATATTTGATTCTGTTTGTTTAACTTCCATTAATCGTTCTGTATATTCACTTGGAATATGTCTTTCTTCCAACATTTGAATAATTCTTTGACTTTTTAATAATTCATTTAGACTAGATCCTAAAATTTTACCTCGACGCGATACAATAAATACATTTGTTACAGTTAC
+
DDDA?FHDEGHGHHHHIIFGCCFHIHHIHEF@<CGHHHHIHHHHH<DE@FC?1<<<?HHHIHEEHCH11CG@HHECHHHICCHHEHHIHE@1FCH@GHHH@DFGEHEHC@HIIEH@1FEGF11DHCCHHIGCEHHH?<@GHHE<<11@GHIEECHHHHHHC@<1C1<<@CEHCHCH1<FHHEHHICEE?<F@CG0<<0<0<FE0DGHE@/CFHFHHGC/EHC<-,.79;CHE.BGE66AGAHHEE..9B79
@SRR14933407.38/1
CCTATGAAGAACTTGTAAGTTTAATTTTAAAATTAAAAGATATTATTTGCTAGTAGGAATTCTGTATAATTAGTATATGTTAACAGTTAAATAAGTTATAATAGAGGAGTGTAGCACTTTGCTATATTCCTTTTTTATTTTTAGGAAGGGTGGGCTTTTATGAAAGATTTGACAATGTTATTACACGAATTAAAAGATATGTCCTTTTTTAATAAAGGAGATATATGTTTAATTGGATGTTCAACATCGGA
+
DDDDDIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHHFIIFHIIIIIIHIHIIHIHIIIIIHIHIHIIIFHIIHIIIHIIIIIIIIHHIHHIIIIIIIIIGHIIIHIIIIIIIIIIHHIIICHIIIIIIIIIIIHIIIIIIIHHIIEFEDGHHHHIGHIGIIIIIIIIIIHHIGGEHIIIIIIHIIIIIIIIIIIIIIIIIIICEHH?EHIIIG@GEHHGEG/<?.DGC..GHEHH..DDGEHHFGHFDA
@SRR14933407.39/1
TTTAACAACGGATCAATCGCGTAATACCGACCATCAATTTATCAACACACAAATTGCGACACTCTTTTTAATCGATATCGTGAGCTATCATTTATTAGAAAATACGAATCTGAGTCAAACTTATCAACATACTAAATCTATTATCCTAAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAAACAAAAGCAACAACCGTAGATATAG
+
DDDDDIIHIIIGHIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIHIIIHIIIIFHEHHDHIIIIIGIHHIIHHHIIIIIIIIIIHHHIGIIIIIHIIHHIIIIIHIIHHGHIIIIGHIIGIHHIIIIIGEHHIIIIHIIHHIIIIIIIHHIIGIGHHHIIIIIIIIIIIHIIIIHHGIIIIGHEHH<DDHHHCGHIGHIIEHECBE?HHIGCEFEEGHGHEE?HH776.B.-.-----8885,5-.---..;
@SRR14933407.40/1
GTCATCTTCTTAGGTATCACAGTTTGTGCTTTGTCATTATTTTCGTCATTTTCAACAGTAGCACTAATTAAATCATGTATTTTTTTATTAATTGTTTTAGGTGTAATACCATGTTTTTCATTATGTTTCATCTGTATTTCTCGACGACGTTGTGTCTCATCAATTGCATACTTCATCGAATCAGTCATTTTATCGGCATACATAATGACTTCACCTTTATCGTTACGCGCAGCTCTACCTATTGTTTGAAT
+
DDDDDHCHIHGHHIIHHHIIIHIHHHHIHHEHHHIHIIIIIIIHDEHIIHIHHEEHHHGHHHFHHE@GHIII?CGHHHHHH@DHHGHHH<DHEHHHH?G?CGEHHIIIHHIIIHHHHHIIHHHIIEE?GEGHI?GFIIHIHHIGHHIHHEECHIHIGIHGHEHHHIIIHFGHHHIIHHHEHHEHHIHGGHHHHE?CEHHHGHIHGH?ECG?HH?HFEHHFE.CFCG@G--EHIIIIHIG.8GH.;BF.B.B
@SRR14933407.41/1
CCATCTAATTCAGTAATTACATCACCTTTTTTCAGACCAGATTGATCTGCTAAACCATTGTTGTCAACTTGATCTACAACAACACCGTTCTTAACTTTTCCTAGCAATTTAACTGCTTGTCTTTCAAAACTATTTAGACTGGCAATATTCTTCATTTTAACACCTACATCGGGATAGTCAATTTTACCTTTTGTTTCTAATTCTTTTACAATCTTTTGTACTTCATTAACAGGTATTGCAAATGACATATT
+
D?@DDHIIHIIHIHFHHHIHHHHIIIIIHHHIEF@GEHII?GGFFHHGGHIEH?HHIEEHIHIIIHIIIIIHIIIIIIIIHEHIIHIHHHIHIIIHH?HHHGHHHHIIHHIECHHIHEHIHIIIGIICHHHHHIGHIHHHIGHCFHGHHHIIIIGHHIIIG?GHFFHGEHGDC/CGHDGHFEHHHHIIHHHEHHIGIIGIHHEHIHHIFHHEHIHHHFEHIIIHIEGFHHHHHHGF?.DC:FG..8.AGGH
@SRR14933407.42/1
ATTACCTCATATACTGGCCACATAAAAGGTTCTGCCTCCATGTATCGAGTACCAAACTCTAAGAAACCACTATAAGCTGCATGCGATGTGATAGTGTATTGCAAATCGCCAGTTTTTTTATATCTGATATTGCGTGATAAATTACCAGTCCAATAACCCTTATTCATTACTTCTCTAGCTTTCAATTTAGCTCGTACTACATATTCTTTGGCGTTTTCCTGTAAAATATCATCTACATCATCATCAATGTT
+
0DDBBGFEGHHHE1FH?H?<CHEHH?GGGHC@G<F@11<1<<111DGCGHEC@11<<<CF1CEH?CCHHH1F@C<DEFFG?CHE<0<<DFHIH@1<DF1<F<<1<<10<D@E@1F1</<FHCHFFCCFHHHGH=<0<<FDGEE<<11D<GCF@C@CH1<1<F1<0000D0D@<DF0<FG<0</C<C/<CG///<</<F//FCEHC</</D.CE.:-.CF?@HE....B97B.:@8.8B@?.8..;A.6.B9
@SRR14933407.43/1
CCCGAGCAATGCACCTCTTAAACAACATTATACACGAAAGGAGCATAAACAAATGAACACACTATAAAAAACAACCTTCCTCATCACAATGGCAGTTGCGACTTGGAAGGTTTGGAAGATTGAGAAAAACACAAGATTTAAACTTAGAAATTTTGATTATCCAAAAATTAATAATGCTCAGAGCAAATCATTGTTGGATATTGCTAGTAACGATTTAAAAGATATTTAACTGTATTCAAAATTTTCATATC
+
DAADDHCHHHIFHFHHGIIIHCHIIIIGIIGIHHHHCC0/DCGECHIIHHIHCHH?C@CGHH1<CG1DGE/CC@H=FHC@E1DC1DHH1FCHFI@1<<1/<<<EC<1<C@<C<EEHH@E?GHHFE@GHE=C<GHH@HHGECHHHHEG<CEHHH0CGHHHH?FHFDF<GEHHHIHFHFEH/G/GH?HHHHHFGH@C?C..:.CFEEHHH.:A987.;AE@H.-:BAHFEHHHHIHIIGHFHEA.AB.FH?7.
@SRR14933407.44/1
AGATCATTTATGGCGTGTAACAAGTTTAAATCAAGCCATTGCATTAGGTATCGATGATAGAAAAGGTAGTATTAAAGTAAATAAGGATGCAGATCTTGTTATTCTAGATGATGACATGAATGTAAAATCTACAATAAAACAAGGTAAGGTTCACACATTTAGCTAATAGATAATCATAATTAAATGTATGCAATAGATTTAATCTGTTAACATAAGCACTTTATATTATGATAAAATAGAGGCAATAACAT
+
@B@DDIFHHHHGIHHC?CHEHIHC1GHHHHGFHH??CHEC@EG11<FHFF1CGC1DGHHHHIEHFIHHIHIIEHHEH?GHHCHHF?GHIIIHHIHGHHIIIIHIIHHIIHIIIIIIHHHIHHHHHHHIFHHHHHHIHHHIIIIIHGIIIH@GGHIIIGHHIIGHHHFH?GEHIIHHHIGIHHHHEEEHE@GC1CHHF?GHIGHFHHHHHEHHIIIFHHEHH@HGHIFH?CEHIIGHH?CCH/<:FHHHIG@
@SRR14933407.45/1
AGCTCTAACTACTGCATCTGAAACCAGGTCAACCGCTTTATCTTGACCTACAACACGTTTATGCAAGATATCACTTAAGTGAAGTAATTTTTCACGTTCTGTTTCAACTAATTTTGAAACTGGTATGCCTGTCCATTGGCTGACAATATCACCAATTTCTTCGTCTGTTACAACTTCACGAATCATTCGATCTGAATCTTCACCTTGCTCATCTTGGAAATTATCCTCTAATTCTCTAAGCTCTTTTTCCA
+
?DADDHCCHIHHHIIIHIGIIGIHIIHIHEEHHIIH?=GHHHHIHFHIGHHHEHHHHHHHHIGIHHIIIHIIHIHIHIIGHHHIIIHHIIIIIIIIHIIIIHIIIIIIIIIIIGHHIIIIIIIIIIIIIIHIIIIIIIIIIIIIHIIIIIIIIIHIHIIIIIIIIIIIIIIHIHHIIHIHIIIIIIHIIIGICHHHFHFHIIIIIIIGHHIGHICEHC@GHHIGHHIHIEHFHHHHIGIIIHHIIHIIHH.
@SRR14933407.46/1
ATAAGAATTCAACTGATTTATCAGTTGTTGAAGCACCTGCTTTTAACATATCTAAGCCGTACTTTTGAATATCAACTTTAGCAATATCTTTTAATTCTTCAGCTGCTTTAACATCTTGTTGTGTACATGTTGGTGATTTGAAAAGTAAGCTATCTGAAATAATTGCTGATAACATTAAACCAGCAATTTCAGGTTTAATTTCAAAGCCACGTTCTCTAAACATTTTGTATAAAATTGTAGCTGTACAACCA
+
DDDDDIIIIIIIIIIIIIFHIHHHHHIFHGIIIIHIIIGHEHIIIIIIIIIIIIIFIHDHIIIIIIIIIIIEHHHIIGFHHIIHHIGHHIIIIHIIGIIHHGHHIIHIIIIIHIIIIHHIHHEHHIIIGHHHFH?HEHHHHIGIIHHHIIGEHIHIIIHIHDGHHHIIIIHICHHIIIHHIIIHHHIFGHG?GHIIHEHFHHEECHH@GH.DEHIIIHHHHIFC</DGG@HHHFHHIHHCHGHEHHHHHHG
@SRR14933407.47/1
ATGACCGCAACGATACCTAATCCCGCTTCCATACTCTCACGAAGCATACCGCTATTCCCTAGTGTTTCTACGAATGTAATTGCTAAAACGATACTCAATACAAGTCCGGCAATTGCACCACCAATCACACTTGCAGTACCCTTCTTATCTTTGACATTACGCGTCATAGTAGTCAATGTCATTACAATTAATAACACTTCTAGTCCTTCACGTAAAAAGATAATCATCACATCGACGAAGCTATAACTATG
+
D@@BBHDHHIIIIIHHHEIHIIIEHHCCGFHHIIHIHHIEHEEHIHIIIHHHDHEHGHHIFGHHHHEEHHIEHIIHIEHGH?HFHHIHIIGHHD?FGHIIICH@FHIIHIHHIIIIIIEHGHHHIIIHGIII1CGHHHI?HHHIHGFHH<FH0FHCHHHGHHIIIHIIIHICGHIGH?8/=C@CC8.:;?FCEHICGHCHHBC.;AFG?.BABH?AAB@G6GHFFECHHHHHI-BD,EC@EHCF?AHGHF?
@SRR14933407.48/1
ATTAATATGCGTAAAATATTTAGTACAAAATTTACTCACTATTTTACCTTGAAACCTATCTGACTTGGTAATAAATTTTACTTGTTCCTTATTAGTAACGATTGTCATTGATTTTATTGATGGATGCTTAAAAAATGTAAATAAATCATATTCTGAAAATCCTGATTGGCCAGGATAGTTATGTAACATAACAATTGAATTCGGTTTACTGTCAAATAACAATTCGGTTGCTTGTTACCCTGGCACAAAAG
+
0DDD?HHIIFIHHDHCHGEH@FHFHEIIIIC@1CGEGGHHHGE<FHIIGIHHGHIIEG@1CE@G<CCHGHHHI11<GFEC@GECC?<<F1DCEGEFHHHHCHCC<1CFHHHE@@?FHC?FHEFH@DCH@<CGHFCC@EH11F?HHHHIIHEHHHC@HHHIHHCFHFHFGEGHE@@GH1D1D?1CECCF?E?EH0FG0FG0G?0CC=EFEC@DGHGH/<</<FG?G//GH?GHE:DC.FH?GH.8FG?.@B.
@SRR14933407.49/1
GTTATCTGATGATTTGTGTTGAATTATTTGTGTATATATTAACCGATTAACGTCTTGTTCTTCTTGAACGTCTTTCTTCTCTATTTCTAATGAAGCTAGGAATATCATCTTCTTTAGTTGTATGTGTTCTTTCACTTACACTATCAGTTGCTTGTGCATTTGATGAATTTGAAGTGAATGATTCATCTTTAGAAGTTGCATTGCTAGAAGTATTTACGCTTGTTCCGAATCCAGTGCTACCAGATTTACGA
+
DDDDAHIIHHHIHHIIHIIHHIIIIIIIGIHH<CHHIIHHHIIIHHHFIHEHHHIIH?GHIHGIG@HHIHIHHIIHHIIEHHHHIIIIIHIHIHHHHEHHG<FHHIH?GCEHIIIHHGHHHHIEHEFEEIFIIIIIIEGEEHHHHHIIFHIHIEHHHHHF11<D@CCHIH@C?EHHHHHHEHIGHGCGHHHDHH?GHGHHHHHEHHHII@HGFGHIIIGHE?<CHECCEHHGHHHHHIIECGHGHIIHHHE
@SRR14933407.50/1
TCTTAATATAAACATCTATTTCAGTGTTAATTTACAACTATCTACTGAAAATTATGTCATCATAATAAATTATGCAATAAGTATTTGATACCTTTACATTGCTATGTATCTCTAAAAATTGCTCCCACACATTTATTTTATACCCAAATATTTAACTATTTATAATTATAGTAAAAAACATTTTAATAGTATATACTTTTCTCATATTTTTTTGATATAAGTTTTAAAAAATAAGTCGATGTACATCAACT
+
DADDDIIIGHHIIIIIIIIIIIIIIIHIIHIIIIIHIIHIIIIHIIIGGHIIIIHHIIGHHIIIIGEHHIIHHGHHIIIHIIHHHIFHIHHIIFFHHIIHIHHFHCHHIIIGHIIIGIHHHHIIIIIIIIHIIIF1DGHIIIIHHHGIIHHIIHIIHIIHIIHEFHIIIHCHFG@HIIIIIIIIIIIHEHIIIIEEHIIIIIIIIEHHIHEHGI///GCHEHHIHGHHEHHHIHEHCC.CFGHIHHHFEHH
@SRR14933407.51/1
GAAAGACGTTTCAGATTATGAAGACATTTCGGATTTATACTTAATCAGCGATGCGTTAGTTACCGACTACTCATCTGTCATGTTCGACTTCGGTGTATTAAAGCGTCCGCAAATTTTCTATGCGTATGACTTAGATAAATATGGCGATGAGCTTAGAGGTTTTTACATGGATTATAAAAAAGAGTTGCCAGGTCCAATTGTTCAAAATCAAACAGCACTCATTGATGCATTAAAACAAATCGATGAGACTG
+
DDDDDIIIHIIIIIHIHIIIIIIIIIIIIIIIEHIHIIHIHIIIIIHHIIIIIIIFFHIIIHIIIIIIIIIIIHHIIIHIIIIIIIIIIHIIIIIIHIIHIFHIIHIHIIIIHIIIIIIIIIIIIIHIHIIIHIIIIIIIHIIIIIEHHFHIFIIIIIIIIIIHIIIHHHGHHIIIIIIIIHIIIIIIIIIIIIIGHHIIIIIIIIGHHIIIIIIIIIIHIIIHIIIIHHIIIEHHIHHFHIIIIIHIIIC
@SRR14933407.52/1
TATTTCATGGTATGGCTGAACATATGGAACGTTACGATAAATTAGCACATGCACTTTCTAAGCATGGCTTCGATGTGATACGTCATAATCATCGAGGACATGGTATTAATATTGATGAATCAACAAGAGGGCATTACGATGATATGAAACGAGTTATCGGTGATGCCTTTGAAGTAGCGCAAACAGTGAGAGGCAATGTTGATAAACCATACATTATAATCGGACATTCAATGGGATCCGTTATAGCTAGA
+
DCBDDIIIIIIIIIIIHIIIIIIHIIIIIIIHIGGIHIHIIIIIHEHIIIIIIIIIIIIIIHIIIIIHIIIHGIIIIIIIIGG1CGHHECHFHIIIIII?F<GHG?EHIGIHHIGHIIIIGFHIII@HIIHHGHHHHIHHHCHIIIHIIIHEHEGIIIIHHIHIHIIHGCHFHIHIIIIHI<EHHHIIIEFHEDHHIIIIIGEHHHHHEEHHHHGHHHHHHHIIIHHFHIGIIIIHHHHIIEHIEEIHC?.
@SRR14933407.53/1
AAGGATTAAGAAGTTTGAATGGTGGTCGCATGGCAAGATTTGGACGTACACCATTACTTGATGCAATGGAGATGGCTAATGAGCATATTATGGTGATTGCCATGATAGAAGATGTTGAAGGGGTTATGGCCATTGACGATATAGCACAAGTCGAAGGTTTAGACATGATAGTCGAAGGTGCCGCAGATTTATCCCAGTCACTTGGCATACCATGGCAAACGCGTGATGATCAAGTAACATCACATGTTCAA
+
DDDDD@FHIIIIHHIIIIIIIIIIIIIIIIIHIIIIIIIIIIIHIIIIGIIGIIIGHIIIIIIIIIIIIIIIIIHIIIHIIIIIIIIIIHHIIIIIIIIIIHIIIHHIIIIHHHIIIIIIIIIIIIIHHIIIGHIIIIIIIHIHIIIIIIIIIHIIIHIGIIIIIIIIIIIIIIIIIHHHHIIIIIHHHIIIHHIIIIIIIIIIIIIIIIIIHIIIIIHIIIHIIHIIGIIIHIGHHIIIHHHIEHHHHIH
@SRR14933407.54/1
ATTTGAAGAAGTGCCATCTTAGATTGGATTTCTTTCATTTTTAATAACCTATCTATTGAAGCGCTATTGGAATCAGAATTATCATATTTAACTAATAGATTTAATAGTTTCACTTGAGTCAAAGTGAGATGAAATGAATATTTCAACATATCATTAATGACTTTCAACATTAGTATGTGTGCAGTTAAACGCTTAACTTTAAATTTAGACATCTTTAAAACCTCTCTTAAACCATGCCTATATCTCAAGAT
+
DDDDDGHIIIIIHIIIIIIGIIHIIIHHHIIIIIIIHIIIHIIIGIIIIIIHIIIIIHGHIHIHIIGHHIIIGIIHIIIIIIHIIHHIIIIIIIGIIIIIIFHIHIIIIIIIIIIGHHHHIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIHIHIIIIIIIIIIIIIHHICHIHIIHIIHIIIIIIIGIIIIHHIIIIIIIIHIIIIHIHFGHHHIDHHIIIIIHGGHHIHIGIHIIHIIGIIII.
@SRR14933407.55/1
TGATCGGCCACACTGGAACTGAGACACGGTCCAGACTCCTACGGGAGGCAGCAGTAGGGAATCTTCCGCAATGGGCGAAAGCCTGACGGAGCAACGCCGCGTGAGTGATGAAGGTCTTCGGATCGTAAAACTCTGTTATTAGGGAAGAACATATGTGTAAGTAACTGTGCACATCTTGACGGTACCTAATCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTAT
+
DDDDDHHIIIIHIIIIIIHIIIIIIIIIIIIIHEHHIGIIIIGIIHHHICHIIIIHIIIIIIIIHIIHIDHIIHIIIIIICHIIGIHDGFHHFHIIIIHGI?FHHFHHHIIIGHHHGHIIHHEDHIIIHIIHIHIIIIIIIIGHIIIIIIHGEFGHHIIIHDGHIHHIHHII.FGHGIHIGDIEHHIHICHIIIHHHHC@FHIIIDEE.BBHHEHHHHHHGHHIIIHHC.-9BGB?GHCHHIIHEEEB.EG
@SRR14933407.56/1
TTAGTTTCAGGCGGGAATGTTGACTTAACTAGAGTTTCAGGTGTCATTGAACATGGACTGAATATTGCAGATACAAGCAAGGGTGTGGTAGGTTAAAACATTTAATCTGAAAAATGAGGTGTAATTATGTCAAATGGTAAAGAATTACAAAAAAATATAGGTTTCTTCTCAGCGTTTGCTATTGTTATGGGGACAGTTATTGGTTCAGGAGTATTCTTTAAAATATCAAACGTAACAGAAGTAACAGGAAC
+
DDDDDHHIIIIHIIIIIIIIIIHIIIIGIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIHHIIIIIGIIIIIIIIIIIHHIIIIIIIHIIIIIIIIDGHIIIIIIIIIHIIIIIHHHIIIIHIGHIIIHIIIIIIIIIIIIIIIIIIIIIIIIIHIHHIIIIIIIIHIIIIHIIHIIHGHIIIIII=CHHIFHIIIIIIHIIIIHIIHHIIIIHHIIIIICGHHIHIIIIIIIIIIIIHHHIIII@
@SRR14933407.57/1
ATATGAAAATATTTATTATTATTTTACTACAACTCGCTTCAATTTACTTAAAATAGACAATATTAATTAGATAGTACACACATTTCTTCATAAAAGTGATTTTTCAAAAATATAAATAACACACTCTTATCTTTTTCAAAATCATTTAATGCTATTTTCATTAAAATCAGCTGAAGCACCAAATCTATTCTGATTCAATCAAGAATACAAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATT
+
DDDDDIIIHIIIIIIHHIIIHIIIHHIHIIIHIIGIIIIIIIIHHIIIHCFHGGHIIIIHIIHH?FHIIHHGHIHIIIIIIIIIIIIIHIIIHIHHHHHGHGHHIIHHIIIIIHIIIHHIIIIIIIIHIIIHHIIIIIIIIIIIIIIIIIIHIHHIGHIGHIHIIIIIHIIIIIIGGHEGHHIIHIIGHIHHEIIIIFFHIIIIIIIIHGHHIIHGHHHIIIIIIIIIIIIIHFHIHHE7FHIGHDEHGHB
@SRR14933407.58/1
AGTGCATGATGTAATCCTGTAGGAATTAATAATCTGTTGGCAACACCATATACGAAAGCCCCAAATGATCCTAAACCAACTATAGATTCACCAAATTTTACAATCCATGAATAAAGTAGTGGCCATAAGAATAACAATATGACAACTAAAAATGTACAGTAAAATGCAGTCATAATTGGAACTAAACGTTTACCACTAAAAAATGATAATGCTAATGGTAATTCTGTTTCACTAAACTTATTGTATGCATA
+
DDDDDHIIHIIIIIHIIIIIIIIIIIHIIIIIIIIIIIIIIHCDGHHIIHGIIIIIIIIHHIIIHIIIIIIIIGIIHIIFHHIIIHIIIHHIIIIIIIIIGHHGHIHIIIGHHHHIIIIIIFGHHGIIIIIIIIIIGIIIIHHIIGIHIHIIIHHIIGHHIIHEHHIIIHHIIIIIIIIIIIIGIIIIHIHIHIIIIIIIHIHHHIIIIIHIIFHIHEHHHIEHHHH?HIHIIHHHIEHGHHHIIIIIIIG
@SRR14933407.59/1
ACAACATGGTATTGATGATGAAAATGCAACAAAACAAACTCAAAAATATCGTGATGCAGAGCAAAGTAAGAAAACTGCTTATGATCAAGCTGTAGCTGCTGCGAAAGCAATTTTAAATAAACAAACTGGATCAAATTCAGATAAAGCAGCAGTTGACCGTGCATTACAACAAGTAACAAGTACGAAAGATGCATTGAATGGGGATGCTAAACTGGCAGAAGCGAAAGCGGCAGCTAAACAAAACTTAGGCA
+
DDDDDIIIIIIIIIIIIIIIIIIIIIIIIIGIIIIIIIHIIIIIIIIIIIIHIIIIIIIHIIIIIIIIIIIIIIIIHIIIIIIGIIIIIHIHIIIIIIIIIGHDIIIIIIIHHIIIIIIIIIIIIIIIIIGHIIIIHIGIIIIIIIIIIIHIIIIHIIIIIIIIIHIIIIIIIHIIIIIIIIIFCHIIIIIIIIGHEHEHHIIIHGHHIIIIIIIHHHIIIIHIH?EHI?<HHHHHIIIIIIIIIIHIHI.
@SRR14933407.60/1
AGTGATATTTCTATAGCACCAACAACGTACACGTTAGATGACATTGAACGTGAGCTTGGTCATGATATTAGTTTTAGAGAAAGAAATGGTAAGATAGTCCGCCTATATTTTAATGGGAAATCATCATTTGTAACATGTTATCTGCAAAATGAACAAAAAGACATTGTGGATCGTGCAGAATTTGTAATTAATAGTATGTTAGTACGTAAAGATTTTTATAGTTATACACGCATATTTTCTGAGTATTATGC
+
DDDDDIIIGHIHIIIIIIIIIHIIIIIHHGIIHIIIIHIIHHHHIIIIHIIIHIGIIIIHIIIGHIIGIIHGHIIHIGIIHIHHIHIIIIIIIIIIIIIIIFHHIIIIIIIHIHIHIGIIIIIHIIIHIIIIIIIIIIIIIHIIIIIIIIIIIIIIHHIIGIIIIIIIIIIIIIIIHIIIGIHIIIIHIHHHHHHHCHHIIG<FHHIHIHHIICHCHEIIHIIIIIIIICHHHHHIIEHIIIIIFHHIHHC
@SRR14933407.61/1
GTTTCGCCTTGTTTAAATAGCGTGCCTGAAGGTAACAATTCGATTTCAACAGGTACAATCTCATCTTGTGACAACTTTAATTCTGTTTCATGTTTATGCCAAGGTTGCGCGATTGTGGATTTTTCTTGATCTAATTCACGATGTGATACGCGTAACCAACCAGTAGCTACTTGACCATTTTCAATATGATTAAAATCAGGGAAGTTAACTTCATTACCACGACGATCTAACTTTTTAATACCTGCAAATAA
+
DDDDAHHHHHHEHHIIHIHHHIHHIIHHHIIIHIIIIIHHHGIIIEHHIIIIIIIHIIIIIIIHIIIHIIHFHHIIHHHHHIIHIHIFHHIIIIHIIEH1C<GHFHHHIIGHIGGCHHIIIIIIIIHHGHIIIIIIIIIIHIHIHFHHIIIIHIIIHHIGH=CGH?HHHHIHEHHHIIIHCHHHIIIEHHIIIFFHFHEFHHIIHGHIIFFHHHHHHHHHAEHDHHHHHEHFEHGGEHHIIHI.FHGB@BE
@SRR14933407.62/1
CAATTTAAGACAACTCAAATAGCATTTAGCTAACAAGTCCTGATTCAACTTTGAAACAGGCAATCGACGTATTTGCGGACCAACCGTATGAATTATATACTTTGCTGGCAAATTATATCCACGTGTTATTTTGGCTTTACCTACACCTTCATTGCGCCCTTGTTGTCGAATAATCTCTGCACAATCAAGTCGAACTTGAACACCCGCTTTTGTATGAATAATATTATCAATGCAGTCATGATTAGCTTGCA
+
DDDDCIIIIIIIIIIIIIIIIIHIIIHIIIIIIIIHIIIIIIGHIIIIIIHIIIHHHIHCHHIIIIIIIIHHIIHIIIHIIIIIIIHHHIIIIIIIIGHIIDFHHIHHGIIGIIGHIIHIIIIIIIIIIIIIHHHGHHIHHIIIIIIIIHIHHIIIIIHHHIIICHHGEHHHHIIIII/@HHIIIIIIHFHIIHHIIIICHIIGHHDCGHICHHGECHHHHHHIG7GGHF@EHIIHHBGHEHHGGEGF@G.
@SRR14933407.63/1
AGAAACAATTGATCATCCAGAAGATTTTGTTCCCTATCAAGATACATCTGTAACACAGCAATTAGAGGATTATCGCTCGAACAATAAATATGTTACTAGCTTTCTATTAATGCCAACAGTTATAGTAGTTAATTCAGATTTACAAGGAGATATTAAGATTCAAGGTTATCAAGATTTATTACAACCTATACTTAAAGGTAAAATTGCGTACTCAAATCCAAAGATCGGAAGAGCACACGTCTGAACTCCAG
+
DDDDDIIIIIIIIIIIIIIIHIIIIIIIIIIIIIHIIHIIIIIIIIIIIGIIIIIIIIIHIIIIIIIIIIIIIHIIIIIIIIIIIIIHIIIIIIIIIIIIIGHIIIIIIIIIIIIIIIIHHIGIIIIHHIIIIIIIIIIIIHIIGHHIIIIIIIIIIHHIHIIIIHIIIIIHIHIIIIIHIIHIIIIIIIIIHIIGIIHHEGFHIHIIIGHIIIIIIGGEHHFHIIHIIHDHIGIIIIHGHHHHBFGIIHF
@SRR14933407.64/1
GCTATATATGTTTCAAGAATATGAAAAAGGCGTGTCGTTTCAATACTTAATCGATACATTACAATTAGATATTAATGTCAATACTTTGTATCCTAAGTATAAATATTATAAAACCCATGGTATTGAAACACTACTTACACAAAAACAATACAATCACTATTCAAAGGAACTAAAATTAAAAGTTGTAAATGAATATTTGAATTCAAATCAATCAACCCAAGACATCGCAATCAAATACAATATTCGTAGTT
+
DDDDDIIIIIHIIHIIIIIIIIIIIIIIIIIIIIIIIHIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIIIHIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIHIIIIIIIIIIGHGHIHHHHIIHHIIIIIHIIIIIIIIIIIIIIHIIIIHIIIHIIIIIIIIIGIIIIIIIIHHHHIIIHHHIIIIIIIIGHHHIIHH
@SRR14933407.65/1
GTTAGCATTGCGATTAAACATATTCAGGATTCTGTGCCAAATGTGACAACCGATGTACGTAAGGATATTCCGCAATCTTTAAGTAATGTCATTTTACGCGCTACAGAAAAAGACAAAGCGAATCGTTACAAAACAATTCAAGAAATGAAAGATGATTTGAGTAGTGTTTTACATGAAAATCGAGCGAATGAAGATGTCTATGAACTCGATAAAATGAAAACGATAGCGGTACCATTGAAAAAAGAAGATCT
+
DDADACHHFHGHHHIHHHEHHHIIHIIIHIIIHHHIIGIIIIGIIHIIIGIIDIIHIIHHIHIIIIIIE?FEHHHIIIHIIEHGHHIIHIHHHIHHEECHIEHIIHIHHIICGHIIIEHIIIHEHEEHEGHEHHHHIHEHIIGHIIEHIHIHIHHHHH?FEHGDHIIIIIIIIGHHHF/CGGIGHHIIIIIGIIIIIEHIIHEAHGHIIIICHHHIIIIHHHFIIIIHHHCEEHHIEHIIIIIIIGIHHIA
@SRR14933407.66/1
GGCAGCGCAGAAATAATATTTTTATTTCCTAGAGACATTTTAAGATTGTTAAATAGAATCATTAGTGAATCTTATTTTAAAGATAGTAATGGATTAATCTAAATAAGTGCGGATAATATTAACATAACAACATAATTAAAAGACATAAATGACAATAAAAGGAGTATAGAAATGACTCAAACTGTAAATGTAATAGGTGCTGGTCTTGCCGGTTCAGAAGCGGCAGATCGGAAGAGCACACGTCTGAACTC
+
DDDDDIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIHHIIIIIIIIGHHIIIIIIHIIIIIIIIIIIIHIIIIIIIIIHIIIIIIIIIIIIIIHHIIIIIIIIIIIIIIHIHIGIIIHHHIIIIIHHHIIIIIHIIIIIIIHHIHHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIHHGH
@SRR14933407.67/1
TAAGTCGTTTTTAAGTGAGTATATTTCAAGTTAATTAGATCTATAAAAAATTTAGGATGGATTAAATGTATATTAAGGATGTTTAGGATAATTGGATTTAAATTTCTAATAAGTCATGATTATCTATATTTTGAGAGTCAATATCTAAAATACTATCTGAGAGAATTTCATTATTTTGGGATTTAGGGGACATAAATTTTATTGTTTCAACATATAATTCAGTGACAAAGTGTGTTTGTCCGTCTTTATCA
+
0B?DDGHHHE1CG?FEHEG1DGH?CCHH?CFGHHIHIEF1FEHFHHH@HCGHE@<@C1DEEHCEFHHCGHIIGHGHHH1<FHFH@CC@HI?<1FHIHFHHHGFHGEHIHGIIICGGHEECDGHHIHIIEGHIIIGIIC1GHEHHHIHHHIHGFHHIIII1GHICHHHHGHHFH?CGEFCHH1FGHGEH.<CEE?H@CHHIIIHHF?@@C?GEEHHIIGEHIHHHHH@EH?/DFFC@FCFE7ECHEHHHII.
@SRR14933407.68/1
CTTCCTTAGTGACTTCATAGCGTTCTTTTAATGTTTGAATATTTTGTTCGATTTCTTTATCGAATTGTTTATTATTTTTTAATACATGCTTAACAGCGCTTTGTGTTGGAAGCGGTCCACTAGAAATGTTGCTTCGTATAAGACCTTTTACTTTGGCTTCTAAAAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAAAAACACACT
+
DDDDDIIIIIIIIIIHHIHIIIHIHIIIIIHIHHHIIHIIGGHIIIIIIIGIIIIIFHHIIIHHIIIIIHHIIIIIIIIIHIIIIIIIIIIIGIHIHHIIIDGHIIIEHHIHHHIIIIIIIIIIIIIHIIIIIIIIIIIIIIIIHIIIIIIIIIIGHHIIHGIIIHIIIIIIIIHHEHIIHIIHHCGHIIIIIIIIIHECHIIHHIE;.BHHIIIHIIIIIEEH.FEH.96BGHHHDEHD6,,,---8...
@SRR14933407.69/1
AGCAAGGTTTTACAATCTTACGCTCTTATGAGCCAGAAATGTATGGCGAAAAGTTAGACTTAGCACTTATGTACAAAGCATTTTAAGTAAATTAAATATTATATTAAACGAAAAGGAGGGGAATAATGACTAACGCATATGTAGATTTAAAATTAGTAGAAGAAAAAGTTTTTAAAGACCCGATACATCGATATATTCATGTTGAAGATCAATTGATATGGGATTTAATTAAAACTAAGGAATTTCAAAGG
+
DDDDDIIHIIIIIIIIHIIIIIIIIIGIHHIIIIHHEHHHIHIIIIIIIIIIIIIIIIIGIIIIIIIIIIIHIIIIIHIIIIIIIHIHIIIIIIIIIIIHIGHIIIIIIIHIIIGHHIIIIEGHIHHIIIIIIIIIIHHHIIIIIIHHIHHHHHIHHHIIGHHGIDGDHHIIIIIHEGHIIIDEHHIGHIHHHHIIIHICHHIGIHEHHHHHHFHHHGHIGHIIHEHHHHIEHHIIIIIHIGHEHHHIIIG
@SRR14933407.70/1
AACATAGTCACCCATTGATATTTCATCTGTGACAAGATGTGTTCGGATACGTTCATCGTAACCTTCATAATGTCCGCATATGAAAACGATGTGGTCAGCCTTGCTTAATTCAACAGCTTTCTGATGTGAAAATGGCTCGCCTTGTGGACACATTAAAATAACGCGTGCTTGTTCTGTGACATCTAAGTCTTCCATCGCATTAAAAACAGGTTCAGGCTTTAACACCATACCTTGTCCGCCACCATACGGAT
+
DDDBDIIIIIIHGIIIHHGHHHIIIIIIIHHGIIHHGHHIIIGHIGIHIIIIIHHHIIIIIIIIIHGIIIIIHIHHIIIIIHHIIHIIIIIEHIHIGHEHIHHHIIIIHIIIIIHHIHHIIIIGIIHHIHIGHHHIIIHHIHGHIEHHHIIGHCHHHIIIIIIICHDEEIGCEHIFHIGHIHGCHEHHHIIIIIHHHHHHCEHHHDEFHHEHFHHCHFEHIIIIIHIHIGHHIGECFGDHH=@.9BF.E.7
@SRR14933407.71/1
CGGCGATATTAATAATTATTATACCCTAACTTTCAATATATCAAACCATTTAACTTTAACATGCTTATACTCTAAATATAGCACTTAAGCATCATTTTTATAATGAAAATGAGTAAATTTTAATTCAATCCTGGAAAATCTTGTTGACGTAACGCTTCATAAATTAATAACGCAGCAGTATTTGATAAATTTAATGAACGAATATGTTCACTCATAGGAATAGATCGGAAGAGCACACGTCTGAACTCCAG
+
DDDDDIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIGIHIIIIIIIIIIGIIIIIIIIIIIIIIIIIIIGIIIIIIIIIIIHIIIIIIGIIIIIIIIIIIHHIIIIIIIIIIIIIIIIIIIIIIIIIIHHIIHIIIIIIIHIIIIHIIIGIIIIIHIIIIIIIHIHIIIIIIIIHIIIIIIIIIIIIIIIHIIIHHIGHIIIIIHHIIIHHIIIIIIIIIIIIIIHHIIHHIIIHIIIIIHIFHHIIIIG
@SRR14933407.72/1
CTCTGAAAAGAAAAAGTATAACAAAGATGTTGAAAAACGAGAGAAAGATTACAAAGCTTATTTGGATAATAAATCTAAAGAGATTAATAAAGCGATTAAAGCACAACGTTTTAGTTTGAATTATCATTATCCAACGGTTGCTGAAATTAAAGATATCGTTGAAACGAAAGCACCAAGAATATATGAAAAAACATCGCATCATCACGATTTCTTACATTATAAGTTAGGTATTGCGAATGTAGAAAAGTCAT
+
DDDDDIIHIIGHHIIICHHIHHIIGHHEHDGEHIIIIIIIIIIIIIHGHIHHIIIIGHHFGHIIHIIIIIIHIHIHIIHIIIIGHHHIIHIIIIGHHIIIGEHIIIIIEEHHHIFIIHIIIIIIIIHIIIIIIIIIIIHHHIIIIHIIHIIIIIGIIIIHIIIHIIHIHDHIIHIHIHHII?HHIHHIIHIIIHIIIHHHIIIIIHIGIHHE?BHGHHHFEHFCHHHEGHIGHF@D,?GHGHF.FAFGHHA
@SRR14933407.73/1
ACAATGATACCTTCTGACTAAAGCTGTAGTAACTATACTAGAACGTCTAGTTCATAAACACGTTTCACTCTATTCTTTATTTAAATAATATATAAGCAGCAAACGCCAAGGCTAATATACCAATAAAATAATTAATAAAGATTTTTATTTTAGTTTTTGTAATAGTTGACAAGTTTTATCACCTCTTAACATCGATAAATTAAACGCCGTTATTTTTCCTCTGCCTATCTTTTATCATTTCAGAAGGTACT
+
DDBDD?FGHIIHHIHHIHIIC1GE@?GHIHHHIFEHHHEH<<FEHFHHIHHIIHH@GHHIIG@FEHICHHIHHHHHGHHEGFHHHGECHHHIIHGHIIEHIFHHGCDH<G@HHHIHHHHG?<<<FGGEHHIEFCHGGHHHE?GH@GHIEFHIIHIIIIHHHGHHHHH1DHEEHHHGEHIICHGCCHHFGFEEE=GH@ECHEHGHHGIICDHHIIHHC@C@.DEEC?.FHHHIIIHIIIIFH..8G?GEHC.
@SRR14933407.74/1
GAAGTATTCACGCGCTCTTTGTGTCGCTTCTTTATCTTGTCCAGCAAATACTGTCACTAGACGATCAGGTAATACGTCATAATGTAAAGCATGTGATGCTGCTGGTCTTGCGATACCACCTGCACAACCACATACAGAATTGATCATAACTAGTGTTGTACCATCTTGTTTAAGAACTTTGTCAACATCTTCTGCAGTAGTTAATTGCTCATATCCCGCAGATTCAATTTCATTCCTTGCTTGTTCTACAA
+
A@BD@HI?@C<CFHCH=HHIH<1DCCFCEEHHEH@FHE<G1<F1@<<FE@EHEGHHHHCFHIIDH?H@1FFEH@CGF@0<G1CC<<CC1CCH1G11<1CG11<CFH@C@<1FCGHCCCGEH@EHHIHHECEGHC?<11<CC1C?CEEHEEGC?GECCCCG@G?F@C@0DHHG@GCFFH/<FHE/<ECH@@FFH?/<FEE?@FFHHIGEHEH.DFH?CDHGDHIHFH?GHHH?AG?FHHHHHIHHE.F?EB.
@SRR14933407.75/1
CCTATCGTTAATACACGTGTGTTAGCTAAAGATACATCATCATCTAAACGGAAGTACACACTTCCTGACAAACTATGTTGATGATCGATGACGATACCAGTTAAATTAGACATTAAGGCAAACAAACTGAAAATAATACATAATTGCCAACGTGCTTTCCATGTACGTCGACGATTCATTAAGTTTTTAAAATATGGAATATTCATCAACACATAGGCCAAAATAATAATTAAACCTACACGCTCAAGTAG
+
DDDDDIIHHIIIIIIIIIIIIIIIIIIIIGIIIIIIIHIIIIIIIIIIIIGHIIIHHIIIIIIIIIIIIIGIIIIIIIHIIIIIIIIFHFIIIIIIIIIIHFHHIIIIIIIIIHIIIIIIIIIIIHIHHIIIIIIIIIIIIIIIIIIHIIIIIIIIHIIIHIIIGHEHIIDHHHCHIIHHIIIHCHHHIGHIHIHIIHIIIIIHHIHH?HFHIIIIIGFHEHHHIGHIFHEHH@HHHI?HHGIHDHHCEH.
@SRR14933407.76/1
TGTTTGACTATCCAAGTCTTTACGGTCAACCAAAAAGATAACTTTCTTAATGTCATCTTGTTGTGATAAAATCTGACTCGCTTTAAAAGAAGTCAACGTCTTACCACTTCCCGTTGTATGCCACACATATCCATTATTCCCTGTCTCAGTCGCTTGTTGAATAAGTGCTTCTACCGCATACACTTGATACGGACGCATTGCCATCAGTATTCTATCTGTTTCATTAATAATCATATAGCGTGATATCATCT
+
DDDDDFEHIIIIH@HIHIIHIHHHHIIHIHIIIIIIHIIIIHHIIHIHIHIHIHHHHHIIHFHHHHIIIGHIIIIHIHIIGIHHHIIIIHIIIHHHIIHIH<FHEHHIIIIIIIHGHHIIIHIIIIIIIIIIIHHIIIIIHIGHHIIIIIIIIIHIHHIHEHHGHIIIHHHHIIHIIIH0CHHHIH<FEHHHGIGIHH=HHIHIIGHIIIIIH?HHHEHIIIIIIHEH?AFHIIIII?GCHEHFHHHEGHA
@SRR14933407.77/1
CCTACATCAACAACCAAACCACCTTTAACTACTTCTGTTACTTTCGCTTCGATGATTTCATTATTATCTAATTTTTCTTGTAAATAACTATAAGACTTCTCAGTTTCAAGTTGTCTTCTAGATAAGATGTAAGCTCCAGTTTCATTTTCTTCATCAAACTCAACTTTAGTGACATATGCTTCAACTTCGTCGCCCTCTTTTACAACTTCACTTGGGCTATCAATATGATGCGTAGATAGTTGACTAATAGG
+
B@DDAHHIHHHFHEEHHIHFIHFIIIIIIIEHHIIIIIIIHIIHEC<EEHIGE@1@CFHFH?CFEEHEHHIIIIIHIHC@GHFHEHIHECHHIH@GHHFCEDGGCGHIIIIIIIHHIHIGHEHH1<@FHHIHHIIHEHHEGHIIIHIIFHCEHHGIEHHCHHIHEHIHIECCGHICCGHEGGFHIIIIEGHEHII<HHHEH?HHHIIHHH?C?H/FHHIHHIGHIIHGIIHHIIHEEFFGHGHGHGCEC@G
@SRR14933407.78/1
TGTTAATAACGATGTAAAAGCTACATCGATGTCAATGTTATTTTCTCTTACTTTTTCACCTGCTCTAGTCGCTTCATTAATACCTTGTTCAGATAAATTAACATCTTCCCATCCAGTAAATAAGTTTTTAGCATTCCACTCGCTTTGTCCATGACGACATAAAATTAATTTTGGCATAACTTATAACCTCTTTCCTTTAATCTTTAGGACGTTATTTCAGGGATATTTTAAGATGATTTAATCTTAGCAAT
+
DD@DDIIEHHIIHIIHHIHIIIIIIHHHGHIIIHIIIHIIIIEHIHIIIIIIGHHGGHHHIIHHHHIIIIIHIGHHEHIIIIIIHIIHIIIIIHIHEHHIGGHHHIIHHIGIIIIIIIIIHHIGIGFHHIIIIIIIIIIIHGIGIIIIIHHHCHIGIIGHHHEHHHHGEHIIEHIHHHIGIIIIHIIGFHHHIIEHIIIEHICHIHEEFHHHIIIIEEHIFEGHIIIIIGEHFHHHHHHIEHHHIHHHHFH
@SRR14933407.79/1
AATTAAACCATGCTGACCATCGTCTCATTATTTATAAAGATATACATGAAAACATTGAAGATGGTATCACGTTGTTAATTGTTATGGCCGTAGTTCTTGTTTTACTAGTAATATTTGGTTTCATTAGTGCAGATAATATGGCTAAACGACAAACAAAAGATATCGAAACGATTATTCAAAAAATTTACTATGCCAAAAATCGTCATCTAGGTACATATACGCCTTTAAAAAACAACAGTGAACTAGAAGAA
+
DDDDDIIHHHIIIIIIIIIIHHHIIIIHHIIIIIHGH?GHHIIIHIIIIIIIIGHHHIGHIIIIIIEHIIIIIIIIIIIIIHHIIHIIHHHIIIHHIHIIEGHHHHIIEGFHGHHEHIHHIHIIIIIHGHIIIIIIIIIIIIIHIIIGIIIHIIIIIIIIHHIIE@HHHHIIHIIHGFHHIHIHGHIIIIIGIIIIHGHHEGGGGHHIIHHIHEHHHHHHHGCEGHHHHFF--BGHIE@.BFHIFHHEG@H
@SRR14933407.80/1
TATAAAAATCCAGATCCAGTGACAATATAATCAAAGCCATCATCTGTCTTTACAATTTCACTTTGGTATAATGTTCTAACCTTGCTTTCTACTTCAGTTTTTTGCGAACAAAAACCCGTAAAATCATGTGTACCTATGAATTGTTGTGCCGCTCTGTTCATTTTGTCTAAATCCAATGGTTCAGGAATAAATGTTTTCAAACCACTTTGAAATGGATCGCGTTGTTGTGCTTGATATACTTTATATAGATC
+
CDDDDIIIIIIIIEHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIHHHIIIIIIIIIIIIIIIIIIIIIIIGHIIIIIHIIIIIIIIIIIHIIHI<EEHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIHHIIIIHIIIIIIIIIIIIIIHHHIIIIHIIIHHIIIIGIHHIIIIIIIIIHIIIIIIIIHHHGHIIIIHHGCEHEHIIHHCHHIIIHHIIGHIIIHHIIHIHHHFHIH
@SRR14933407.81/1
ACAAACGAAATCTAAATTGGGTACTTCTGAAGCAAGAAGTGCTGTTGATTCAGTAGTTGCAGACAAATTACCATTCTATTTAGAAGAAAAAGGACAATTGTCTAAATCACTTGTAAAAAAAGCAATTAAAGCACAACAAGCAAGGGAAGCTGCACGTAAAGCTCGTGAAGATGCTCGTTCAGGTAAGAAAAACAAGCGTAAAGACAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTAGCT
+
DDDDDHIIIIIIIHIIHIIHHHIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIGHIIIIIIIHIIHIIIIIIIIIIIIIIIIIIIIIIIIIIEHIIIIIIIIIHIIIIIGIIIIIIIIIIIIHIIIIIIHIHIIIHIIHIHHHHHICECHHIIGGEEHIIHIIHHHHHHIIIIGHHIIIIIIGIEHHHIIIGHHHHDHH-.9.
@SRR14933407.82/1
GGGATATGACCCGTGTATTTTGAAAAGTGGTAAGCAATGTTTTTAGTTTATAGGGATTCATTAGTTGTATATAATAAGCATGAAAAGCATATAGTCGATCTTATACTTTTAATTGAGACTTATAGTGTATTTAAGTTATAGATAAATTCATTAAAATATTATGGAGGAATGAGTATGTCAAAGCGTCTATTATTGTTTGATTTTGATGAAACATATTTTAAACATAATACAAATGTAGAAGATTTAAGTCA
+
@@@@DFHGHHCH?DEC<C<<1FC<@@DFC1DGHH1FE@GEC?CE?111DGCEHHC11<C@HFG1<1<1D<DHHIEHH11FFC?HEH111GG@F1<D<1<<D<FHG11CG@@FHEG?G1<DFEHC@<<<<GGC1DCHHEHEFE1<FHFGHEGHGHFG@C<CEC?111C0D@G1<<FHHHH1FG?@CHH=EEHHHHH@FF?C@EH00GHHIIIF?/<<CGEG/<CG?CEHHIHH<FH/</C.?77.78GE@FC
@SRR14933407.83/1
TTACCACGTTATAATAAAGACGGTAATATGATTAAATGGTCACGACAAAAAGATTCTTTCTATTATTTCAATCCTAATTGGCATATTGTAGGTATTACATATGATAGTTTAGAGGAAATTAAGGAGAAATATTCAGAGCAGTTTTTAAAATACACACGTATAGTCAAAGTCAAACATGACTAATAATATATGGTAGTTTTTGAAAAGAAGTAAGGAAAGATTAAAAATTGCATATTTGAATAACCGTATAA
+
DDDCDIIIHHIIIIIIIIIHIIIHIHHIIHIIHHIIIIIIHIIIIIIIIIIIEEHIIIIIHHHIHIIIIIIIGHIIIIIIIGHIHHIHIHGHIIIGIIIHHDHIIIIIIIHIHHIHIIIHIHIHHHIIIIIIIIIIIHIIIIHIIIIIIIIIIIIIIIIIIHHHIIHIIHHGIIIHIIIIIHIHEHIHIIHIEHHHG@GGHDGEHF?CHHHHHHHFCDCHIHHEHHHIIIIHIIHIHHHIIHIIEH-..;F
@SRR14933407.84/1
TTCGTCAATATACTTATTCGTTCTACTAATAAACTTCCATACCGTAGATAATGCCACAGAAAATTGCAAACTTTCCATGCTTTCAGTATAACTTTTCACTGTTTCTAAAGCCATAGCTTCCATTTCTTCATCTAATTCATGAAGTGGACCTTGATACGCTGGTAATTCGCCATCAAAGTACTTATTAATCATAGAAATCGTACGGTTTACTAAGTTTCCTAAGTCATTTGCTAGATCGAAATTTGTACGCT
+
DDDDDIIIIIIIIIIIIIIHHIIIIIGIIHIHHIIIIIIIIIIIIIGIHHIIHIGHIIIIIIIHHHIIIIIIGIIIHIIIIIIIIIGHIHIIIIIIIIIIIHIGHHHIIIIIIIIIIIIIIIIIIIIIIIHIIIHIIIIHHIIIIIHHHIIIIHIGHHIIICHHIIIIHHIIHIHIIIIHHIHHHIGHHHHHHHHHHIIIIHIDHECHIHIHHHHHHECGHHHHHIIIII?FHIIIHICHEHHEEHHIHC.
@SRR14933407.85/1
TGTGAGGCGTGAAAAGACGCCTCACATATTGTCTTGATTTATCTATCTTGGACATTTGAGGTTGTCGTTTGTTATGAATGCGTTTCTCAGCGTCAAATGTGAACTCTATCCATTCAATTTTACGCCCTTTTTTCGCTTTGATTTTATTGATAGTAAGATTATTAAATATAGAACCTAATTCAATAATAATAGGTTTAAAAACATTTTTGTTAATATCAGTCATACGATAAGATTTAGGTATATCTAAACGA
+
DDDDDIIIIIIIIIIIHIIIIIIIIIIIIIIHIIIIHHIIIIIIIIIIIIIGHIIHIIHHIHIIHIHEHHIIIIIHIIIIIIHIIIIIIIIIIIIIIIIIIFHHIIIIGIIIIIIIIIIIIIIIIIIIIIHIHIGHHIIIFHHIIIIIIGIIIIIIIIHIIHIIHIIHHIHIHIEHHHHHGIIIIIIIIIIEHHIHIIIIEHHHIDGHHIIHHGFHHHHIGIIHIHIHGH@BH?EFHHIHIFHHH?EE.A9
@SRR14933407.86/1
AATTAGAAATGAAGTTATTCGTGCAACGTATGAATTTTTCAACAAAGATGGATTTACAAAGGTTGATCCACCAATTTTGACAGCAAGTGCACCAGAAGGTACAAGTGAATTATTCCATACTAAATACTTTGATCAAGATGCGTTTTTATCTCAAAGTGGTCAGTTATACTTAGAAGCTGCGGCAATGGCACACGGAAAAGTATTTTCATTTGGTCCAACTTTCAGAGCTGAAAAATCAAAAACACGTAGAC
+
DDDDDIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIHIHIHIIIIIIIIIIIIIIIIIIIFHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHHIIIIIIIIIIIIHIIIIIIIIIIIIHIIIHIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIHIIIIIIIGIIHHHIHIIIIIGIIIIIIIIGHIHEHIHIIIIHIIIH
@SRR14933407.87/1
GGACGTAAAGCACTTTTAATAGGGGTAACTATTGGACCTGGTAATAACAGACATCACTCAATTTATTCTATCGGCCAAAGAGGTGTTAACCAATTCTTAAAAAACATTGCACCTCAAGTATCGATGACTGATTCAGGTGGACGTGTTAAACCGTTACCAATACAGAACCCAGCATATCTAAGTGATATTACGGAAGTTGGTCATTACTATATCTATACGCAAGACACACAAAATGCATTAGATTTCCCGTT
+
B@@BDIIIIIIGIIIIHCHHFGHI<GHHHFHIIIGHHIIIIIIGHHIIGHH@FHIIHHHHHIIIIIHIIIIGIIIIIGIIIHGHHHHIIIIIIHHIIIHIHHHIHIIHIHIIHIIIIIIIHIIHHHIIGIHIIIIIIHHIIIIIIHFHIIIHIHIIHCEHIIIIIHHIIIIHIIHGHIHGCHIICGHIIEEFIIIIIGHGHIGHCHIHHCHHHHFAHIFHEHDHEHFHIEGHGHHIIEHIG..BBGH?@GG
@SRR14933407.88/1
GAAACAATTATTTGAACATCGAGGGTTACCACAGTTACCTTATATTAGTTTCTTACGTTCTGAATATGAAAAATATGAACATAACATTTTAAAATTAGTAAATGATAAATTAAATTACCCAGTCTTTGTTAAACCTGCTAACTTAGGGTCAAGTATAGGTATCAGTAAATGTAGTAATGAAGTGGAACTTAAAGAAGGTATTAAAGAAGCATTCCAATTTGACCGTAAGCTTGTTATTGAACAAGGCGTTA
+
DDDDDIIIIIIIIIIHHIIHIIIHIHHIIHIGIIIIHIGIIIIIHCHIIIIIIIIIIIIIIIIIIIIIGIIIIIIIGIIHIIHIHIIIIIHIIIIIIHIIICHHHHIIIIIIHHHHIHHHHIIHIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIIIIHHIHIIIIHHHIIIIIIIHHHHHHHGIIIIIIIIIGHIHIIIIIHIIIIHIIIIIIHIGHHIIIIHIIIICHIIHHHHHHCGHHHIIHHIHHG
@SRR14933407.89/1
CTTAACACTTTTGAAAGGAAAAGCCTATTGTGATATCTATTGCAAACGCATTACATTTAATGTTAAGTTTCGGTATGTTTATCGTCACTTTCATTGGTGTAGTAGTCGCAATAATTAATTTAAACAATAAAAAATAACCATCATTTCAAAACTTTGACAACGAATGATGGTTAAAGAAAATCTCATTTAGTCACCGTCTTTTTAACGGGCTCATTAGGCGACATGTTCCTGCATGTCGTCTTTTTACATTT
+
@DD?@11F<<<<F@CC@FHII1D?CG1<DHCHIHHEHFHIGC@@GEG=<CG@@<CGFEHI1FFF1<D<<1<<E0DG?1DCF@11<<CHHHHHHG11C1111<<<<1<0C/<<<C@111CC@@<1<FHEC@GFHDH@<<DHHHCC@H?EFHHHGHEI?C1G@?E0@<<<DGIE0<<FE<DFDFHHIE//DC@<GEHH/<DHE/GH/CE-.:87CHHGE@7-,5,.C...FEHHHH.B.FA8BEEHHCHH?HH
@SRR14933407.90/1
ATAATTTTGTATTAATTGCTTAATAAGTGGTTGTGACATAAAATCTTGTTCAAAACCAGTTGCAACCATAATCTGTTGATATGGAACAGAATCATTTTCAGTGTTAATTAAACCATCACTAATTTGAGTGATAGGTGTTTTATGCACATTTATACGACCATTTTTAATATGTTTTTTAAGGCGTAAGTACAGTTCGTGAGGCATTGATCCTTTATGACGTTCACGTTGTACAATGGCATTTCTTTCAGGCA
+
DDDDDEHIIIIHIHGHHHIGIIIIIGIEHIIIIIGIIIHHGFHIIHIHIIHHIIIHIGIIIIHIIIIIIIIIIIIIIIIIGIHHHEFDFHHFHHHHHHHHHHHHIIIIIIIIIGIIIIIIHIICGEHGHHHHHIEHHIIIIIIIIIIIIIHIHIIIIIIIHHHHIIIHIHHHHIHHHI?CHEHHHHIIIH?FHIII=@@D@EHCHEHGIHGHHHICHHDCCGHHIIIHHHFHIIIGHHFF?BAGG.@@BE.
@SRR14933407.91/1
ATTAGAATCTGGTGCAGAAGGTACGCGTGTAGAAGATACCATGACACGTATTGCAAAAAAACTTGGTTACAGTGAAAGTAACAGCTTTGTTACAAACACTGTCATCCAGTTTACGTTACATTCGGAATCGTTTCCTAGAATATTTAGAATTACCTCTCGAGATACAAACTTAATAAAAATTTCTCAAGCTAATAAAATTTCGCGTCAAATTACAAATAATGAAATTTCTTTAGCCGAGGCAAAAACACAAC
+
DDDDDIIIIIIIIHIIIIIIHIIIIIIHHIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIIIIIHIIIIIIIIIIIHIIIIIIIIIIIIIIIGIIIHIHIIIHHIIIHIIIIIIIIIIHHHIHIHIIIIGIIIIIIIIIIIIIIIIIIGIIHGHIIIIIIIIIIIIIIIIHIIIIIIHHHHIIIIIHHHFHH?HIIIHHHIIHIIIIIIIIIIIIIGHHIGIIHIIIHIHGH@EEHCD,<CHH.BGHIHHHH
@SRR14933407.92/1
GAAGAAGGAAAACTTGAAAAAGAAAAGGTAAGAATTAAAGTCGAACAAGATCCTGGTGCTTTAGGTATGCTAGGTACAAATCAAAATCAGCAAATGCAAGAGATGATGAATCAATTAATGCCTAAAAAGAAAGTTGAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAAAACATGCCCTAATACAAGTGCATAACCACCACCATGC
+
DDDDDIIIIIIIIIIIIIIIIIHHIGHHHIICHIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIHHIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIFHIIIIIIIIIIIIIIHIIIIIIIIGIIIIIIIIIIIIIIIIIHIIIGIHIGIHIIIIIIIIIIIIIIIIIIIIIHIIFFHIIIIIIIIIIIIIIIIIHIIIFHHHHIHIH----....---.-7-..9-.---6--------;---.7.
@SRR14933407.93/1
ACCCCTCATTATAATCATCCTTATTTTCTATTTTTAAAAAGACAATTAGACCGCTCTTTAAACTATAGATTAATACTTAAGTTAAACTCATACATACTGATACCATACGTTAAATCTAACAATTTAAAATTCGTTATAACTATGGATTAAAGAGCTGCCCAACTCATTTAATCCTTAAAAACTTCACATGTGATTGTTTATTAATCCCTCCTTTATCTTATTAAATATCCTTATAACCCTTTTAAAATTAA
+
DDBDDIIIIIIIIIIIIIIIIIIIIHIIIIHHIIIIIIHIIIIIIIIIIHIIIIIIIIIHGEHHHHGHIIGIIHIIIIIIIFHIIIFHHHIIIIIIIIIIIDEHIIIIIIIIIIIIIIGIIIIIIIIICHHIHHHIIIHIIIIIIHHIIIIGIIIIHIIIIIIIIIIIIIIIIIIIIIDHHIIIIIIIIHIIHHIIIIHHEEHHIIHHHHIIIIEGHIHEIHIIGHHHIGHCFHIIIIHHEHHHICEHHHE
@SRR14933407.94/1
TAAACGTTTAGTTAAAGTGCTACTAACGATTTTAGTGATTGTGTTTTCAATTTTACTAGTAGGTGGTTGGTATATTTTTAAAAATGAAGCACCTCGTCCAACAAAGATAGTAGATCAACAAGGTCATACGCTTGTTACAAAAGGCGAGCTAATCAGTGGGCAAGCGATATATGAGAAATATGGGTTAACAGATTATGGCTCCTACCTGGGCAATGGTTCCTACTTAGGCCCAGATTATACAGCAGAGGCAT
+
CDDDDIHIEHHCGHIIIHIIIGIIHIIHHHIHHIIHGHHIIIF<FHHIIGIHIHIHIIIHIIHDGHEEHHIIIGHFHHHIIIIIIIIICHIIIGIIIIIGIGHHHIHHIIIHIHHIIHIHIIHHHHIIIGCEEHIIIIIEHHHHIIIIIDHIFIIIIIIGHIIGIGCHHHIIIIIIIIIIHHEGHIIIIIIIIHIFHHIIHIIIGGIHIIIGIHHHIIIIHIGIIHIIICHIIGHIHIIGCGCCHEFEGEB
@SRR14933407.95/1
CCAATCACAGAATGTAGTGGTAATGAAATATGTCAAGAATGGCTATATCACTTAGGTGTACCAACTGACAAAATTGAAGACTTAGCAAAACATGCATCTAATACGATTCCTGTTTATATGCCATATATTACCTCTTATTTCATGACGCGTGCTATCGGCGACAGACCTTTAGTCGTCCCGCATCAATCACAAAATTTAGCATTTATTGGTAACTTTGCAGAAACAGAGCGAGATACTGTATTTACAACAGA
+
<0DD@EHHEF?1C<CHEF@?FHIIEHH@1<D<DCGHGEH1C@EFHEHHIGEEFHIGHH@E@CHHH1C<FH1CGHE1FH?H?HHIICHIHIHEHHC1@@GHIHHHGGH?GGHIIIIHHCHCHEEHEEFHFHEHEHHIHIIIHIIE?FGCDHHCE@EH=CCEDHC??<<C@0D<DC=HHH/<E-CEHFHIH?@HHIHCFE.@GHFHHHHIEBF.AB....BGHE.B@G.B?H?D78FHF@GFFCEHEHEC@HC
@SRR14933407.96/1
ATCGAGAATCTAGTCTTGATTTTTTAACCTCATTCTCTTCTATAGTATGCTTTAAAAAACGAATATCATTGTTAACATCTGATTGCTCTGACATTAATGTATAGTATTCGTTTTTAATTTCTTCCAATTTTTCATCGTGTGCTTCGTCTGAAACATATAGTTGTTCTTCAAGTTCACGAATGACAGCATTGAGTTCTTTTTGTTTACTTTTCAGAGACTTATAAGTATCTTGAGCTTCAGAAATCTCACTT
+
DDDDDHIIHIIIHIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIGHFHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIGIIIIIIIIIIIIIIIIIHIHHHHIHIIIIIIIIIHIIIIIIIIHIIIIIIHIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIGIGIGHHIIIIIIIIIGHIHIHIIIIIIIIIIHIIIIHIGHHHHHGHIHHIIHIIHGHGIIGHHFHHHHHIIHIE
@SRR14933407.97/1
GATACATTTATTAGAGATTTAGTTATGATGATGTCTGATATTGAAGAGATTAAAAAAGCGACAAAAAAAGATAAGGCTGAAGAAAAACGTGTAGAATTCCACTTACATACTGCAATGAGTCAAATGGATGGTATACCAAATATTGGTGCGTATGTTAAACAGGCAGCAGACTGGGGACATCCAGCCATTGCGGTTACAGACCATAATGTTGTGCAAGCATTTCCAGATGCTCACGCAGCAGCGGAAAAACA
+
DDCDDHHIHIIHHICHFEHH@HEEHIIHIIHIIIHIHHCHHHHHHHCGHGHHGGHIIIH/DCCDHHIIHHHHHCECGH1GHIIHHHIHIHEGHHIIFHIHH<FHHHHHHIIHIFIHHHIHHHIHIHHIHHHIIHIHHEHHHHHHIGHHHHEG@@DHGHF@GGEGHHHIIHICHHHHHIIIHCHHHHGIEHHFHHHHIHHH?@HHFGCF@HHICHIIIGCH?HFEGHHIFH?FHEHGHIIIFFDH?EHGH79
@SRR14933407.98/1
TATTTGAGTGCCATTTATGACTTAGGTGCTAAACGTATTGTTAGTTATGAGCTAGGCCCTTCTAACAATAATCAATTAGTATTTAAAACATTCAATCAAGCGATAGAAAAAGTGGAAAATACAAAAGGTATTTTATTTCATAGTTACCGAGGTTTTCAATATACATCGAAAACATTTAAACATATGTTGGATGAATGTGGCATGATTCAAAGTATGTCACGTGTTAGCAAATGTATCGATAACGGTCCAAT
+
DADCDGH1DHIIHIHIIIIIIIHIGHIHIIIIIIIIIIIIHIIIIIIIIIIIIIFEHHHIHIGIHHIIIHIIIIHFHHIH11<FFHIIIIIIHHIFEHFHEECEHHIIIIIIHEFHHHGHIIHIIIHHHIIIGIIIGHHHHHEHIIIICEEHE@HHGIIHHIHIIIHHHIIGHHEHHHDHHGCHE@HEC0F@0FHHHIHHF/FHC/<FHHHEHGHEHHHFGHHHGCHHIII..7.7C@F?G=.@CHG<FH.
@SRR14933407.99/1
AGAACACCTTCTAATGCAGCGCGAATCATATGTTCTTTTTTATGAGATAAAGTTAAACCGAAGAATGAACCTCTTGCATTTGCGTTCCAAAGCGGTGCACGTTCTCCAGCTAAATAGGGATGGAATATTAAACCATCTGCACCTGGTTTAACACGCTTTGCAATTTGAGTTAAAACATCATAAGGATCAACACCGAGACGTTTCGCAGTTTCGACTTCGCTTGCTAACAACTCATCGCGCAACCATCTCAA
+
DDDDDIIIHHHGDHHEHHI@FHIDCHIEEGHHE@HIHIIHIFHHHIHIIIIIHIHHIIII<EHHIIIHFHEHHFHHFHIFH11FFHE@FHEE?EHGHGHHIGHGH@HIIIICHHHHHHHH?CEHHIIIIIGEECHHHEHHIHIHGGGFEFHFHHCHHGHHEHHIIHHHIH?C@FHFHF?CHHHHFFE:GFHCG@?HHGIHHHEF@HHH@HIHHF=G=EHCHHHCCCB..;BEGHGC-5><HC,@ACB-9BB
@SRR14933407.100/1
CTGCTCTAAAATTTCTGTTTGTTGTCCTTCATGTAAAATTGCAATGCAGTTTCGTGTTTCACCCTTAATGTTATAAAATGCATGCTTGATGCCGGCATGATCTAATTTTTTAGCAATAAATTGACCTAATTCACCGCCAATAAAACCACTCGCAAGGACTGGCTCACCTACTTGCGCAAGTACTCTTGTTACATTTAAACCTTTACCACCAGCTGTTTTACTTACTTCTTGAACACGATTAACATCATCTA
+
DDDDDIIIIIHHIIIGHHCHIHIIIIIHIGIHIIGIGHIIHIIIIHIIIEHIIFHIFHIIEHHIHIIIFHHIIIIIIIIHGHIIIHIHIIGIIIIIHHIIIDGIIIIIIIIIIIIIIIIIIHIGHHIIIIIIIIIIHIIIIIIIIGIIIIIIIIIIIIIIIHFHIIIIIHHIIHIGHICHHEHHIHFEHIIC?GHHIHIIHHIIG.FHHEF@AHHH?GHFHHHFEHHCEHHEHHHHGHHHHIIGBGHHHGC
@SRR14933407.101/1
GGTAATCCCATAACTTCAGTTAATTCATAATCAACGCCACTCTCTTCAAGAGTCTTTTTAATAGTATTTTTAAATTGCTTTAATGGATATACACTTGCGTTTGTACATAATAATAATGTTCCTTCAGATGATAAGATATTTAAGGCGCCAGTAATTAATTTGTCATAATCTTTTTGCACTGAAAATATACGTTTTTTATTGCGTGCAAAGCTAGGTGGATCAATCACGATCGTGTCATAACTATGTCCATG
+
DDDDDIIIIIIIIHIIIIIIIIHIIIIIIIIIIIHIIIIHIIIIIIIIGHHIHIIIHHIIIFHHHII1DFHIHCFHIIHG@GHIIIGEHHHIIHHIHHIHHDFDHHHIIIHHHIIIIIIIIIIIIIIIIIHIIIIIIIHIHIIIHHIIIHIIGIIIHIGHHIIIIHICHIIIIIIHHHIIIHHIHHIHHFGDHHHIIIIICHHHIHHGFHHIGFFHHHHIIHHIIIHIBH?C@EHHIIIIIGIIFH?FH?.
@SRR14933407.102/1
CCGGAAAAATATTTAAATGTTGGTATTCAGGTATTAAATCAATAACGGTTTTTGAATGATGATCTTCAGTTGCCGATATATAACCTTTTGGCTTATTTAACATAATATAGACATTTTCAACGTATTCTATTAATTCTCCACGAACTGATATCTTATCGTTTTCTGGTTCTATATGTGTTTTTGGTGATTTAATTACTTGTTCGTTGACATTTACAAGGCCTTTTTTAAGTAACTGTTTGACCTCATTACGT
+
@D@BDIHHHIEEECG?F<C<<FF0DG<EEHEDH1FH?@H1<@GHHHHIDCHHFIHHHHCCGHEHIIICGEHIEGH/</CEC1CF1DDEFHDH111DDGFGH1CFCCDHFEHHHHHIHH?<CCHCGE@1<<C<CC1FHEHCC@EC<@CCDEEGCEF@EEFF11<CFC<1DFH@0D<CG<D<D<EE@GFF/</<DFEHII/F@GHIIGDHC?GFC@@HHCH?@DGHED?H?G?GHHHBG@FFHEEHHIHGC.7
@SRR14933407.103/1
TAACGGTCCAAATTCGGAAACATGTAACTGATGAACTTTTGGTAGTCCTATACGAATATAACCGTATCCGATGATTACCATAAATATGCCCAGTGTCATAATAATGTATTGATTTAAACGATCTTGCATAACACGTTTAAATCGCTTCGTAGTAAACTTTTCAAATTGTCGAGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGCTCATTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAAAA
+
DDDDDIIIIHIIIIIIIHIHHIIIIIHIHIIIIIIIIIIIHHHIIIIIHHIIIIIHHIIIIIIIIIIIIIGHIICGHIIIIGHHIIIIIIIIHEHFHHIHHEHHHFHIIIIIIIIGIIHIIIIHIIIIIIIIIIHFHHIIIIIHIIHHHHIIIIIHIHHHIIHHHIIIIIHHI0EHGGDHHGH/GHIIGIIHIHIHIHHE@FHHIHHHHIHIFH7..BGHIHHIIHHGFH@.8@.6BE@EAHDHH?D,--7
@SRR14933407.104/1
GCAGCGATTTTACAAGTAGAATCAATCGTTAAAAAGCCAGTAGTAATTAATGATATGATTGCAATTCGAAGTATGGTTAATTTATGTATTTCAATTGATCATCGTATTTTAGATGGTTTACAAACAGGTAAATTTATGAATCATATTAAACAGCGTATCGAACAGTATACTTTAGAAAATACAAATATATATTAGTGATAACATAGATGCATCTATCGACAACTTGTTTTATCTTGTTCTTGTCGATGGAT
+
DDDDDHDHIIIIIIIHIIIIIIIIIIIIHHHIHIIHIIHGIIIHHIIIIHIIIIIIIIGIIIIIHIIIIHHIHHHIIIIIIGHIHHIIIIIIIIIIIIIIIGHIIIIIIIIHIIIIIIIIIIIIIIIIIIIIIIIIGIIIIIIIIIIIHIHIHIIIIIICHIIIHCHIIHHHIIIIHIIIHIIHHEHIIIIIHHIIIIIHIIIIIIIIEHIHEHIGHHHHHIHHHGHEHHHHHGIGHIIIIHIIIICGHH?
@SRR14933407.105/1
CCCGTTTTATAATTACTATTGTTGTAAAAAAGGTTAGCTAAGCTAACTATTTTGTCTTAGGAGATGTCGCTATGCTATCACAAGAATTTTTCAATAGTTTTATAACAATATATCGCCCGTATTTAAAATTAACAGAGCCTATTTTGGAAAAACACAATATATATTATGGTCAATGGTTAATCTTACGCGATATCGCTAAACATCAACCCACTACTCTCATTGAAATTTCACATAGACGTGCAATTGAAAAG
+
DDDDDHIIIIIIHHIIIIIIIIIIIIIIHIIHIHIIIIIIIFHHIIIGIIIHHEHHIHIIIIIIIIIIIIIIIIIIIIIIGIHFHIIIIIIIIIIGI@?FHDGHIIIIIIIIIIHIIIIIIHHIIIHHHIIIIIIIIIIGIIIHIIIIIIIIIIIICCFHIIIIIIIIHIIIIIIGGHIICHIIIIIIHIDHIIGIIHIHHHHHIIIIHIIHIHIIIHIFHEGHGIIIGIIEHIIHIGHHHIHII?EHIIA
@SRR14933407.106/1
GAAAGCTTCTGAAATAAATTCAGCTTTTGAAAAGATGAATAATGGCAATGTCTTTGTTGGTATTGTAATAGGTTTAATAGCAGCTTATGCATACAATAAGTTTAGTGAAACAGAATTACCATTAGCATTATCATTTTTTAGTGGTAAACGTTTAGTTCCAATTATGACTGCATTTTACTGTACATTTTTAGTTGTCATATTGTTATTCTTATGGCCACTACTTTATTCATGGATTGTAAAATTTGGTGAAT
+
DDDDDIFHIIIIIIIIIIIIIIIIIIHIIHIIIIIIHIIHIIIIIIHHIHEHIHHIIIHIIIEHIIIIIIIIGHIIIIIHHIIIIIIIIIIHIIIGGHHIEFHHHIIIIIIIIIIIIHIIIIIHHIIIIHIIIIIIIIIIIIGIHIEHHIHIIIIIIIIIHIGIHEHHHIIIIHHHIIIIIIIIIIIHIIIIHHIIHIHHHIIII0CHHIHI?HHCHIIIIIIIHHGF?GFFHHIEHIGHIHIIIEH@HGE
@SRR14933407.107/1
CATATGTCTTCGCAATAATATTCCCCATCGTTTCTCCGAGGTTACTTCCAACTGCCTCTAAGTAGTGGTCCGGTTGTCTTAATTCCAAATCATCAAAGAAAATTTGATTCAATGTATAATCATAATTTTGACCAGTGTGTACTACTATCTGATTAAAATATTGATCACATGCTTTAATCGTTGATGATAAACGAATGATTTCAGGCCTTGTACCAACTATTGTAATTAATTTTAGTTTTTCCTTGCGCTAT
+
<<0<D1DDCHCCGHCHIE?GH@HE?FHHHE<C<CCCFHIIE0@CEEHIHEHHII@@11CE<FHH?H11DHEHIHGHHEE?HHIIG1DGHH1CEEHCEHHHHHHEE1F@<CGHFFHFFD?FCHEE@EFG@EHIH@GCEE1<<DGH1<CG?1@HH?HFFCGCFE1CH@GHHHCHHHGH<DD0<0<C?0CCGCH?/@@C@/<//DG/?/<CDG@GEHHICEHH.7:..;DF?G7.CGCG8.BEG..9..,5?-9

After today, you should have a better understanding of

Assessing sequencing data quality

FastQC

FastQC provides essential insights into sequencing data quality

Identifies issues affecting sequencing data reliability

Helps determine preprocessing steps before downstream analysis

Per Base Sequence Quality assesses the accuracy of base calls

Plot Description:

  • X-axis: Base position in the read.
  • Y-axis: Phred quality scores (higher scores = better quality).
  • Boxplot features: Median (red line), interquartile range (yellow box), 10-90% range (whiskers), mean (blue line).

Key Observations:

  • Early bases may show slightly lower quality due to priming biases.
  • Quality often decreases at the 3' end due to signal decay and phasing issues.

Per Base Sequence Content checks nucleotide composition bias

Plot Description:

  • X-axis: Position in the read.
  • Y-axis: Percentage of each nucleotide (A, T, G, C).

Key Observations:

  • A random library should show equal A-T and G-C proportions, with parallel lines across
  • Biases across the entire read indicate contamination or library preparation issues.

Per Base N Content identifies ambiguous base calls

Plot Description:

  • X-axis: Base position in the read
  • Y-axis: Percentage of N calls

Key Observations:

  • High N content near the end of reads suggests sequencing issues like signal decay.

Interpretation:

  • Acceptable: Flatline near 0%.
  • High N content (>5%): Reads may require trimming or removal.

Per Sequence GC Content identifies abnormal nucleotide composition

Plot Description:

  • X-axis: GC content percentage.
  • Y-axis: Number of reads with the given GC content.

Key Observations:

  • Normal distribution with a single peak indicates uniform GC content.
  • Bimodal or skewed distributions suggest contamination or systematic bias.
  • Single peak at genome-specific GC content is expected

Sequence Duplication Levels highlight library diversity

Plot Description:

  • X-axis: Duplication level (number of times a sequence appears).
  • Y-axis: Percentage of sequences.

Key Observations:

  • High duplication levels suggest overrepresentation due to PCR artifacts or highly abundant transcripts.

After today, you should have a better understanding of

Cleaning and preprocessing sequencing data

fastp

After accessing the quality of sequencing data, we can preprocess reads to correct some issues

Not all errors in sequencing data are correctable

Correctable Issues:

  • Adapter contamination: Removed with trimming tools like Fastp.
  • Low-quality bases: Corrected or trimmed based on quality scores.
  • Read duplication: Resolved by deduplication to ensure unique data.

Non-Correctable Issues:

  • Systematic biases or overclustering in sequencing runs.
  • Severe signal decay or phasing errors.
  • Insufficient coverage or failed experiments.

Quality filtering removes unreliable bases and reads

Bases with low Phred scores are more likely to have sequencing errors.

Reads are often filtered based on:

  • Minimum base quality thresholds (e.g., Q20 or Q30).
  • Average read quality.
  • Proportion of ambiguous bases (N calls).

Adapter trimming removes contamination from sequencing adapters

Adapters are artificial sequences added during library preparation.

Suppose our sequencing read length is this long

If the library insert is shorter than the read length, the adapter sequence is read.

Is the adapter included in the sequencing read?

No

Yes

Deduplication ensures library diversity by removing redundant reads

Source of Duplicates: PCR amplification biases.

Identifies and removes identical sequences, ensuring unique data.

After today, you should have a better understanding of

Assessing sequencing data quality

Galaxy Platform

Before the next class, you should

  • Finish and submit P01A by Friday, 11:59 pm
  • Start working on P01B (will be released Friday, Jan 17)
  • Start working on CByte 01 (will be released Friday, Jan 17)

Lecture 03A:
Genome assembly -
Foundations

Lecture 02B:
DNA sequencing -
Methodology

Today

Tuesday

BIOSC 1540: L02B (Sequencing)

By aalexmmaldonado

BIOSC 1540: L02B (Sequencing)

  • 78