Why does FASTQC show unexpectedly high sequence duplication levels (PCR-duplicates)?

Posted on July 22, 2020 by Lutz Froenicke

FASTQC is primarily designed to QC whole-genome shotgun sequencing data. Importantly, its analyses are significantly limited because it works only on single reads rather than read pairs. As a consequence, FASTQC tends to generate unnerving warnings for multiple Illumina…

When should I trim my Illumina reads and how should I do it?

Posted on July 8, 2020 by Lutz Froenicke

Should I trim adapters from my Illumina reads? This depends on the objective of your experiments. For counting applications such as differential gene expression (DGE), RNA-seq analysis, ChIP-seq, or ATAC-seq, read trimming is generally no longer required when using modern…

Where can I find the UMIs in the Tag-Seq data? When and how should I trim my Tag-Seq data? What is the low complexity stretch in the Tag-Seq data?

Posted on May 9, 2019 by Lutz Froenicke

By default, we will generate Tag-Seq and Batch-Tag-Seq gene expression profiling data that incorporate Unique Molecular Identifiers (UMIs) in the sequence reads. (This FAQ provides information on the usage of UMIs: https://dnatech.genomecenter.ucdavis.edu/faqs/should-i-remove-pcr-duplicates-from-my-rna-seq-data/). Please note that the UMIs provide optional…

Should I remove PCR duplicates from my RNA-seq data?

Posted on November 6, 2018 by Lutz Froenicke

The short, general answer to the question “Should I remove PCR duplicates from my RNA-seq data?” is, in most cases, NO. For some scenarios, de-duplication can be helpful, but only…

Which data will I receive from the PacBio Sequel II sequencer? Will they have quality scores?

Posted on April 30, 2018 by Lutz Froenicke

We will deliver the complete data set generated by the PacBio Sequel to you securely via Bioshare. For push-button type secondary analyses (combining data for up to 2 SMRT cells, e.g., for demultiplexing, CCS, long amplicon, or IsoSeq analysis) we can run these…

What data will I receive for Illumina sequencing? Demultiplexing, Trimming, Filtering

Posted on December 9, 2017 by Lutz Froenicke

By default, you will receive gzip-compressed FASTQ data as individual files for each sample (demultiplexed). Demultiplexing is included in the service if you provide us with the barcode sequences on the submission form. The files will be available for download…

How do I download my sequencing data?

Posted on December 23, 2016 by Lutz Froenicke

We deliver sequencing data via two portals: SLIMS for Illumina data, and BioShare for PacBio and Nanopore data. Both portals offer secure access to the data and support several download protocols. The emails that will notify you about new sequencing…

Which strand is sequenced for my strand-specific RNA-seq data?

Posted on December 23, 2016 by Lutz Froenicke

Strand-Specific RNA-Seq Libraries. RNA-Seq (conventional) after Poly-A enrichment or ribodepletion: By default, we generate strand-specific RNA-seq libraries. Strand-specific (also known as stranded or directional) RNA-seq libraries substantially enhance the value of an RNA-seq experiment. They add information on the originating…

My FASTQ file contains some “N”s. Is there a problem with my data?

Posted on October 14, 2016 by Lutz Froenicke

Please note that when opening an Illumina FASTQ file, it is expected that the first few thousand reads are of comparatively low quality and frequently contain “N”s. An “N” means that the Illumina software was not able to make…

How should the miRNA/small-RNA data be trimmed?

Posted on May 31, 2016 by Lutz Froenicke

We use the PerkinElmer NEXTflex™ Small RNA-Seq kit to generate microRNA and small RNA-seq libraries because it significantly reduces sequence-specific biases in the library preparation. For this purpose, the adapter oligonucleotides contain 4 randomized bases at the ligation…
