garbled fastq files after Filter and trim #1930

Danyang1111 · 2024-04-15T17:31:23Z

Hi,
I am testing ITS sequencing data that are publicly available in the NCBI Sequence Read Archive (SRA) under BioProject ID PRJNA610042, following the DADA2 ITS Pipeline Workflow (1.8).
The primers are F: TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG and R: GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG.
After the Filter and trim step, some of the output fastq files contain garbled characters. I can still run the following steps and get taxonomic assignments.
Why is this happening, and how should I deal with it? If I want to inspect the filtered files, how can I fix these garbled fastq files?

Thanks.

benjjneb · 2024-04-15T18:12:24Z

By default, filterAndTrim(..., compress=TRUE), so gzipped fastq files are output irrespective of the filenames (and extensions) you assign to the filtered fastqs. The fastq input/output functions in dada2 (all taken from the ShortRead package) autodetect and uncompress those files, so everything works as expected. But if you open one in a plain text editor, it will look garbled because it is in a compressed format.

You can "fix" this by setting compress=FALSE, by uncompressing the files yourself before viewing, or by viewing the files with something like zmore, which shows you the uncompressed output.
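
For anyone who wants to check this outside R, here is a minimal Python sketch (not part of dada2 or ShortRead) that tests whether a file is gzip-compressed regardless of its extension and prints its first few records either way. It relies on the fact that gzip streams begin with the bytes 0x1f 0x8b. The function names `is_gzipped` and `read_fastq_head` are hypothetical helpers introduced only for this illustration.

```python
import gzip

GZIP_MAGIC = b"\x1f\x8b"  # the first two bytes of every gzip stream


def is_gzipped(path):
    """Return True if the file starts with the gzip magic bytes."""
    with open(path, "rb") as handle:
        return handle.read(2) == GZIP_MAGIC


def read_fastq_head(path, n_records=2):
    """Return the first n_records FASTQ records (4 lines each) as text lines.

    The file is opened with gzip if its content is compressed, no matter
    what extension the filename carries; otherwise it is read as plain text.
    """
    opener = gzip.open if is_gzipped(path) else open
    lines = []
    with opener(path, "rt") as handle:
        for _ in range(4 * n_records):
            line = handle.readline()
            if not line:  # reached end of file early
                break
            lines.append(line.rstrip("\n"))
    return lines
```

Calling `read_fastq_head("path/to/filtered.fastq")` (a placeholder path) on one of the "garbled" files should return readable FASTQ lines if the file is in fact gzipped, which would confirm that the content is intact and only the extension is misleading.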

benjjneb closed this as completed May 27, 2024
