Input format: genbank
The GenBank or GenPept flat file format.
Output format: fastq-illumina
FASTQ files are a bit like FASTA files but also include sequencing qualities. In Biopython, 'fastq' refers to Sanger style FASTQ files which encode PHRED qualities using an ASCII offset of 33. See also the incompatible 'fastq-solexa' and 'fastq-illumina' variants.
How to convert from genbank to fastq-illumina ?
You can also convert between these formats by using command line tools.
On Windows install WSL, on
Mac
or Linux start
terminal
Install BioPython
Run following script:
Or you can use this site as online genbank to fastq-illumina converter by selecting your formats &
file.
Sequence Converter Home page
from Bio import SeqIO
records = SeqIO.parse("THIS_IS_YOUR_INPUT_FILE.genbank", "genbank")
count = SeqIO.write(records, "THIS_IS_YOUR_OUTPUT_FILE.fastq-illumina", "fastq-illumina")
print("Converted %i records" % count)