EZsplit Instructions
EZsplit extracts mitochondrial protein-coding genes from annotated genome files and generates separate FASTA files for each gene.
- Reads genomic sequences from a FASTA file
- Parses gene coordinates from a GFF3 annotation file
- Selects features annotated as
protein_coding - Matches gene names to standard mitochondrial gene synonyms
- Extracts gene sequences using annotation coordinates
- Reverse-complements genes on the negative strand
- Creates one FASTA file per gene
- Generates a
missing_genes.txtreport
-
Provide a genomic FASTA file
(
.fasta, .fa, .fna, .fas) -
Provide the corresponding GFF3 annotation file
(
.gff, .gff3) - Ensure record IDs match between FASTA and GFF
-
Only
gene_biotype=protein_codingfeatures are extracted - Gene names must match mitochondrial gene standards