How to create EGFP GTF file?
0
1
Entering edit mode
6.0 years ago
Payal ▴ 160

Hi,

I am analyzing RNA Seq data with EGFP. I need to get the count of EGFPs for samples. So I am thinking to align the fastq files to the EGFP sequence concatenated to the reference genome and then get the counts using HTseq counts. I have the EGFP.fa file, but how to create the EGFP GTF file as thats required for Htseq counts and for annotation!!

EGFP Sequence:(Is this ok?)

>EGFP
ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAA
GTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGC
TGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAG
CAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTA
CAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGG
ACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAAC
GGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACAC
CCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACG
AGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA

GTF: (Is this ok?)

EGFP    EGFP    exon    1   720 0.000000    +   .   gene_id "EGFP"; transcript_id "EGFP";

Thanks, Payal

RNA-Seq • 3.9k views
ADD COMMENT
1
Entering edit mode

If your EGFP.fa file looks like this

>EGFP
AGCTCAGATCGATAGCTACG

then make your GTF file to have a line at the end with something like this (chromosome name EGFP needs to match in both, add right length of sequence in place of 1000)

EGFP    unknown exon    1   1000   .       +       .       gene_id "EGFP"; gene_name "EGFP"
ADD REPLY
0
Entering edit mode

This helped me a lot.

Just another update on this: You'll need to add "transcript_id" to the GTF file to get it to work.

ADD REPLY

Login before adding your answer.

Traffic: 2524 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6