How do I get a GFF file
0
2
Entering edit mode
4.8 years ago
apl00028 ▴ 90

I need GFF file from my vcf file but I do not how to get them or for what? What program should I use?

Thanks in advantage.

snp • 3.6k views
ADD COMMENT
1
Entering edit mode

I need GFF file from my vcf file

As posed this question does not make a lot of sense. VCF files describe polymorphisms compared to a reference, where as GFF files are used for describing gene models/annotations. They are not directly inter-convertible.

What exactly are you trying to do?

ADD REPLY
0
Entering edit mode

You are right, I am sorry. I trying to analyze my polimorfism data using this commands in R:

            chrom <- create.chromR(name='Supercontig', vcf=vcf, seq=dna, ann=gff)

For do that I need the vcf file, the reference sequence and the gff file of my vcf file, but I do not how to get that.

I got that code from here: https://cran.r-project.org/web/packages/vcfR/vignettes/intro_to_vcfR.html

ADD REPLY
1
Entering edit mode

the gff file of my vcf file

You actually need a GFF file for the reference genome you used to create that VCF file. Generally recommended that you get GFF from the same source where you got the reference from, so there are no discrepancies with chromosomal identifiers etc.

ADD REPLY
0
Entering edit mode

which program should do I use for that?

ADD REPLY
1
Entering edit mode

What genome are you working with and where did you get the reference sequence from?

ADD REPLY
0
Entering edit mode

My reference sequence I got for an assembly of my data

ADD REPLY
1
Entering edit mode

You probably have some additional work to do then. Get the RefSeq assembly for this virus from NCBI. You will also find a GFF file in that same directory.

Compare your assembly with the reference and depending on changes you see you could modify the GFF file available for RefSeq genome and use that.

ADD REPLY
1
Entering edit mode

You need to tell us more about your data first. We need far more information.

What organism? Where did you get the VCF? How was the VCF derived? From what sequence?

ADD REPLY
0
Entering edit mode

The organism is Cucumber Mosaic Virus RNA3, I got this vcf file from bam file using these functions:

samtools mpileup -g -f scaffold_nofilt_PV014_OD_DB.fa scaffold_nofilt_OD_PV067.bam >scaffold_nofilt_OD_PV067_q10_variant.bcf

bcftools view -v -c -g scaffold_nofilt_OD_PV067_q10_variant.bcf > scaffold_nofilt_OD_PV067_q10_variant.vcf
ADD REPLY
0
Entering edit mode

So this is a genome you de novo assembled?

ADD REPLY
0
Entering edit mode

Yes, using IDBA_UD program

ADD REPLY
0
Entering edit mode

Hi, are you the same person as the user apl0028?

ADD REPLY
0
Entering edit mode

No, I work with him. He can not reply to you because he spent all messages per day

ADD REPLY
0
Entering edit mode

I see. We will figure out if it is possible to remove this comment limit restriction.

By the way, if you've been up-voting his post and comments, please understand that such upvoting is not encouraged. Upvotes are for content with inherent objective value only, and almost none of apl0028's post/comments seem to have that value. In case such upvotes were given out, please go back and undo them.

ADD REPLY
0
Entering edit mode

Please, could you answer to me? How can I get this file?

ADD REPLY
0
Entering edit mode

genomax gave you an answer, see his comment up above.

ADD REPLY

Login before adding your answer.

Traffic: 3156 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6