GFF3 to fasta (error)
1
1
Entering edit mode
5.8 years ago
wocana ▴ 20

Hi im trying to extract some sequences from my GFF3 file to fasta but when I run this:

$   bedtools getfasta -fi Muschr4.fsa -bed Muschr4.fsa.mod.pass.list.gff3 -fo ERVCom.fa

I get

WARNING. chromosome (seq0) was not found in the FASTA file. Skipping.
WARNING. chromosome (seq0) was not found in the FASTA file. Skipping.
WARNING. chromosome (seq0) was not found in the FASTA file. Skipping.
WARNING. chromosome (seq0) was not found in the FASTA file. Skipping.
gff3 fasta • 1.5k views
ADD COMMENT
1
Entering edit mode

Do chromosome identifiers match in fasta and GFF files? That is the first thing to check. These are warnings so they may be benign.

ADD REPLY
1
Entering edit mode

Paste the output of these commands here

grep ">" Muschr4.fsa | head -10

AND

grep ">" Muschr4.fsa | grep "seq0"
ADD REPLY
0
Entering edit mode
intirules@Biointi:~/Desktop/Tesis/Secuencias/PruebaRetriever$ grep ">" Muschr4.fsa | head -10

>NC_000070.6 Mus musculus strain C57BL/6J chromosome 4, GRCm38.p4 C57BL/6J

intirules@Biointi:~/Desktop/Tesis/Secuencias/PruebaRetriever$ grep ">" Muschr4.fsa | grep "seq0"

intirules@Biointi:~/Desktop/Tesis/Secuencias/PruebaRetriever$

Then i tried with bed but still no working

ADD REPLY
1
Entering edit mode

It looks like you have things named seq0 in your GFF file but not in fasta file. Can you confirm by grep "seq0" your_gff | head -5

Please use ADD COMMENT/ADD REPLY when responding to existing posts to keep threads logically organized.

ADD REPLY
0
Entering edit mode
5.8 years ago
wocana ▴ 20
awk '{if ($1~/[0-9]+/) print $10"\t"$1}' Muschr4.fsa.mod.pass.list > Muschr4.fsa.mod.pass.list.extract
perl $path/LTR_retriever/bin/call_seq_by_list.pl Muschr4.fsa.mod.pass.list.extract -C Muschr4.fsa.mod > Muschr4.fsa.mod.pass.list.extract.fa

This solved it

ADD COMMENT

Login before adding your answer.

Traffic: 1583 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6