Using UCSC Genome Browser with results from NCBI or Ensembl assemblies
2
0
Entering edit mode
6.3 years ago
colin.kern ★ 1.1k

If I want to display some data on the UCSC genome browser, but have done the analysis with the Ensembl or NCBI assemblies and annotations, do I need to manually write scripts to replace all the chromosome IDs with the UCSC genome browser ones? It seems like this would be a very common situation that has an existing solution.

Alternatively, I could do my analyses from now on using the UCSC assembly and annotation files, but that seems to have some issues. The annotation is only distributed as a SQL database. I have looked at using the Table Browser in UCSC to get a GTF file, which is what most tools use, but the GTF files I can get have a lot of missing information. They only contain stop_codon, start_codon, exon, and CDS elements. Can I get it to have transcript and gene entries also? It also only gives the gene and transcript IDs, whereas the GTF files you can get from Ensembl or NCBI have a lot of useful information such as gene symbol and biotype.

I think I might be missing something because it seems like I'm having to jump through a lot of hoops for something that I suspect a huge number of people have dealt with before.

ucsc browser ncbi ensembl • 1.7k views
ADD COMMENT
1
Entering edit mode
6.3 years ago

do I need to manually write scripts to replace all the chromosome IDs with the UCSC genome browser ones?

yes and Devon's work wil help you : https://github.com/dpryan79/ChromosomeMappings

it seems like I'm having to jump through a lot of hoops

sed is your friend

ADD COMMENT
0
Entering edit mode

Thanks. Do you know what resources were used to make those mappings? I work with farm animal species and none of them are on that Github.

ADD REPLY
1
Entering edit mode
ADD COMMENT

Login before adding your answer.

Traffic: 2573 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6