How to download human reference transcriptome (hg19)
1
2
Entering edit mode
6.9 years ago

As the title. I do not known how to download human reference transcriptome. Who can help me.

Is the website http://hgdownload.soe.ucsc.edu/goldenPath/hg19/?

RNA-Seq next-gen sequencing • 15k views
ADD COMMENT
0
Entering edit mode

Hi! Follow up to this thread - has any managed to download a matching .gtf file? I am trying to download a transcriptome for use in creating .bam files in kallisto to be viewed using IGV, but the .bam files I'm producing don't show any reads when I upload them to the program, and I suspect it's because the .gtf and .fa files were downloaded from different places.

ADD REPLY
0
Entering edit mode

Matching GRCh37 human GTF file can be found here.

ADD REPLY
2
Entering edit mode
6.9 years ago
h.mon 35k

There are several versions of the human transcriptome. One often used is ENSEMBL, to get protein coding genes and non-coding RNA:

wget ftp://ftp.ensembl.org/pub/release-89/fasta/homo_sapiens/cdna/Homo_sapiens.GRCh38.cdna.all.fa.gz
wget ftp://ftp.ensembl.org/pub/release-89/fasta/homo_sapiens/ncrna/Homo_sapiens.GRCh38.ncrna.fa.gz
ADD COMMENT
0
Entering edit mode

Thanks very much.

I am reading a paper. It written "a human reference transcriptome derived from hg19 build of human genome" and "This transcriptome contains 214294 transcripts and occupied 96446089 bytes as a gzipped FASTA file". Maybe the version your provided is not identical the paper mentioned?

ADD REPLY
1
Entering edit mode

The links provided by h.mon state CRCh38, which is indeed not the same as hg19.

ADD REPLY
1
Entering edit mode
ADD REPLY
1
Entering edit mode

Most certainly it is not, but I would have to know the folder where the transcriptome is saved to be certain for sure.

"A human reference transcriptome derived from hg19 build of human genome" and "this transcriptome contains 214294 transcripts and occupied 96446089 bytes as a gzipped FASTA file" are only moderately useful to describe a transcriptome. But if the manuscript you are referring to is this paper, then it doesn't mater because:

1) the version of the transcriptome is unlikely to play a substantial roe in their results 2) you can use any different human transcriptome to encode / decode fastq files, provided you use the same transcriptome for both encoding / decoding the same files.

ADD REPLY

Login before adding your answer.

Traffic: 3310 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6