Ncbi Esearch :Searching For Snp/Genes/... At In A Given Segment Chr:Start-End ?
3
2
Entering edit mode
13.9 years ago

Can NCBI-ESearch be used to search some objects in a given genomic segment ?

For example, how can I find all the IDs for the Genes or SNPs on Human/build36/chr1:100000-200000 ?

Update: After searching in the DBSNP doc, I found that the following query returns the Human SNPs on chr7 between 88556398 and 88580839.

curl 'http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi?db=snp&term=%287%5BCHR%5D+AND+88556398%3A88580839%5BCHRPOS%5D+AND+txid9606%5BORGN%5D%29%0A'

It also seems to work for the genes.

curl 'http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi?db=gene&term=%287%5BCHR%5D+AND+88556398%3A818580839%5BCHRPOS%5D+AND+txid9606%5BORGN%5D%29%0A'

but I didn't find a way to specify the genome-build-id.

search ncbi eutils • 5.1k views
ADD COMMENT
4
Entering edit mode
13.9 years ago
Paulo Nuin ★ 3.7k

I don't know about E-Search, but BioMart can do it. Click here.

ADD COMMENT
0
Entering edit mode

Yes, thanks Pedro. I can also use BioDas:UCSC, Biodas:Ensembl, mysql etc... But I wish I could get a tool integrated with the NCBI data.

ADD REPLY
0
Entering edit mode

@ Pierrie : I know gene number varies between the three databases. Is there any significant differences between the SNP recorded in NCBI, UCSC and Ensembl.

ADD REPLY
0
Entering edit mode

Pedro Paulo, all the same thing :-). Anyway, I'm not aware of a way to do with NCBI tools, maybe something in the toolkit might help then.

ADD REPLY
3
Entering edit mode
13.9 years ago

I only asked once the dbSNP helpdesk about archiving, regarding information mapped to previous human genome builds, and what they told me at that time (it was on 2008) was that they were only able to maintain I single human genome build on their systems, and that if I needed older versions I would have to query them through the UCSC browser. as you may imagine, this was not very helpful.

I know this is not exactly answering your question, and that there is already an answer citing Ensembl's Biomart, but IMHO that's the best site to place such queries. in fact, there are 3 main reasons why I would highly suggest you to consider working with Ensembl to retrieve this kind of data:

  1. they DO maintain an archive of all previous builds, so you may always be able to trace all your work, at any time in the future
  2. their API not only is robust, but also it is very well documented
  3. they keep track of all "external" information they mirror, so even if they assign their own names to genes and variants, you are always able to find your NCBI coded information

it's not that I am an Ensembl fan against NCBI, but I've found out that when you need to dig really deep into the querying system you find more answers on the european site rather than on the american. but as you are a very experienced user, I guess you will already have another opinion. I just wanted to point this out, in case it helps any other reader coming to this thread looking for advice ;)

ADD COMMENT
2
Entering edit mode
13.9 years ago
Neilfws 49k

I'm reasonably sure that EUtils doesn't handle segment queries, just because I've never seen any documentation or code examples. Either BioMart or UCSC would be the "standard" way to approach this task. Their relational database structure makes it very easy.

I'd guess that it's possible to hack together a solution; perhaps based around the Map Viewer data, or by joining datasets from multiple EUtils queries, but it might not be the most efficient approach.

ADD COMMENT

Login before adding your answer.

Traffic: 2439 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6