How do I convert LINCS/CMap IDs to ChEMBL/PubChem IDs?
1
1
Entering edit mode
7.0 years ago
enricoferrero ▴ 900

I have some IDs from LINCS/CMap (e.g.: BRD-A00267231) that I need to convert to either ChEMBL or PubChem IDs.

I usually use UniChem for this kind of stuff but BRD identifiers are not recognised. BRD identifiers are used in most (all?) LINCS resources such as clue.io, Harmonizome, L1000CDS2 and LIFE.

Any idea on how I can programmatically convert these IDs?

Thanks!

chembl pubchem lincs cmap unichem • 3.0k views
ADD COMMENT
1
Entering edit mode
7.0 years ago

I'm afraid the answer is "with great difficulty, if at all". People in my group tried doing that, and the only way they found was to map via canonical SMILES.

ADD COMMENT
0
Entering edit mode

And yet on LIFE I can see BRD to PubChem IDs mappings so it must be possible to do that (without having to go through SMILES):

enter image description here

ADD REPLY
0
Entering edit mode

It is quite possible that they did the mapping via SMILES too, though. That is a pretty common approach when combining multiple sources of chemical compounds.

ADD REPLY
0
Entering edit mode

Can you please point me to the method you used to map these IDs to SMILES? Thanks

ADD REPLY
0
Entering edit mode

There is sadly no code that I can share with you. What we did was to download structure files associated with each of the two sets of identifiers (e.g. .sdf files). We then used Open Babel to construct canonical SMILES for all the compounds from both resources and compared the canonical SMILES between the two databases to construct an identifier mapping.

ADD REPLY

Login before adding your answer.

Traffic: 1539 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6