I have some methylation data and need to find the corresponding Illumina IDs. The information I have available is the chromosome and the start/end/strand information.
I need something like "cg123456789", and what I have is something like:
CpG_ID chr start.x end.x strand.x
chr1_246776210 chr1 246776210 246776210 +
These data were generated using the SeqCapEpi platform, not Illumina 450k. I want to use some scripts that rely on the illumina IDs, and think there must be a way to map this information to corresponding illumina ID.
I have looked at the Illumina 450k database R pacakge, but it seems a little dated and I can't find in the documentation a function that works like this, it seems to always assume you would have the cg IDs, because you would if you were using the 450k platform I'm guessing.
I expect that illumina has a database of positional data somewhere, but I don't know if it would be exactly the same as far as start position, maybe the same cg ID is 100 bp up or downstream for instance.
Anyhow, if anyone has any insight into how this might be approached, I would appreciate any advice! Thanks very much.