I am working with a data frame in R and would like to compare the contents of two columns, PathwayName and ExpressionData. The comparison has to run across many rows (10,695,840 entries).
Here are the first few rows of my data frame; the contents of each cell are separated only by whitespace.
  PathwayName                                   ExpressionData
1 41bbPathway BLACK 215538_at 210671_x_at...    215538_at na 28.566616...
2 ace2Pathway BLACK 214533_at 215184_at...      215538_at na 28.566616...
3 acetPathway BLACK 215184_at 01502_s_at...     215184_at na 4.2084746...
4 achPathway BLACK 211570_s_at 215184_at...     215184_at na 4.2084746...
5 hoPathway BLACK 201968_at 214578_s_at...      201968_at na 472.4969...
As the final product, I want to compare the two columns, copy the matching values, and save them to a new file, so that the output looks like this:
  PathwayName              ExpressionData
1 41bbPathway 215538_at    215538_at
2 acetPathway 215184_at    215184_at
3 achPathway 215184_at     215184_at
4 hoPathway 201968_at      201968_at
Everything I have tried so far has failed, because most approaches compare whole rows rather than the contents inside each cell.
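To make the intended cell-level comparison concrete, here is a minimal sketch on a toy data frame. It rests on several assumptions: the first whitespace-separated token of PathwayName is the pathway, the first token of ExpressionData is the probe ID, and a row is kept when that probe ID appears among the tokens of PathwayName. The names `df`, `result`, and `out_file` are hypothetical, and the truncated `...` parts of the real data are left out.

```r
# Toy data frame mirroring the layout above (hypothetical, truncations omitted).
df <- data.frame(
  PathwayName = c(
    "41bbPathway BLACK 215538_at 210671_x_at",
    "ace2Pathway BLACK 214533_at 215184_at",
    "acetPathway BLACK 215184_at 01502_s_at",
    "achPathway BLACK 211570_s_at 215184_at",
    "hoPathway BLACK 201968_at 214578_s_at"
  ),
  ExpressionData = c(
    "215538_at na 28.566616",
    "215538_at na 28.566616",
    "215184_at na 4.2084746",
    "215184_at na 4.2084746",
    "201968_at na 472.4969"
  ),
  stringsAsFactors = FALSE
)

# Split the contents of each cell on whitespace.
path_tokens <- strsplit(trimws(df$PathwayName), "\\s+")
expr_tokens <- strsplit(trimws(df$ExpressionData), "\\s+")

# Assumption: the first token is the pathway name / the probe ID respectively.
pathway <- vapply(path_tokens, `[`, character(1), 1)
probe   <- vapply(expr_tokens, `[`, character(1), 1)

# Keep a row only if its probe ID occurs among the PathwayName tokens.
keep <- mapply(function(p, toks) p %in% toks, probe, path_tokens,
               USE.NAMES = FALSE)

result <- data.frame(
  PathwayName    = paste(pathway[keep], probe[keep]),
  ExpressionData = probe[keep],
  stringsAsFactors = FALSE
)

# Save to a new file (hypothetical location in the session's temp directory).
out_file <- file.path(tempdir(), "matched_probes.txt")
write.table(result, out_file, quote = FALSE, sep = "\t")
```

On the toy data this drops the ace2Pathway row, since 215538_at is not among its tokens. Whether a per-row `mapply` like this is fast enough for roughly 10 million rows is untested here.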
I hope someone can help.