Forum:Bioinformatics terms that might be confusing for beginners
4
0
Entering edit mode
5.6 years ago
mcclintock ▴ 10

Everyone involved in bioinformatics should first understand the concept of the base quality, which is Phred score. But for beginners, why the base quality was named that might still be confusing. Does anyone have other examples like this? Thanks.

For instance, Bowtie, TopHatCufflinksStringtie ,Ballgown etc. , all these software are about clothes. Why?

glossary jargon • 2.5k views
ADD COMMENT
2
Entering edit mode

Hello zhangjk21 ,

why do you find "base quality" confusing? It is one of the few terms that are quite well defined.

It become more complicated with:

  • coverage
  • read depth
  • read/fragment/insert size
  • duplicates
  • 1-based vs 0-based position
  • the large variety of file types
  • ...

fin swimmer

ADD REPLY
0
Entering edit mode

What really confused me is not the true meaning or professional definition of the base quality. I know it is well defined. And I'm just interested in the story why we use this word Phred , which I can not figure out according to the definition because it is not abbreviation of other terms. In other words, I'm more interested in the history behind this word Phred.

ADD REPLY
1
Entering edit mode

Bowtie, TopHat,Cufflinks ,Stringtie ,Ballgown etc. , all these softwares are about clothes. Why?

this is simply a gimmick of the developers (they have to name their software anyway, so why not this?) . They all belong to the same suite (no pun intended) of software. I believe it is actually one of the better examples of software naming

ADD REPLY
1
Entering edit mode

It's called the Tuxedo suite, IIRC

ADD REPLY
1
Entering edit mode

Just to go back one step: the very term, Bioinformatics, is confusing. As different people have different ideas about what is bioinformatics, actually, these people then mis-interpret the skills / abilities of others who call themselves bioinformaticians. Why? - it is because they expect that these other people have all of the skills that they believe a bioinformatician should have, i.e., based on their own ideas about what is bioinfortmatics.

Bioinformatics is very broad, and there are multiple areas in which each can specialise. Then again, you have bioinformaticians who have broad / general skills but who are not true experts in any one area.

ADD REPLY
0
Entering edit mode

The difference between insert size and fragment size was confusing for me initially.

ADD REPLY
0
Entering edit mode

Just to be sure :

  • Insert size = The size of the DNA after mechanical or enzymatic shearing of DNA
  • Fragment size = The size of DNA sheared + Poly-A + adaptators

Or is it the other way around ?

ADD REPLY
0
Entering edit mode

To follow through :

PE reads      R1--------->                                                 <---------R2
Adaptaors       ~~~                                                               ~~~
fragment        ~~~===============================================================~~~
insert                 ===========================================================
inner mate                  .................................................
ADD REPLY
1
Entering edit mode

at least the other way around:

  • Fragment size = The size of the DNA after mechanical or enzymatic shearing of DNA
  • insert size = the bit of DNA between the two adapters

see here for a nice blog post on this topic

ADD REPLY
0
Entering edit mode

I added markup to your post for increased readability. You can do this by selecting the text and clicking the 101010 button. When you compose or edit a post that button is in your toolbar, see image below:

101010 Button

ADD REPLY
0
Entering edit mode

It is explained clearly in this post here

ADD REPLY
0
Entering edit mode

In practice, a fragment size can mean the length with or without the adapters, it depends on the context of which fragments are being talked about.

ADD REPLY
0
Entering edit mode

It’s worth bearing in mind that some terms (software names especially) are often named withou any actual relevance to what they do. It might be an in-joke by the group, or a reference that they like etc.

ADD REPLY
1
Entering edit mode

like northern and western blotting in experiments.

ADD REPLY
1
Entering edit mode

Actually those sort of ‘make sense’, because they followed on from Southern blotting which was invented by Edward Southern. It seemed semi logical to give them names of other cardinal points.

ADD REPLY
1
Entering edit mode

Or Just Another Bogus Bioinformatics Algorithm!

http://www.acgt.me/blog?tag=jabba

ADD REPLY
2
Entering edit mode
5.6 years ago
H.Hasani ▴ 990

Well, Wikipedia has a clear explanation for the Phred score!

ADD COMMENT
0
Entering edit mode

Tells you what it is, but doesn't actually explain where the name originated from...

ADD REPLY
1
Entering edit mode

Phred stands for Phil's Read Editor. It is software (Phred, Phrap, and Consed) written by Phil Green and team. https://www.ncbi.nlm.nih.gov/pubmed/9521921?dopt=Abstract

What's in a name? I always like to think that it was a reference to Fred Sanger, but I am not sure.

ADD REPLY
2
Entering edit mode
5.6 years ago

I think if there is one example that seems to be confusing (not only to beginners) is the similarity <-> homology one. I see this mistake appearing even in manuscripts of very "experienced" people.

ADD COMMENT
2
Entering edit mode
5.6 years ago

Mapping <=> Alignment

Alignment and mapping

ADD COMMENT
1
Entering edit mode
5.6 years ago

For instance, Bowtie, TopHat,Cufflinks ,Stringtie ,Ballgown etc. , all these softwares are about clothes. Why?

Why not? :)

ADD COMMENT
0
Entering edit mode

just to elucidate my question, LOL

ADD REPLY
0
Entering edit mode

Well yeah, naming a tool doesn't have to make sense. You just have to remember if you google for Cufflinks that you should add "RNA". Same goes for the STRING database, though.

ADD REPLY

Login before adding your answer.

Traffic: 2223 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6