Question: estimating fragment length and SD for Kallisto (single-end QuantSeq reads)

Hi all,

Deleted and reposted this to ask an actually sensible question. Sorry to those who've had to see an iteration of this twice.

I am intending to use Kallisto and was looking for an opinion re: estimating fragment length. We used a Lexogen 3’ QuantSeq kit. Here is a sample Bioanalyzer trace of one of the library preps.

[Image: Bioanalyzer trace of library prep]

Using kallisto for 3' QuantSeq data was discussed in this paper, which ran kallisto quant with l=100 and s=30. Looking at the GEO page for the prior study whose data the authors accessed, however, I did not see these values listed (were they an approximation for convenience?).

Should I use the smear analysis (in red) or the values called for peak 2, the 258 bp peak? The smear analysis boundaries were set manually at our sequencing core, whereas the peak calling was automated. I am a little more inclined to go with the automated peak calling, but I wanted to verify. Or should I use the approximation from the publication, which gave useful results?

Thank you!

14 months ago by cats_dogs • 0 • updated 14 months ago by michael.ante ♦ 3.3k

For future reference, you could just edit the original question; that way, everybody who has already responded would be notified.

14 months ago by kristoffer.vittingseerup ♦ 1.8k

Ah, thanks! Noted for the future. No one had replied yet, though.

14 months ago by cats_dogs • 0

Hi Cats_dogs,

According to the QuantSeq FAQ page, "mean library sizes of about 335 – 456 bp" result in "mean insert sizes of 203 – 324 bp", so the adapters account for 132 bp. Since your average library fragment length is 28 bp shorter than the lower end of that range, your inserts should have a mean of about 175 bp (203 bp minus 28 bp). The kallisto manual refers to the average insert size, not the peak insert size.
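
The arithmetic above can be written out as a small sketch. The 132 bp adapter contribution is inferred from the two quoted FAQ ranges (335 - 203 = 456 - 324 = 132), and the 307 bp library mean is an assumption, back-calculated as 335 bp minus 28 bp rather than read off the trace; substitute the mean from your own library. The function name is hypothetical.

```python
# Ranges quoted from the QuantSeq FAQ in the answer above.
FAQ_LIBRARY_RANGE = (335, 456)  # mean library sizes, bp
FAQ_INSERT_RANGE = (203, 324)   # mean insert sizes, bp


def insert_mean_from_library_mean(library_mean_bp):
    """Subtract the adapter length implied by the FAQ ranges."""
    adapter_low = FAQ_LIBRARY_RANGE[0] - FAQ_INSERT_RANGE[0]
    adapter_high = FAQ_LIBRARY_RANGE[1] - FAQ_INSERT_RANGE[1]
    # Both ends of the range should imply the same adapter length.
    if adapter_low != adapter_high:
        raise ValueError("FAQ ranges imply inconsistent adapter lengths")
    return library_mean_bp - adapter_low


# Assumed library mean: 335 bp - 28 bp = 307 bp (inferred, not measured here).
print(insert_mean_from_library_mean(307))  # 175
```

The same subtraction could be applied per library if the preps differ in mean size.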

The actual sequenced fragment length may be shorter due to sequencing bias.

I'd run it once with the mentioned mean of 100 bp and once with 175 bp. If you have spike-ins like the ERCCs, you can compare the measured against the expected concentrations.
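
One way to make the spike-in comparison concrete is a log-scale correlation between kallisto's estimates and the known spike-in concentrations. This is only a sketch: it assumes the standard kallisto `abundance.tsv` columns (`target_id`, `tpm`, `est_counts`), while the function names, the output paths, and the `ercc_expected` table (spike-in ID to expected concentration) are hypothetical placeholders. Whether `tpm` or `est_counts` is the better measure for 3' data is a separate choice.

```python
import csv
import math


def read_kallisto_column(abundance_tsv, column="tpm"):
    """Read target_id -> value for one column of a kallisto abundance.tsv."""
    with open(abundance_tsv, newline="") as handle:
        reader = csv.DictReader(handle, delimiter="\t")
        return {row["target_id"]: float(row[column]) for row in reader}


def log_pearson(measured, expected):
    """Pearson correlation of log2(measured) vs log2(expected).

    Only IDs present in both dicts with positive values are used.
    """
    pairs = [
        (math.log2(measured[k]), math.log2(expected[k]))
        for k in expected
        if k in measured and measured[k] > 0 and expected[k] > 0
    ]
    if len(pairs) < 2:
        raise ValueError("need at least two shared spike-ins with positive values")
    xs, ys = zip(*pairs)
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant values")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical usage; paths and the ercc_expected dict are placeholders:
# run_100 = read_kallisto_column("quant_l100/abundance.tsv")
# run_175 = read_kallisto_column("quant_l175/abundance.tsv")
# print(log_pearson(run_100, ercc_expected), log_pearson(run_175, ercc_expected))
```

The run whose spike-in estimates track the expected concentrations more closely would be the more defensible setting.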

The standard deviation is harder to estimate. I'd stick to the published values.
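
As a sketch of the two runs suggested here, the snippet below builds the `kallisto quant` command lines for single-end reads with a mean of 100 bp and of 175 bp, keeping the published SD of 30. The flags `-i`, `-o`, `--single`, `-l`, and `-s` are kallisto's own; the index, FASTQ, and output directory names are placeholders, and the helper name is hypothetical.

```python
import shlex


def kallisto_single_end_cmd(index, out_dir, fastq, frag_len, frag_sd):
    """Build a `kallisto quant` argument list for single-end reads."""
    return [
        "kallisto", "quant",
        "-i", index,            # transcriptome index from `kallisto index`
        "-o", out_dir,          # output directory for this run
        "--single",             # single-end mode requires -l and -s
        "-l", str(frag_len),    # estimated average fragment length
        "-s", str(frag_sd),     # estimated SD of fragment length
        fastq,
    ]


# Placeholder file names; SD of 30 taken from the published settings.
for frag_len in (100, 175):
    cmd = kallisto_single_end_cmd(
        "transcripts.idx", f"quant_l{frag_len}", "sample.fastq.gz", frag_len, 30
    )
    print(shlex.join(cmd))
```

Passing each list to `subprocess.run(cmd, check=True)` would execute the runs instead of printing them.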

Cheers,

Michael

14 months ago by michael.ante ♦ 3.3k

Ah, rats, okay. Thank you, you saved me from a fairly large goof. Do you think it's advisable to adjust the insert size for libraries with different fragment lengths in addition to doing a run with 100 bp for all? Cheers!

14 months ago by cats_dogs • 0
