Average insert size for paired end Illumina RNA-seq
9.7 years ago
xiefanfang ▴ 10

Does anyone know how to get the average insert size of paired-end RNA-seq data (excluding linkers and adapters)? These are Illumina HiSeq 2000 data from a 2×100 bp run. Thank you!

next-gen sequencing RNA-Seq • 9.4k views
9.7 years ago
Asaf 10k

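The sequencing run itself does not report the insert size. However, if you (or whoever prepared the libraries) measured the DNA fragment lengths with a Bioanalyzer or a similar instrument before sequencing, you can use those measurements to estimate the insert length. If the measurement was taken on the final library, the fragments include the adapters, so subtract their combined length to get the insert alone.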

After you map the reads to the genome, you can get the length of each fragment from column 9 (TLEN) of the SAM file. Each properly mapped pair appears twice, once with a positive and once with a negative TLEN, so just compute the average of the positive values (>0) in this column.
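As a rough illustration of averaging the positive TLEN values, here is a minimal Python sketch. It reads TLEN as the ninth tab-separated SAM field (zero-based index 8). The function name `mean_insert_size`, the optional `max_tlen` cutoff, and the example records are all made up for this sketch. The cutoff is an assumption on my part: with RNA-seq reads mapped to a genome, pairs that span an intron can report a very large TLEN, so you may want to cap it.

```python
def mean_insert_size(sam_lines, max_tlen=None):
    """Average the positive TLEN values (SAM field 9) over alignment lines."""
    total = 0
    count = 0
    for line in sam_lines:
        if line.startswith("@"):      # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:          # not a complete alignment record
            continue
        tlen = int(fields[8])         # field 9 (TLEN), zero-based index 8
        if tlen <= 0:                 # keep one (positive) value per pair
            continue
        if max_tlen is not None and tlen > max_tlen:
            continue                  # optional cap, e.g. for intron-spanning pairs
        total += tlen
        count += 1
    return total / count if count else None


# Tiny made-up SAM excerpt, for illustration only.
example_sam = [
    "@SQ\tSN:chr1\tLN:1000000",
    "r1\t99\tchr1\t100\t60\t100M\t=\t250\t250\t*\t*",
    "r1\t147\tchr1\t250\t60\t100M\t=\t100\t-250\t*\t*",
    "r2\t99\tchr1\t500\t60\t100M\t=\t750\t350\t*\t*",
    "r2\t147\tchr1\t750\t60\t100M\t=\t500\t-350\t*\t*",
    "r3\t73\tchr1\t900\t60\t100M\t=\t900\t0\t*\t*",
]

print(mean_insert_size(example_sam))  # 300.0
```

For a real file you would pass an open file handle instead of the list, for example `mean_insert_size(open("aligned.sam"))`, where `aligned.sam` is a placeholder name.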


For future readers: TLEN is field number 9 of a SAM record, not 10 (field 10 is SEQ, the read sequence).

9.7 years ago
st.ph.n ★ 2.7k

A somewhat longer option: after trimming adapters/barcodes, align the reads to a reference or to your own de novo assembly using bwa or bowtie. You can then run Picard tools (CollectInsertSizeMetrics), which gives you a nice histogram of the insert sizes in your library.
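For the Picard step, the command line can be assembled roughly like this. It is only a sketch: the helper `picard_insert_size_cmd` and all file names (`picard.jar`, `aligned.sorted.bam`, and the two output files) are placeholders, and it uses Picard's `I=`, `O=` and `H=` arguments for the input alignment, the metrics file and the histogram file.

```python
import shlex


def picard_insert_size_cmd(alignment, metrics_out, histogram_out,
                           picard_jar="picard.jar"):
    """Build (but do not run) a Picard CollectInsertSizeMetrics command."""
    return [
        "java", "-jar", picard_jar, "CollectInsertSizeMetrics",
        f"I={alignment}",       # aligned SAM/BAM file
        f"O={metrics_out}",     # text file with the insert size metrics
        f"H={histogram_out}",   # histogram chart file
    ]


# Placeholder file names, for illustration only.
cmd = picard_insert_size_cmd(
    "aligned.sorted.bam",
    "insert_size_metrics.txt",
    "insert_size_histogram.pdf",
)
print(shlex.join(cmd))

# To actually run it (requires Java and Picard to be installed):
# import subprocess
# subprocess.run(cmd, check=True)
```

The printed line can also be pasted straight into a shell if you would rather not launch it from Python.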
