Bioinformatics Training and Education Program

BTEP Question Forum

BTEP maintains several Question and Answer Forums of interest to the NCI/CCR community.
Currently, there are forums on these topics listed below:

If you wish to ask a question go to the Ask Question Page and submit your question.

 Back to Questions

Pipelines and QC: What is "effective genome size"?

What is "effective genome size"?


1 Answer:

Effective genome size denotes the portion of the genome that is “mappable” by a sequencing read. There are two common alternative ways to calculate this: 1. The number of non-N bases in the genome. 2. The number of regions (of some size) in the genome that are uniquely mappable (possibly given some maximal edit distance). Mappability is also affected by sequencing read-length, but typically we use ~2.9Gb and ~2.6Gb as effective genome sizes for human and mouse, respectively. - answered by Tovah Markowitz, Paul Schaughency, Vishal Koparde.

Answered on June 5th, 2020 by