Skip to main content
. Author manuscript; available in PMC: 2015 Apr 24.
Published in final edited form as: Curr Protoc Hum Genet. 2014 Apr 24;81:7.23.1–7.23.21. doi: 10.1002/0471142905.hg0723s81

Figure 9. Distribution of post-normalization target variance.

Figure 9

Before XHMM calculates z-scores and the HMM is run to call CNV for each sample (CNV “discovery”), we perform a final filtering step. Specifically, we remove any targets that have “very scattered” read depth distributions post normalization. These can be thought of as targets for which the normalization may have failed, and it is better to remove such strong effects (still likely to be artifacts) to prevent them from drowning out other more subtle signals. In detail, we removed any targets with large standard deviations of their post-normalization read depths across all samples. As a (proto-typical) example, we see here that the small fraction of targets with standard deviations any larger than the 30 to 50 range were removed (in this case, for a scenario of ~100× mean sequencing coverage).