Table 4. Locations of Hi-C centromere calls and RNA-seq mapping-based predictions for P. pastoris CBS7435 centromeres.
Varoquaux et al. used the chromatin conformation capture assay Hi-C to predict the centromere regions of P. pastoris GS115 to within a 20 kbp region. By mapping RNA-seq data to the reference sequence presented here, we were able to observe a drastic drop in signal strength in the regions listed below, indicating low transcriptional activity in those regions. Within those regions we identified near-perfect inverted repeats. The reorientation of chr1, chr3 and chr4 described above resulted in coordinates that differ from those of GS115. The slight difference between GS115 and CBS7435 on chromosome 4 arises from the shorter length of GS115 chr4.
| chr | GS115 2009: Hi-C | GS115 2009: predicted | CBS7435 2016 predicted centromere: before reorientation | CBS7435 2016 predicted centromere: after reorientation | ORF-free space [bp] | Inverted repeats: individual repeats [bp] | Inverted repeats: total sequence spanned [bp] | Inverted repeats: identity [%] |
|---|---|---|---|---|---|---|---|---|
| 1 | 1408908 ± 20000 | 1400423..1409375 | 1487825..1493796 | 1401559..1407530 | 8955 | 1991 | 5354 | 99 |
| 2 | 1556231 ± 20000 | 1542915..1551466 | 1545323..1551977 | 844482..851136 | 10413 | 2699 | 6655 | 99 |
| 3 | 2226823 ± 20000 | 2202870..2211602 | 2222793..2228973 | 34486..40666 | 8734 | 2649 | 6183 | 99 |
| 4 | 1719280 ± 20000 | 1701016..1712046 | 1762920..1769148 | 58794..65022 | 9976 | 2559 | 6229 | 99 |
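
The relationship between the "before reorientation" and "after reorientation" coordinates can be illustrated with a short sketch. It assumes that reorientation means reverse-complementing the whole chromosome and that coordinates are 1-based and inclusive; neither assumption is stated explicitly in the table. The function names are hypothetical, and the chromosome lengths are not taken from the source: they are inferred from the coordinate pairs themselves.

```python
def flip_interval(start, end, chrom_length):
    """Map a 1-based inclusive interval onto the reverse-complemented chromosome."""
    return chrom_length - end + 1, chrom_length - start + 1


def implied_length(before, after):
    """Chromosome length implied by one interval before and after a full flip.

    Assumption: the flip covers the whole chromosome, so
    before_start + after_end == chrom_length + 1.
    """
    return before[0] + after[1] - 1


# (before reorientation, after reorientation), copied from the table.
TABLE = {
    "chr1": ((1487825, 1493796), (1401559, 1407530)),
    "chr2": ((1545323, 1551977), (844482, 851136)),
    "chr3": ((2222793, 2228973), (34486, 40666)),
    "chr4": ((1762920, 1769148), (58794, 65022)),
}

for chrom, (before, after) in TABLE.items():
    length = implied_length(before, after)  # inferred, not a published value
    consistent = flip_interval(before[0], before[1], length) == after
    print(chrom, "implied length:", length, "consistent with a flip:", consistent)
```

Because the length is derived from the same pair it is then checked against, the comparison only confirms that the interval keeps its size and that the two columns are compatible with a single whole-chromosome flip. It does not independently validate the chromosome lengths.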