Table 1. Feature names and data type acquired in the risk factors dataset (Fernandes, Cardoso & Fernandes, 2017b).
| Feature | Type | Feature | Type |
|---|---|---|---|
| Age | int | IUD (years) | int |
| Number of sexual partners | bool × int | Sexually transmitted diseases (STDs) (yes/no) | bool × bool |
| Age of first sexual intercourse | bool × int | Number of STDs | int |
| Number of pregnancies | bool × int | Diagnosed STDs | Categorical |
| Smokes (yes/no) | bool × bool | STDs (years since first diagnosis) | int |
| Smokes (years and packs) | int × int | STDs (years last diagnosis) | int |
| Hormonal contraceptives (yes/no) | bool | Previous cervical diagnosis (yes/no) | bool |
| Hormonal contraceptives (years) | int | Previous cervical diagnosis (years) | int |
| Intrauterine device (IUD) (yes/no) | bool | Previous cervical diagnosis | Categorical |
Note:
int, integer; bool, boolean.