2021 Dec 10;11:23823. doi: 10.1038/s41598-021-03204-z

Table 4.

Set of NLP regular expressions embedded in header_function.py for the internal reports.

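REDCap data label: BIOPSY DATE
  REDCap data variable: nod_date_exam_req
  Report template (internal reports): "Accettazione" or "Pervenuto" or "Richiesta del" or "Ricevimento"
  NLP pattern: cettaz.+|ervenuto.+|ichiesta.*del.+[0-3][0-9]/[0-1][0-9]/2[0-9][0-9][0-9]

REDCap data label: ID NUMBER
  REDCap data variable: nod_exam_num_req
  Report template (internal reports): "N. Esame"
  NLP pattern: +same.*[0-3][0-9-.-d]

REDCap data label: SURNAME
  REDCap data variable: pts_surname_demo
  Report template (internal reports): "Cognome"
  NLP pattern: COGNOME.*|COGNOME.*DATA|COGNOME.*CITT

REDCap data label: NAME
  REDCap data variable: pts_name_demo
  Report template (internal reports): "Nome"
  NLP pattern: \\bNOME.*|\\bNOME.*DATA|\\bNOME.*CITT

REDCap data label: DATE OF BIRTH
  REDCap data variable: dob_demo
  Report template (internal reports): "Data di nascita"
  NLP pattern: .+asci.+[0-3][0-9]/[0-1][0-9]/[1,2][0-9][0-9][0-9]

REDCap data label: PLACE OF BIRTH
  REDCap data variable: city_born_demo
  Report template (internal reports): "Comune di Nascita"
  NLP pattern: .+omu.+asci.+\w+

REDCap data label: SEX
  REDCap data variable: sex_demo
  Report template (internal reports): "Sesso"
  NLP pattern: .+ess.{1,3}m

REDCap data label: SSN
  REDCap data variable: ssn_demo
  Report template (internal reports): "Codice Fiscale"
  NLP pattern: [A-Z]{6}[0-9][0-9][A-Z][0-9]{2}[A-Z][0-9]{3}[A-Z]

REDCap data label: SPECIMEN TYPE
  REDCap data variable: ln_specimen_dis
  Report template (internal reports): "Materiale Inviato"
  NLP pattern: ate.+al.+via.+\n.+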

NLP: Natural Language Processing; ID: Identification; NA: Not Available; SSN: Social Security Number.
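
As a rough illustration of how patterns like those in Table 4 might be applied, the sketch below runs the date-of-birth and SSN expressions over an invented report header and maps the results to their REDCap variables. It is not the code of header_function.py. The function name extract_header_fields, the helper DATE, and the sample header are hypothetical. The patterns are written with plain hyphens and without spaces before quantifiers, on the assumption that the en dashes and extra spaces in the extracted table are typesetting artifacts.

```python
import re

# Patterns transcribed from Table 4 (assumption: en dashes and spaces before
# quantifiers in the extracted table are artifacts, not part of the regex).
PATTERNS = {
    "dob_demo": r".+asci.+[0-3][0-9]/[0-1][0-9]/[1,2][0-9][0-9][0-9]",
    "ssn_demo": r"[A-Z]{6}[0-9][0-9][A-Z][0-9]{2}[A-Z][0-9]{3}[A-Z]",
}

# Hypothetical helper: isolates a dd/mm/yyyy date inside a longer match.
DATE = re.compile(r"[0-3][0-9]/[0-1][0-9]/[0-9]{4}")


def extract_header_fields(text):
    """Return {REDCap variable: extracted value or None} for a report header."""
    fields = {}
    for variable, pattern in PATTERNS.items():
        match = re.search(pattern, text)
        fields[variable] = match.group(0) if match else None
    # The date-of-birth pattern matches the label as well as the date,
    # so keep only the last date found in the matched span.
    if fields["dob_demo"]:
        fields["dob_demo"] = DATE.findall(fields["dob_demo"])[-1]
    return fields


# Invented header for illustration; not taken from any real report.
SAMPLE_HEADER = (
    "Cognome: ROSSI Nome: MARIO\n"
    "Data di nascita: 01/01/1980 Comune di Nascita: ROMA\n"
    "Sesso: M Codice Fiscale: RSSMRA80A01H501U\n"
)

print(extract_header_fields(SAMPLE_HEADER))
```

Keying the dictionary by REDCap variable name keeps the extraction output in the shape of a record that could later be imported, though how the original pipeline passes values to REDCap is not shown in this table.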