Study cohort, inclusion criteria, and data collection. The data sources included in the study were categorized into comorbidity data, laboratory results, smoking history, and symptoms data. Comorbidity data encompassed information on ICD-10 codes, prescription medications, the number of visits, and quick tests performed in general practice. Laboratory results consisted of 20 different analyses. Smoking history provided detailed records of smoking habits in binary format, while symptoms data included information on common symptoms, familial predispositions, and relevant exposures to LC. These data were collected for specific periods leading up to the date of inclusion, referred to as the index date, and are depicted by the bars on the right side of the image. Created with Biorender.com.