Selecting genetic variants (e.g., single nucleotide polymorphisms or SNPs) that are robustly associated with the risk factor of interest. Ensuring these genetic variants are not associated with confounders that could bias the relationship between the risk factor and cancer. Using statistical methods to estimate the causal effect of the risk factor on cancer, leveraging the genetic variants as instrumental variables.