Data collection can be done through various means, including clinical trials, patient surveys, medical records, and genomic databases. Each source has its own set of challenges and benefits. For instance, clinical trials provide highly controlled datasets but may not represent the broader population. Medical records offer real-world data but often come with issues of inconsistency and incompleteness.