IntroductionDiffering ancestries of human subpopulations create systematic differences in genetic allele frequencies across the genome, a phenomenon known as population stratification or substructure. If a phenotypic trait such as disease is associated with subpopulation membership, a genetic association study can identify spurious relationships with genetic markers. Singular value decomposition (SVD) of genotype data or eigen decomposition of covariance matrices can be used to identify population stratification. The eigenvectors (essentially principal component scores) that correspond to large eigenvalues can be used as covariates in association analysis (Levine et al., ). The combined analysis of unrelated and related individuals is a common feature of genetic association studies (Zhu et al., ). However, the presence of close‐degree relatives in a genetic dataset presents difficulties, as the family structure can greatly influence the eigenvalues and eigenvectors.Cystic fibrosis (CF) is a recessive genetic lung disorder, caused by a mutation in the single gene CFTR. However, considerable genetic variation remains in the severity of disease, and evidence indicates this variation is complex and influenced by numerous genes (Wright et al., ). Genotypes gathered by the North American CF Consortium are typical of a large‐scale genomewide association study (GWAS), with thousands of individuals and over 1
Biometrics – Wiley
Published: Jan 1, 2018
Keywords: ; ;
It’s your single place to instantly
discover and read the research
that matters to you.
Enjoy affordable access to
over 18 million articles from more than
15,000 peer-reviewed journals.
All for just $49/month
Query the DeepDyve database, plus search all of PubMed and Google Scholar seamlessly
Save any article or search result from DeepDyve, PubMed, and Google Scholar... all in one place.
All the latest content is available, no embargo periods.
“Whoa! It’s like Spotify but for academic articles.”@Phil_Robichaud