Summarizing Genome-wide Phased Genotypes using Phased PC Plots

Sergio Torres-Sánchez, Nuria Medina-Medina, María M. Abad-Grau

2014

Abstract

Ordination in reduced space, such as principal component (PC) analysis, and its visual representation in PC plots may help to uncover important patterns among samples in high-dimensional data sets. When applied to data sets obtained from genome-wide genotyping, PC plots may show biologically relevant relationships among populations, such as population structure and admixture. Extending PC analysis to genome-wide phased genotypes may help to reveal different levels of inbreeding between or within populations, as well as to evaluate the quality of the haplotyping technique used. We have developed a method to perform PC analysis on a data set of genome-wide phased genotypes and to plot the results while keeping information about individuals. The method has been implemented in the computer program PCPhaser. To increase the method's applicability and reduce development time, PCPhaser transforms the input data set by segregating haplotypes and uses the EIGENSOFT software to perform the PC analysis. Given this transformation, the proposed method can be applied with any other software able to perform PCA, although PCPhaser is still required to draw the phased PC plots. PCPhaser is Linux-based software that can be downloaded from http://bios.ugr.es/PCPhaser.
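The transformation described in the abstract (segregating haplotypes so that each one enters the PC analysis as its own sample) can be illustrated with a minimal sketch. This is not PCPhaser or EIGENSOFT code: the function name `phased_pca`, the array layout `(individuals, 2, SNPs)`, the 0/1 allele coding, and the use of a plain NumPy SVD with column centering only are all assumptions made for illustration.

```python
import numpy as np

def phased_pca(haplotypes, n_components=2):
    """Hypothetical helper: PCA on segregated haplotypes.

    haplotypes: array-like of shape (n_individuals, 2, n_snps),
    assumed to hold 0/1 allele codes for the two phased haplotypes
    of each diploid individual.
    Returns an array of shape (n_individuals, 2, n_components).
    """
    haps = np.asarray(haplotypes, dtype=float)
    n_ind, ploidy, n_snps = haps.shape

    # Segregate haplotypes: every haplotype becomes one row, so
    # individual i contributes rows 2*i and 2*i + 1.
    rows = haps.reshape(n_ind * ploidy, n_snps)

    # Center each SNP column (no further normalization is applied
    # in this sketch).
    centered = rows - rows.mean(axis=0)

    # PCA via singular value decomposition; scores = U * S.
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]

    # Regroup so that result[i, h] is the PC coordinate of
    # haplotype h of individual i, keeping the individual link.
    return scores.reshape(n_ind, ploidy, n_components)


# Toy usage with random data (illustrative only, not real genotypes).
rng = np.random.default_rng(0)
toy = rng.integers(0, 2, size=(5, 2, 20))
pcs = phased_pca(toy)
```

Because the output keeps both haplotype points grouped per individual, a plotting step could, for instance, join each pair of points with a segment; that drawing convention is an assumption here, not a description of how PCPhaser renders its plots.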


Paper Citation


in Harvard Style

Torres-Sánchez S., Medina-Medina N. and Abad-Grau M. (2014). Summarizing Genome-wide Phased Genotypes using Phased PC Plots. In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014) ISBN 978-989-758-012-3, pages 130-135. DOI: 10.5220/0004793501300135

in Bibtex Style

@conference{bioinformatics14,
author={Sergio Torres-Sánchez and Nuria Medina-Medina and María M. Abad-Grau},
title={Summarizing Genome-wide Phased Genotypes using Phased PC Plots},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014)},
year={2014},
pages={130-135},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004793501300135},
isbn={978-989-758-012-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2014)
TI - Summarizing Genome-wide Phased Genotypes using Phased PC Plots
SN - 978-989-758-012-3
AU - Torres-Sánchez S.
AU - Medina-Medina N.
AU - Abad-Grau M.
PY - 2014
SP - 130
EP - 135
DO - 10.5220/0004793501300135