Unraveling overlapping deletions by agglomerative clustering

Wittler, Roland

Titelaufnahme

Titel
Unraveling overlapping deletions by agglomerative clustering
Verfasser
Wittler, Roland
Enthalten in
BMC Genomics, Jg. 14 H. Suppl 1
Erschienen
2013
Sprache
Englisch
Dokumenttyp
Aufsatz in einer Zeitschrift
URN
urn:nbn:de:0070-pub-25520927
DOI
10.1186/1471-2164-14-S1-S12

Zugriffsbeschränkung

Das Dokument ist frei verfügbar

Links

Social Media

Share
Nachweis
Kein Nachweis verfügbar
IIIF
IIIF-Manifest

Dateien

Unraveling overlapping deletions by agglomerative clustering [pdf 1.85 mb]
correctedArticle.pdf [bin 0 bytes]
RIS

Klassifikation

Klassifikation (DDC) → Naturwissenschaften und Mathematik → Biowissenschaften; Biologie → Biowissenschaften; Biologie

Abstract

Background

Structural variations in human genomes, such as deletions, play an important role in cancer development. Next-Generation Sequencing technologies have been central in providing ways to detect such variations. Methods like paired-end mapping allow to simultaneously analyze data from several samples in order to, e.g., distinguish tumor from patient specific variations. However, it has been shown that, especially in this setting, there is a need to explicitly take overlapping deletions into consideration. Existing tools have only minor capabilities to call overlapping deletions, unable to unravel complex signals to obtain consistent predictions.
Result

We present a first approach specifically designed to cluster short-read paired-end data into possibly overlapping deletion predictions. The method does not make any assumptions on the composition of the data, such as the number of samples, heterogeneity, polyploidy, etc. Taking paired ends mapped to a reference genome as input, it iteratively merges mappings to clusters based on a similarity score that takes both the putative location and size of a deletion into account.
Conclusion

We demonstrate that agglomerative clustering is suitable to predict deletions. Analyzing real data from three samples of a cancer patient, we found putatively overlapping deletions and observed that, as a side-effect, erroneous mappings are mostly identified as singleton clusters. An evaluation on simulated data shows, compared to other methods which can output overlapping clusters, high accuracy in separating overlapping from single deletions.

Inhalt

Inhalt des Werkes

Statistik

Das PDF-Dokument wurde 4 mal heruntergeladen.

Detailsuche

Bibliotheken

Projekt

Impressum

Datenschutz

Titelaufnahme