Accelerating Binary String Comparisons with a Scalable, Streaming-Based System Architecture Based on FPGAs

Pilz, Sarah; Porrmann, Florian; Kaiser, Martin; Hagemeyer, Jens; Hogan, James M.; Rückert, Ulrich

Titelaufnahme

Titel
Accelerating Binary String Comparisons with a Scalable, Streaming-Based System Architecture Based on FPGAs
Verfasser
Pilz, Sarah ; Porrmann, Florian ; Kaiser, Martin ; Hagemeyer, Jens ; Hogan, James M. ; Rückert, Ulrich
Enthalten in
Algorithms, Jg. 13 H. 2
Erschienen
2020
Sprache
Englisch
Dokumenttyp
Aufsatz in einer Zeitschrift
Schlagwörter
FPGA / bioinformatics / multiple string matching / parallel computing / Hamming distance / locality sensitive hashing / hardware acceleration / parallel system architecture
URN
urn:nbn:de:0070-pub-29416462
DOI
10.3390/a13020047

Zugriffsbeschränkung

Das Dokument ist frei verfügbar

Links

Social Media

Share
Nachweis
Kein Nachweis verfügbar
IIIF
IIIF-Manifest

Dateien

Accelerating Binary String Comparisons with a Scalable, Streaming-Based System Architecture Based on FPGAs [pdf 1.61 mb] RIS

Klassifikation

Klassifikation (DDC) → Technik, Medizin, angewandte Wissenschaften → Technik → Technik, Technologie

Abstract

This paper is concerned with Field Programmable Gate Arrays (FPGA)-based systems for energy-efficient high-throughput string comparison. Modern applications which involve comparisons across large data sets—such as large sequence sets in molecular biology—are by their nature computationally intensive. In this work, we present a scalable FPGA-based system architecture to accelerate the comparison of binary strings. The current architecture supports arbitrary lengths in the range 16 to 2048-bit, covering a wide range of possible applications. In our example application, we consider DNA sequences embedded in a binary vector space through Locality Sensitive Hashing (LSH) one of several possible encodings that enable us to avoid more costly character-based operations. Here the resulting encoding is a 512-bit binary signature with comparisons based on the Hamming distance. In this approach, most of the load arises from the calculation of the O ( m ∗ n ) Hamming distances between the signatures, where m is the number of queries and n is the number of signatures contained in the database. Signature generation only needs to be performed once, and we do not consider it further, focusing instead on accelerating the signature comparisons. The proposed FPGA-based architecture is optimized for high-throughput using hundreds of computing elements, arranged in a systolic array. These core computing elements can be adapted to support other string comparison algorithms with little effort, while the other infrastructure stays the same. On a Xilinx Virtex UltraScale+ FPGA (XCVU9P-2), a peak throughput of 75.4 billion comparisons per second—of 512-bit signatures—was achieved, using a design with 384 parallel processing elements and a clock frequency of 200 MHz. This makes our FPGA design 86 times faster than a highly optimized CPU implementation. Compared to a GPU design, executed on an NVIDIA GTX1060, it performs nearly five times faster.

Inhalt

Inhalt des Werkes

Statistik

Das PDF-Dokument wurde 6 mal heruntergeladen.

Lizenz-/Rechtehinweis

Creative Commons Namensnennung 4.0 International Lizenz

Detailsuche

Bibliotheken

Projekt

Impressum

Datenschutz

Titelaufnahme