School of Computing and Information Sciences

High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry Data

Muhammad Haseeb, Knight Foundation School of Computing and Information Sciences, Florida International UniversityFollow
Fahad Saeed, Knight Foundation School of Computing and Information Sciences, Biomolecular Sciences Institute, Department of Human and Molecular Genetics, Herbert Wertheim School of Medicine, Florida International University Follow

Date of this Version

12-10-2021

Document Type

Article

Abstract

Database peptide search algorithms deduce peptides from mass spectrometry data. There has been substantial effort in improving their computational efficiency to achieve larger and more complex systems biology studies. However, modern serial and high-performance computing (HPC) algorithms exhibit suboptimal performance mainly due to their ineffective parallel designs (low resource utilization) and high overhead costs. We present an HPC framework, called HiCOPS, for efficient acceleration of the database peptide search algorithms on distributed-memory supercomputers. HiCOPS provides, on average, more than tenfold improvement in speed and superior parallel performance over several existing HPC database search software. We also formulate a mathematical model for performance analysis and optimization, and report near-optimal results for several key metrics including strong-scale efficiency, hardware utilization, load-balance, inter-process communication and I/O overheads. The core parallel design, techniques and optimizations presented in HiCOPS are search-algorithm-independent and can be extended to efficiently accelerate the existing and future algorithms and software.

Comments

Pre- Print Version.

Version of record is available https://doi.org/10.1038/s43588-021-00113-z

Recommended Citation

Muhammad Haseeb, and Fahad Saeed, “High performance computing framework for tera-scale database search of mass spectrometry data”. Nature Computational Science 1, 550–561 (2021). https://doi.org/10.1038/s43588-021-00113-z

Download

COinS

DOI

10.1038/s43588-021-00113-z

Rights Statement

In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/
This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).

School of Computing and Information Sciences

High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry Data

Date of this Version

Document Type

Abstract

Comments

Recommended Citation

DOI

Rights Statement

Search

Links

Browse

Author Corner

School of Computing and Information Sciences

High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry Data

Authors

Date of this Version

Document Type

Abstract

Comments

Recommended Citation

Share

DOI

Rights Statement

Search

Links

Browse

Author Corner