ABSTRACT
Structural motifs refer to patterns in 3D space that bear biological significance as they can indicate regions in the protein that have important roles in biochemical functions. Finding structural motifs given a set of peptides are NP-hard. Thus, there is no known polynomial-time algorithm that solves the problem optimally except when the quality of the solution is compromised. Although computationally hard, the problem is approximable and has a known polynomial-time approximation scheme (PTAS). With an existing PTAS, the problem can have a guaranteed quality that is inversely proportional to the running time. For a small error bound, the running time of the algorithm can become impractical. In this study, we design and implement a parallel version of the PTAS for the (R-C)-compact structural motif problem. Based on the empirical results, we obtained a speedup between 4x - 5x from the sequential version of the algorithm using three different protein data sets.
- 2018. UCSF ChimeraX. https://www.cgl.ucsf.edu/chimerax/.Google Scholar
- 2022. PyMOL by SchrMath 33dinger. https://pymol.org/.Google Scholar
- 2022. Python 3.10.5 Documentation for Multiprocessing and Process-based parallelism. https://docs.python.org/3/library/multiprocessing.html.Google Scholar
- Ghada Badr, Isra Al-Turaiki, and Hassan Mathkour. 2013. Classification and assessment tools for structural motif discovery algorithms. In BMC Bioinformatics 14, S4. https://doi.org/:10.1186/1471-2105-14-S9-S4Google ScholarCross Ref
- Jym Paul A. Carandang, Jhoirene B. Clemente, John Erol M. Evangelista, and Henry N. Adorna. 2018. PepSquad: A Tool for Finding Compact Structural Motifs from Peptides. 49–58.Google Scholar
- Nurhan Cetin. 2001. Speed-Up and Efficiency. https://svn.vsp.tu-berlin.de/repos/public-svn/publications/kn-old/strc/html/node9.html.Google Scholar
- Anirudh Chakravorty, Thomas George, and Yogish Sabharwal. 2014. Multicore Parallelization of the PTAS Dynamic Program for the Bin-Packing Problem. International Conference of Distributed Computing and Networking (Jan. 2014), 81–95. https://doi.org/10.1007/978-3-642-45249-9_6 MAG ID: 72703661 S2ID: dd10f5c19812c88cc30a006b0a6a32f8c277e851.Google ScholarDigital Library
- Slobodan Jelic, Soeren Laue, Domagoj Matijevic, and Patrick Wijerama. 2015. A Fast Parallel Implementation of a PTAS for Fractional Packing and Covering Linear Programs. International Journal of Parallel Programming 43, 5 (Oct. 2015), 840–875. https://doi.org/10.1007/s10766-015-0352-y MAG ID: 2063612423 S2ID: 15ad2ebe3b613a98a7a9586ee166db205b53048e.Google ScholarDigital Library
- Anton Petrov, Craig Zirbel, and Neocles Leontis. 2012. RNA 3D Hub: an Online Resource for RNA Structural Bioinformatics. http://rna.bgsu.edu/.Google Scholar
- Jianbo Qian, Shuaicheng Li, Dongbo Bu, Ming Li, and Jinbo Xu. 2007. Finding Compact Structural Motifs. In Combinatorial Pattern Matching. 142–149. https://doi.org/10.1007/978-3-540-73437-6_16Google ScholarCross Ref
- Otto von Guericke University Magdeburg. 2018. Parallel efficiency - simple approach. https://wikis.ovgu.de/lss/doku.php?id=guide:parallel_efficiency.Google Scholar
Index Terms
- Parallel Polynomial-Time Approximation Scheme (PTAS) for Finding Compact Structural Motifs
Recommendations
Finding compact structural motifs
Protein structural motif detection has important applications in structural genomics. Compared with sequence motifs, structural motifs are more sensitive in revealing the evolutionary relationships among proteins. A variety of algorithms have been ...
Polynomial time approximation schemes and parameterized complexity
In this paper, we study the relationship between the approximability and the parameterized complexity of NP optimization problems. We introduce a notion of polynomial fixed-parameter tractability and prove that, under a very general constraint, an NP ...
Structural alphabet motif discovery and a structural motif database
This study proposes a general framework for structural motif discovery. The framework is based on a modular design in which the system components can be modified or replaced independently to increase its applicability to various studies. It is a two-...
Comments