Skip to main content

Advertisement

Log in

FFcPsA: a fast finite conventional state using prefix pattern gene search algorithm for large sequence identification

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Gnomic information continues to flood, and this trend comes in the wake of the life sciences’ rapid development. The eventuality has been an increase in the demand for more scalable and faster searching techniques, with the demand also proving urgent. Whereas a faster algorithm could be used to search biomedical data, the process of making gene prediction remains challenging. Particularly, the searching of biomedical data has been affirmed to be a simple gradient base approach. Therefore, indexing has been investigated with the aim of achieving a fast finite conventional rate. With biomedical expressed datasheet at hand, data-based large sequence identification has been achieved via the prefix pattern gene search algorithm. Imperative to note is that real-value expression matrices can replace microarray experimental gene expression data. To ensure that the genomic dataset’s querying exhibits reductions in the overall retrieval time and that the time used for pattern array building is sped up, parallel partitioned methods have gained application. Notably, the central merit accruing from the latter method is that the majority of unrelated sequences are skipped. Also, these methods ensure that the real search problems are only decomposed to establish original database fractions. To ensure that the establishment of the gene’s hidden information and similar characteristics is enhanced, large genetic data patterns are required.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Abualigah LM, Khader AT, AI-Betar MA (2016) Unsupervised feature selection technique based on harmony search. In: 2016 7th international conference on computer science and information technology (CSIT), IEEE

  • Agrawal A, Khaitan SK (2008) A new heuristic for multiple sequence alignment. In: IEEE international conference on electro/information technology, pp 215–217

  • Archuleta J, Tilevich E, Feng W (2007) A maintainable software architecture for fast and modular bioinformatics sequence search. In: IEEE international conference on software maintenance

  • Boyer RS, Srother Moore J (1977) A fast string searching algorithm. Commun Assoc Comput Mach 20(10):762–772

    MATH  Google Scholar 

  • Ceri S, Braga D, Corcoglioniti F, Grossniklaus M, Vadacca S (2010) Search computing challenges and directions. Springer, Berlin

    Book  Google Scholar 

  • Chang YF, Chen CY, Chen HW, Lin IH (2005) Bioinformatics analysis for genome design and synthetic biology. In: Proceedings of emerging information technology conference

  • Chao-Xue W et al (2015) An improved gene expression programming algorithm based on hybrid strategy. In: 2015 8th international conference on biomedical engineering and informatics (BMEI), IEEE

  • Chimmanga K, Kalezhi J, Mumba P (2016) Application of best first search algorithm to demand control. In: 2016 IEEE PES power Africa conference, IEEE, pp 51–55

  • Fuyao Z, Qingwei L (2009) A string matching algorithm based on efficient hash function. In: International conference on information engineering and computer science, pp 1–4

  • Gupta V, Singh M, Vinod KB (2014) Pattern matching algorithms for intrusion detection and prevention system: a comparative analysis. In: International conference on advances in computing, communications and informatics, pp 50–54

  • Masseroli M, Picozzi M, Ghisalberti G, Ceri S (2014) Explorative search of distributed bio-data to answer complex biomedical questions. BMC Bioinform 15(Suppl 1):S3

    Article  Google Scholar 

  • Ooi BC, Pang HH, Wang H, Wong L, Yu C (2002) Fast filter-and-refine algorithms for subsequence selection. In: Proceedings of IDEAS, pp 243–255, Research 2001

  • Ozturk O, Ferhatosmanoglu H (2003) Effective indexing and filtering for similarity search in large biosequence databases. In: Proceedings of IEEE symposium on bioinformatics and bioengineering, March 2003, pp 359–366

  • Paira S, Chandra S, Safikul Alam Sk, Patra SS (2014) Bi linear search a new session of searching. IJARCSSE 4(3):459–463

    Google Scholar 

  • Peddapati S, Phanisri Kruthiventi KK (2016) A new random search algorithm: multiple solution vector approach. In: 2016 6th international advanced computing conference, IEEE, pp 187–190

  • Qian G, Zhu Q, Xue Q, Pramanik S (2003) The tree: a dynamic indexing technique for multidimensional non-ordered discrete data spaces. In: Proceedings of VLDB, Germany, Sept 2003

  • Safavi AA, Kelarestaghi M, Eshghi F (2017) Gene expression programming with a local search operator. In: Artificial intelligence and signal processing conference (AISP)

  • Sahinalp SC, Tasan M, Macker J, Ozsoyoglu Z (2003) Distance based indexing for string proximity search. In: Proceedings of ICDE

  • Thorsen O, Jiang K, Peters A, Smith B, Lin H, Feng W, Sosa C (2007) Parallel genomic sequence search on a massively parallel system. In: ACM international conference on computing frontiers, May 2007

  • Wahlström S (2013) Evaluation of string searching algorithms, pp 1–6

  • Xiao S, Lin H, Feng W (2013) Accelerating protein sequence search in a heterogeneous computing system. In: 2011 IEEE international parallel distributed processing symposium, May 2011

  • Xu B, Zhou X, Li J (2006) Recursive shift indexing: a fast multi-pattern string matching algorithm. In: Proceedings of the 4th international conference on applied cryptography and network security (ACNS)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Surendar.

Ethics declarations

Ethical approval

In situations, where human participants were involved, ethical guidelines stated by the national research committee were followed. Also, the study operated in line with the 1964 Helsinki Declaration regarding ethical guidelines governing the research process.

Informed consent

Imperatively, all participants were requested to provide permission or informed consent before participating in the study. The informed consent was also secured after clarifying the main aim and specific objectives of the study.

Conflict of interest

The first author declares that there is no conflict of interest in this study, and the second author also declares that there is no conflict of interest in the scholarly investigation described above.

Additional information

Communicated by P. Pandian.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Surendar, A., Arun, M. & Mahabub Basha, A. FFcPsA: a fast finite conventional state using prefix pattern gene search algorithm for large sequence identification. Soft Comput 23, 2761–2771 (2019). https://doi.org/10.1007/s00500-018-03733-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-018-03733-2

Keywords

Navigation