Loading [a11y]/accessibility-menu.js
A nearest neighbor method for predicting solenoid proteins | IEEE Conference Publication | IEEE Xplore

A nearest neighbor method for predicting solenoid proteins


Abstract:

Solenoid proteins are proteins with repeats of 5 to 40 residues in length. Identifying solenoid proteins presents a big challenge because the repeat sequences are highly ...Show More

Abstract:

Solenoid proteins are proteins with repeats of 5 to 40 residues in length. Identifying solenoid proteins presents a big challenge because the repeat sequences are highly degenerated. Here, we present a nearest neighbor (NN) method for predicting solenoid proteins based on residue composition. The distance between proteins is calculated as a weighted Euclidean distance defined by the residue composition vector. The NN method predicts solenoid proteins with an overall accuracy of 95.5% with 94.3% sensitivity and 96% specificity, outperforming other methods in direct comparisons. We also demonstrate that combining the NN method with HHrepID and Trust, which are previously published methods for addressing the same problem, can dramatically reduce the false positive rates in predicting repeats.
Date of Conference: 11-13 August 2012
Date Added to IEEE Xplore: 25 February 2013
ISBN Information:
Conference Location: Hangzhou, China

Contact IEEE to Subscribe

References

References is not available for this document.