CDF it all: Consensus prediction of intrinsically disordered proteins based on various cumulative distribution functions
Abstract
Many biologically active proteins are intrinsically disordered. A reasonable understanding of the disorder status of these proteins may be beneficial for better understanding of their structures and functions. The disorder contents of disordered proteins vary dramatically, with two extremes being fully ordered and fully disordered proteins. Often, it is necessary to perform a binary classification and classify a whole protein as ordered or disordered. Here, an improved error estimation technique was applied to develop the cumulative distribution function (CDF) algorithms for several established disorder predictors. A consensus binary predictor, based on the artificial neural networks, NN-CDF, was developed by using output of the individual CDFs. The consensus method outperforms the individual predictors by 4–5% in the averaged accuracy.
Abbreviations: IDP, intrinsically disordered protein, IDR, intrinsically disordered region, CDF, cumulative distribution function, PONDR, predictor of natural disordered regions, PDD, partially disordered dataset
Keywords: Intrinsically disordered protein, Prediction, Accuracy, CDF
To access this article, please choose from the options below
PII: S0014-5793(09)00260-9
doi:10.1016/j.febslet.2009.03.070
© 2009 Federation of European Biochemical Societies
