About: BACKGROUND: All polypeptide backbones have the potential to form amyloid fibrils, which are associated with a number of degenerative disorders. However, the likelihood that amyloidosis would actually occur under physiological conditions depends largely on the amino acid composition of a protein. We explore using a naive Bayesian classifier and a weighted decision tree for predicting the amyloidogenicity of immunoglobulin sequences. RESULTS: The average accuracy based on leave-one-out (LOO) cross validation of a Bayesian classifier generated from 143 amyloidogenic sequences is 60.84%. This is consistent with the average accuracy of 61.15% for a holdout test set comprised of 103 AM and 28 non-amyloidogenic sequences. The LOO cross validation accuracy increases to 81.08% when the training set is augmented by the holdout test set. In comparison, the average classification accuracy for the holdout test set obtained using a decision tree is 78.64%. Non-amyloidogenic sequences are predicted with average LOO cross validation accuracies between 74.05% and 77.24% using the Bayesian classifier, depending on the training set size. The accuracy for the holdout test set was 89%. For the decision tree, the non-amyloidogenic prediction accuracy is 75.00%. CONCLUSIONS: This exploratory study indicates that both classification methods may be promising in providing straightforward predictions on the amyloidogenicity of a sequence. Nevertheless, the number of available sequences that satisfy the premises of this study are limited, and are consequently smaller than the ideal training set size. Increasing the size of the training set clearly increases the accuracy, and the expansion of the training set to include not only more derivatives, but more alignments, would make the method more sound. The accuracy of the classifiers may also be improved when additional factors, such as structural and physico-chemical data, are considered. The development of this type of classifier has significant applications in evaluating engineered antibodies, and may be adapted for evaluating engineered proteins in general.

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: BACKGROUND: All polypeptide backbones have the potential to form amyloid fibrils, which are associated with a number of degenerative disorders. However, the likelihood that amyloidosis would actually occur under physiological conditions depends largely on the amino acid composition of a protein. We explore using a naive Bayesian classifier and a weighted decision tree for predicting the amyloidogenicity of immunoglobulin sequences. RESULTS: The average accuracy based on leave-one-out (LOO) cross validation of a Bayesian classifier generated from 143 amyloidogenic sequences is 60.84%. This is consistent with the average accuracy of 61.15% for a holdout test set comprised of 103 AM and 28 non-amyloidogenic sequences. The LOO cross validation accuracy increases to 81.08% when the training set is augmented by the holdout test set. In comparison, the average classification accuracy for the holdout test set obtained using a decision tree is 78.64%. Non-amyloidogenic sequences are predicted with average LOO cross validation accuracies between 74.05% and 77.24% using the Bayesian classifier, depending on the training set size. The accuracy for the holdout test set was 89%. For the decision tree, the non-amyloidogenic prediction accuracy is 75.00%. CONCLUSIONS: This exploratory study indicates that both classification methods may be promising in providing straightforward predictions on the amyloidogenicity of a sequence. Nevertheless, the number of available sequences that satisfy the premises of this study are limited, and are consequently smaller than the ideal training set size. Increasing the size of the training set clearly increases the accuracy, and the expansion of the training set to include not only more derivatives, but more alignments, would make the method more sound. The accuracy of the classifiers may also be improved when additional factors, such as structural and physico-chemical data, are considered. The development of this type of classifier has significant applications in evaluating engineered antibodies, and may be adapted for evaluating engineered proteins in general. Goto Sponge NotDistinct Permalink

An Entity of Type : fabio:Abstract, within Data Space : covidontheweb.inria.fr associated with source document(s)

Attributes	Values
type	abstract
value	BACKGROUND: All polypeptide backbones have the potential to form amyloid fibrils, which are associated with a number of degenerative disorders. However, the likelihood that amyloidosis would actually occur under physiological conditions depends largely on the amino acid composition of a protein. We explore using a naive Bayesian classifier and a weighted decision tree for predicting the amyloidogenicity of immunoglobulin sequences. RESULTS: The average accuracy based on leave-one-out (LOO) cross validation of a Bayesian classifier generated from 143 amyloidogenic sequences is 60.84%. This is consistent with the average accuracy of 61.15% for a holdout test set comprised of 103 AM and 28 non-amyloidogenic sequences. The LOO cross validation accuracy increases to 81.08% when the training set is augmented by the holdout test set. In comparison, the average classification accuracy for the holdout test set obtained using a decision tree is 78.64%. Non-amyloidogenic sequences are predicted with average LOO cross validation accuracies between 74.05% and 77.24% using the Bayesian classifier, depending on the training set size. The accuracy for the holdout test set was 89%. For the decision tree, the non-amyloidogenic prediction accuracy is 75.00%. CONCLUSIONS: This exploratory study indicates that both classification methods may be promising in providing straightforward predictions on the amyloidogenicity of a sequence. Nevertheless, the number of available sequences that satisfy the premises of this study are limited, and are consequently smaller than the ideal training set size. Increasing the size of the training set clearly increases the accuracy, and the expansion of the training set to include not only more derivatives, but more alignments, would make the method more sound. The accuracy of the classifiers may also be improved when additional factors, such as structural and physico-chemical data, are considered. The development of this type of classifier has significant applications in evaluating engineered antibodies, and may be adapted for evaluating engineered proteins in general.
subject	Senescence Classification algorithms Statistical classification
part of	Using simple artificial intelligence methods for predicting amyloidogenesis in antibodies
is abstract of	Using simple artificial intelligence methods for predicting amyloidogenesis in antibodies
is hasSource of	covid:ann/target/067ef894c231d19881fc302d3eebb66d0aad760d covid:ann/target/e29af255f62aa8c3b6067ebdd1db1bd1a05962a7 covid:ann/target/606ccc51dc91ee2fb3a40e1e6ce04f0ba7e063fa covid:ann/target/42890b6b364bf58218ca45bf358a9b51fc942223 covid:ann/target/26ad93b090010720e8f033a64bf10c3c7fa3a793 covid:ann/target/c509fdb9ef45acf6e32932fde37e814999c42e6c covid:ann/target/35f3eaa9e7d19e40d9d3ffb4474ae936b283825e covid:ann/target/6059e193a68718da92b51b5ef3ad4d7c1702012f covid:ann/target/78c8dcbfd2df2bd51222d14d231bbf0e1f5841a1 covid:ann/target/9a007c6007e7f38359755a29652c334cede6b1cc covid:ann/target/1b54fc1d8f1104ad628af4f153f01e04cb91d4d2 covid:ann/target/a046fab8e10a2b7383da69cd86a665d975db121b covid:ann/target/911bc7264d5a158854513e46757ac48e1d3fe8b3 covid:ann/target/9439a2a9b6590485f502cb9a8f7a207affc5cc12 covid:ann/target/631ac166f2f694b6e54ac524a2c917246a8d709b covid:ann/target/11d21ac93282eb19dc10f5982c770c300f1f11f5 covid:ann/target/5f9dcc1b3b48d53714c048fc7b08ac48fde7774d covid:ann/target/8c9c9a5a0f77c4273244f5fb8addf1c92bfa2dc4 covid:ann/target/d6a03b73d2e339ec5b34cd68da98e051229e16fb covid:ann/target/711b5e7110e5016a545cae5e1b0ac291acff44c5 covid:ann/target/79074148e8465ca94abe081f55b87cffd2573277 covid:ann/target/b8e0e55b1ba9f8b79034417a1a287bbed1fb429f covid:ann/target/139201c1567b7d299151fbd295ec3eac343e1362 covid:ann/target/1677c8ac87f9eded30be5f7f401dc0ce7e4c2390 covid:ann/target/21774be37d4732ac03e2982a27d8a78fbdec97e9 covid:ann/target/37dee82081a605127a4f577d98e3fc1603b9e68f covid:ann/target/b6d2e23d71978831931167fac9838358b88283e2 covid:ann/target/63b28528a85781a5e9f27b168bab7a1d77eb1c58 covid:ann/target/77e6606dc509fe2c3f068d90d4e595d8ece7bbc0 covid:ann/target/c5dcb028a137d8892f09db5daf2c9a729bc23dc8 covid:ann/target/8cf0fad833ff425b66c856fb61685958bf6a6d5b covid:ann/target/6aa819473105a7cdc95263d596367b8bc809f347 covid:ann/target/218e37d6b8506899a29940d3b62c119ce7cfb59f covid:ann/target/ff9b519a0eca90baea4ecf5897ad1b3f83db26b8 covid:ann/target/4e9efadeee1cc5129b88129f57a73cb3cafe0371 covid:ann/target/5a70587b93f3d07922d3d5edaad52c5fdc58894e covid:ann/target/d7eabfec2e5c1901e3224bc6ccb99c0d4b32ce65 covid:ann/target/b05a217c8150fbd2e24985cf908a433e3a2f0384 covid:ann/target/d72fd64638136bb8e3a047944e2bd0ab66a5fc29 covid:ann/target/fb7188f919906da7ec57a28bba13240aaa1ce1f3 covid:ann/target/0f0cbc39a3e8e2e6126714b6651085493dba8669 covid:ann/target/fe6a3c9722d77f2694f7ba0c284374d282a7bc5f covid:ann/target/48174165cab38b95c5db8797bef76b80dd2ddbb7 covid:ann/target/8da56e8763ebadb8a1115350dcf2a42e3d39d7d1 covid:ann/target/b56fffcbf024237c80af3755b95e29c720c74337 covid:ann/target/0e396cc112c19c988273fdefe3e0ebf1f5dfeede covid:ann/target/4a338515b3f15ec407a0ec8bba836c321db95de9 covid:ann/target/add176f1dc7ddf15b7ae318131c45dcc37ab81b5 covid:ann/target/0bae3516f2bbd5ce1eb421983df6deb323b1574a covid:ann/target/b2d7d2e3e62af131ce533085bc78d9bb5567eea4 covid:ann/target/0bcd4f836c99ddb00ad699f0984aebc2de9b61e9 covid:ann/target/19b088277b64729080e0ab68fb2b5a98ea2f8df4 covid:ann/target/d185b737150ea7a3514e1544daa3e6757988b5e4 covid:ann/target/fff71f31a7dba10a7b0f40ff4485faefaa918a68 covid:ann/target/3318b367e023aa587ced596282030e6aa8a707ba covid:ann/target/ddbccdff786cae836a973bf1162dbf522c317f46 covid:ann/target/d4a5196c1ccc977e5f6b1fde5426fe0b50e38ed2 covid:ann/target/5a82a533d09b082ffb08393782df2003b8cf4280 covid:ann/target/b93f12ebdc798e8f8debfc3d9e53e685dcf9018b

Faceted Search & Find service v1.13.91 as of Mar 24 2020

Alternative Linked Data Documents: Sponger | ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3229 as of Jul 10 2020, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (94 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2025 OpenLink Software