SDAP 2.0 - Structural Database of Allergenic Proteins
AllergenAI: A deep learning model predicting allergenicity based on protein sequence
AllergenAI overview
Innovations in protein engineering can help redesign allergenic proteins to reduce adverse reactions in sensitive individuals. Achieving this requires a better understanding of the molecular properties of allergenic proteins and of the features that make a protein allergenic. We present a novel AI-based tool, AllergenAI, to quantify the allergenic potential of a given protein. Our approach is based solely on protein sequences, which distinguishes it from previous tools that use knowledge of the allergens' physicochemical and other properties in addition to sequence homology. We trained a convolutional neural network (CNN) on the sequences of allergenic proteins archived in three well-established databases, SDAP 2.0, COMPARE, and AlgPred 2, and assessed its prediction performance by cross-validation. We then used AllergenAI to identify proteins of the cupin family in date palm, spinach, maize, and red clover that have a high allergenicity score and might therefore cause adverse allergic reactions in sensitive individuals. By analyzing the feature importance scores (FIS) of vicilins, we identified a proline-alanine-rich (P-A) motif in the top 50% of FIS regions that overlapped with known IgE epitope regions of vicilin allergens. Furthermore, using approximately 1,600 allergen structures in our SDAP database, we showed the potential to incorporate 3D information in a CNN model. In the future, incorporating 3D information into the training data should enhance accuracy. AllergenAI is a novel foundation for identifying the critical features that distinguish allergenic proteins.
AllergenAI prediction: (Please note: AllergenAI was trained on allergenic proteins with fewer than 1,000 amino acids)
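Because the model is sequence-only and was trained on proteins shorter than 1,000 residues, it may help to check a sequence's length before submitting it and to see what a sequence-only CNN input can look like. The following is a minimal sketch, not the AllergenAI implementation: the page does not describe the actual preprocessing, so the one-hot encoding, the zero padding, and the names `is_within_training_range` and `one_hot_encode` are all illustrative assumptions.

```python
# Illustrative only: AllergenAI's real input encoding is not described on this page.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
MAX_LEN = 1000  # from the note above: training proteins had fewer than 1,000 residues


def is_within_training_range(sequence: str) -> bool:
    """Return True if the sequence is non-empty and shorter than MAX_LEN."""
    return 0 < len(sequence) < MAX_LEN


def one_hot_encode(sequence: str, max_len: int = MAX_LEN) -> list[list[int]]:
    """Encode a sequence as a max_len x 20 matrix, zero-padded on the right.

    Residues outside the 20 standard letters (for example X) become
    all-zero rows. This is one common encoding, assumed here for illustration.
    """
    index = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
    matrix = [[0] * len(AMINO_ACIDS) for _ in range(max_len)]
    for pos, residue in enumerate(sequence.upper()[:max_len]):
        col = index.get(residue)
        if col is not None:
            matrix[pos][col] = 1
    return matrix
```

A sequence for which `is_within_training_range` returns False lies outside the length range the model saw during training, so its score should be interpreted with caution.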
This site published by Surendra Negi
Copyright 2001-2023 The University of Texas Medical Branch.