The University of Texas Medical Branch
Department of Biochemistry and Molecular Biology Sealy Center for Structural Biology Computational Biology



SDAP 2.0 - Structural Database of Allergenic Proteins

Access to SDAP is available free of charge for academic and non-profit use. Licenses for commercial use can be obtained by contacting W. Braun (webraun@utmb.edu). Secure access to SDAP is available from https://fermi.utmb.edu/SDAP


Input Sequence

MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST
GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA
IVEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQIDNDG
IRRLVLSKNKPLVVEFQKFRSSTA
Sequence ID  Allergen Name  Length   Opt  Bit Score  E Value
AAB23483     Gly m TI          204  1365      318.0  1.3e-88
AAB23482     Gly m TI          203  1232      287.7  1.7e-79
CAA45777     Gly m TI          217   795      188.0  1.8e-49
AAB23464     Gly m TI          216   778      184.2  2.6e-48
CAA45778     Gly m TI          217   774      183.3  4.9e-48
P01071       Gly m TI          181   642      153.3  4.4e-39
CAA56343     Gly m TI          208   527      127.0  4.1e-31
O24383       Sola t 3.0101     186   150       41.1  2.6e-05
P30941       Sola t 4          221   124       35.1  0.002
CAA45723     Sola t 4          217   119       34.0  0.0043
P20347       Sola t 3.0102     222   118       33.7  0.0052
Please note: Alignments were made with FASTA version 36.3.8. As explained in the FASTA manual, the bit score is equivalent to the bit score reported by BLAST. A 1-bit increase in score corresponds to a 2-fold reduction in expectation, and a 10-bit increase implies a roughly 1000-fold (2^10 = 1024) lower expectation. Sequences with E values < 0.01 are almost always homologous. All FASTA search alignments are printed in BLAST format, where Query is the input sequence and Sbjct is the sequence found in the database.
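The relation between bit score and expectation described in the note can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of SDAP or FASTA: the function names `fold_reduction` and `bits_between` are hypothetical, and the only assumption is the rule stated above, that each additional bit halves the expectation for a fixed query and database.

```python
import math


def fold_reduction(bit_gain: float) -> float:
    """Fold reduction in expectation for a gain of `bit_gain` bits (2 ** bits)."""
    return 2.0 ** bit_gain


def bits_between(e_high: float, e_low: float) -> float:
    """Bit-score difference implied by two E values from the same search."""
    return math.log2(e_high / e_low)


# Rule of thumb from the note: 1 bit is 2-fold, 10 bits is about 1000-fold.
print(fold_reduction(1))
print(fold_reduction(10))

# Top two hits in the table: 318.0 vs 287.7 bits, E = 1.3e-88 vs 1.7e-79.
print(round(bits_between(1.7e-79, 1.3e-88), 1))
```

For the top two hits, the bit difference recovered from the E values comes out at roughly 30.3, which agrees with 318.0 - 287.7 within the rounding of the reported values.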

Sequence Alignment: 1. Allergen Name: Gly m TI Sequence ID: AAB23483

 Score = 318.0 bits 1365,  Expect = 1e-88
 Identities = 204/204 100%, Positives = 204/204 100%, Gaps = 0/204 0%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
            MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
Query  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
            GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA
Sbjct  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
Query  121  IVEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQIDNDG 180
            IVEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQIDNDG
Sbjct  121  IVEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQIDNDG 180
Query  181  IRRLVLSKNKPLVVEFQKFRSSTA 204
            IRRLVLSKNKPLVVEFQKFRSSTA
Sbjct  181  IRRLVLSKNKPLVVEFQKFRSSTA 204

Sequence Alignment: 2. Allergen Name: Gly m TI Sequence ID: AAB23482

 Score = 287.7 bits 1232,  Expect = 2e-79
 Identities = 187/203 92%, Positives = 194/203 95%, Gaps = 1/203 0%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
            MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGK GGIE +ST
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
Query  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
            GKEICPLTVVQSPN+ +KGIGLVF SPLHALFIAERYPLSIKF SFAVI LC  MPT+WA
Sbjct  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
Query  121  IVEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQIDNDG 180
            IVEREGLQAV LAARDTVDGWFNIERVSREYNDY KLVFCPQ+AEDNKCEDIGIQID+DG
Sbjct  121  IVEREGLQAVKLAARDTVDGWFNIERVSREYNDY-KLVFCPQQAEDNKCEDIGIQIDDDG 179
Query  181  IRRLVLSKNKPLVVEFQKFRSSTA 204
            IRRLVLSKNKPLVV+FQKFRSSTA
Sbjct  180  IRRLVLSKNKPLVVQFQKFRSSTA 203
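
The identity count in a Query/Sbjct block can be reproduced by comparing the two rows column by column. The sketch below is illustrative only and uses just the last block of alignment 2 (Query 181-204); `count_identities` is a hypothetical helper, and it does not attempt to reproduce how FASTA chooses the denominator reported in the summary line.

```python
from typing import Tuple


def count_identities(query: str, sbjct: str) -> Tuple[int, int]:
    """Return (identical columns, total columns) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same number of columns")
    # A column is an identity when both rows carry the same residue (not a gap).
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


# Last block of alignment 2, copied from the output above.
query = "IRRLVLSKNKPLVVEFQKFRSSTA"
sbjct = "IRRLVLSKNKPLVVQFQKFRSSTA"

identical, columns = count_identities(query, sbjct)
print(f"{identical}/{columns}")  # 23/24: the single difference is E vs Q
```

Running the same comparison over every block of an alignment and summing the counts gives the numerator of the Identities line.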

Sequence Alignment: 3. Allergen Name: Gly m TI Sequence ID: CAA45777

 Score = 188.0 bits 795,  Expect = 2e-49
 Identities = 127/197 64%, Positives = 153/197 77%, Gaps = 6/197 3%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
            MKSTIFFALFL CAFT SYLPSA A FVLD + +PL+NGGTYY+L  +    GGI    T
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITA-FGGIRAAPT 59
Query  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
            G E CPLTVVQS N+ +KGIG +  SP    FIAE +PLS+KFDSFAVI LC  +PT+W+
Sbjct  60   GNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWS 119
Query  121  IVER--EGLQAVTLAA-RDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQID 177
            +VE   EG  AV +   +D +DGWF +ERVS +  + YKLVFCPQ+AED+KC DIGI ID
Sbjct  120  VVEDLPEG-PAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISID 178
Query  178  -NDGIRRLVLSKNKPLVVEFQKF 199
             +DG RRLV+SKNKPLVV+FQK+
Sbjct  179  HDDGTRRLVVSKNKPLVVQFQKL 201

Sequence Alignment: 4. Allergen Name: Gly m TI Sequence ID: AAB23464

 Score = 184.2 bits 778,  Expect = 3e-48
 Identities = 126/196 64%, Positives = 152/196 77%, Gaps = 7/196 3%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
            MKSTIFF LFL CAFT SYLPSA A FVLD + +PL+NGGTYY+L  +    GGI    T
Sbjct  1    MKSTIFF-LFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITA-FGGIRAAPT 58
Query  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
            G E CPLTVVQS N+ +KGIG +  SP    FIAE +PLS+KFDSFAVI LC  +PT+W+
Sbjct  59   GNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWS 118
Query  121  IVER--EGLQAVTLAA-RDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQID 177
            +VE   EG  AV +   +D +DGWF +ERVS +  + YKLVFCPQ+AED+KC DIGI ID
Sbjct  119  VVEDLPEG-PAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISID 177
Query  178  -NDGIRRLVLSKNKPLVVEFQKF 199
             +DG RRLV+SKNKPLVV+FQK+
Sbjct  178  HDDGTRRLVVSKNKPLVVQFQKL 200

Sequence Alignment: 5. Allergen Name: Gly m TI Sequence ID: CAA45778

 Score = 183.3 bits 774,  Expect = 5e-48
 Identities = 126/196 64%, Positives = 149/196 76%, Gaps = 6/196 3%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
            MKSTIFFALFL CAFT SYLPSA A FVLD + +PL +GGTYY+L  +    GGI    T
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLDSGGTYYILSDITA-FGGIRAAPT 59
Query  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
            G E CPLTVVQS N+ +KGIG +  SP+   FIAE  PL +KFDSFAVI LC  +PT+W+
Sbjct  60   GNERCPLTVVQSRNELDKGIGTIISSPFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWS 119
Query  121  IVER--EGLQAVTLAA-RDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQID 177
            +VE   EG  AV +   +D VDGWF IERVS +  + YKLVFC Q+AED+KC DIGI ID
Sbjct  120  VVEDLPEG-PAVKIGENKDAVDGWFRIERVSDDEFNNYKLVFCTQQAEDDKCGDIGISID 178
Query  178  -NDGIRRLVLSKNKPLVVEFQK 198
             +DG RRLV+SKNKPLVV+FQK
Sbjct  179  HDDGTRRLVVSKNKPLVVQFQK 200

Sequence Alignment: 6. Allergen Name: Gly m TI Sequence ID: P01071

 Score = 153.3 bits 642,  Expect = 4e-39
 Identities = 105/170 61%, Positives = 127/170 74%, Gaps = 6/170 3%
Query  27   FVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNSTGKEICPLTVVQSPNKHNKGIGLVFKS 86
            FVLD + +PL NGGTYY+L  +    GGI    TG E CPLTVVQS N+ +KGIG +  S
Sbjct  2    FVLDNEGNPLSNGGTYYILSDITA-FGGIRAAPTGNERCPLTVVQSRNELDKGIGTIISS 60
Query  87   PLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWAIVER--EGLQAVTLAA-RDTVDGWFN 143
            P+   FIAE  PL +KFDSFAVI LC  +PT+W++VE   EG  AV +   +D VDGWF 
Sbjct  61   PFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWSVVEDLPEG-PAVKIGENKDAVDGWFR 119
Query  144  IERVSREYNDYYKLVFCPQEAEDNKCEDIGIQID-NDGIRRLVLSKNKPLVVEFQK 198
            IERVS +  + YKLVFC Q+AED+KC DIGI ID +DG RRLV+SKNKPLVV+FQK
Sbjct  120  IERVSDDEFNNYKLVFCTQQAEDDKCGDIGISIDHDDGTRRLVVSKNKPLVVQFQK 175

Sequence Alignment: 7. Allergen Name: Gly m TI Sequence ID: CAA56343

 Score = 127.0 bits 527,  Expect = 4e-31
 Identities = 98/200 49%, Positives = 130/200 65%, Gaps = 12/200 6%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
            MKST  +ALFL+CA+T SY PSATA  V+DT+ +P++NGGTYY+LPV+RGK GGIE   T
Sbjct  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
Query  61   GKEICPLTVVQSPNKH-NKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKW 119
              E CPLTVVQSP +   +G+ L+  SP+  L I E   LS+KF     + L      ++
Sbjct  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY 120
Query  120  AI--VEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQID 177
            +     R   Q   L   +    WF I+R S E N YYKLVFC    +D+ C DI   ID
Sbjct  121  SQGSARRTPCQTHWLQKHNRC--WFRIQRASSESN-YYKLVFCTSN-DDSSCGDIVAPID 176
Query  178  NDGIRRLVLS--KNKPLVVEFQK---FRSSTA 204
             +G R L+++  +N PL+V+FQK   + SSTA
Sbjct  177  REGNRPLIVTHDQNHPLLVQFQKVEAYESSTA 208

Sequence Alignment: 8. Allergen Name: Sola t 3.0101 Sequence ID: O24383

 Score = 41.1 bits 150,  Expect = 3e-05
 Identities = 50/161 31%, Positives = 75/161 46%, Gaps = 16/161 9%
Query  28   VLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNSTGKEICPLTVVQS---PNKHNKGIGLVF 84
            V D D +PL+ G  Y +   + G +G +  ++ G   CP  V+Q    P    KG  +VF
Sbjct  13   VYDQDGNPLRIGERYIIKNPLLG-AGAVYLDNIGNLQCPNAVLQHMSIPQFLGKGTPVVF 71
Query  85   KSPLHALFIAERYPLSIKFDSFAV--IPLCGVMPTKWAIVEREGLQAVTLAARDTVDGWF 142
                 + +      ++  +  F V    LC V  T W  V  E L  VT       +  F
Sbjct  72   IRKSESDYGDVVRLMTAVYIKFFVKTTKLC-VDETVWK-VNNEQL-VVTGGNVGNENDIF 128
Query  143  NIER---VSREYNDYYKLVFCPQEAEDNKCEDIGIQIDNDGIRRLVLSKNKPLVVEF 196
             I++   V R   + YKL+ CP   E   C++IG    N G  RLV   ++   + F
Sbjct  129  KIKKTDLVIRGMKNVYKLLHCPSHLE---CKNIGSNFKN-GYPRLVTVNDEKDFIPF 181

Sequence Alignment: 9. Allergen Name: Sola t 4 Sequence ID: P30941

 Score = 35.1 bits 124,  Expect = 0.002
 Identities = 53/171 30%, Positives = 85/171 49%, Gaps = 28/171 16%
Query  20   LPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGG--IEGNSTGKEI-CPLTVVQSPNKH 76
            LPS  A  VLD     L +  +Y ++    G  GG    G S   +  C   +     ++
Sbjct  29   LPS-DATPVLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIF----RY 83
Query  77   NKGIG-------LVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWAIVEREG-LQ 128
            N  +G       ++  S      I E   L+I+F + +   LC V  T W + + +  L 
Sbjct  84   NSDVGPSGTPVRFIGSSSHFGQGIFENELLNIQF-AISTSKLC-VSYTIWKVGDYDASLG 141
Query  129  AVTLAARDTV----DGWFNIERVSRE-YNDYY----KLVFCPQEAEDNKCEDIGIQIDND 179
             + L    T+      WF I + S+  YN  Y      + CP  ++D  C  +G+ +  +
Sbjct  142  TMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTMSCPFSSDDQFCLKVGV-VHQN 200
Query  180  GIRRLVLSKNKPLVVEFQK 198
            G RRL L K+ PL V F++
Sbjct  201  GKRRLALVKDNPLDVSFKQ 219

Sequence Alignment: 10. Allergen Name: Sola t 4 Sequence ID: CAA45723

 Score = 34.0 bits 119,  Expect = 0.004
 Identities = 52/173 30%, Positives = 85/173 49%, Gaps = 20/173 11%
Query  20   LPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGG--IEGNSTGKEI-CPLTVVQ-SPNK 75
            LPS  A  VLD     L +  +Y ++    G  GG    G S   +  C   + + + + 
Sbjct  29   LPS-DATPVLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNSDV 87
Query  76   HNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWAIVEREG-LQAVTLAA 134
               G  + F    + +F  E   L+I+F + +   LC V  T W + + +  L  + L  
Sbjct  88   GPSGTPVRFSHFGQGIF--ENELLNIQF-AISTSKLC-VSYTIWKVGDYDASLGTMLLET 143
Query  135  RDTV----DGWFNIERVSRE-YNDYY----KLVFCPQEAEDNKCEDIGIQIDNDGIRRLV 185
              T+      WF I + S+  YN  Y      + CP  ++D  C  +G+ +  +G RRL 
Sbjct  144  GGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTMSCPFSSDDQFCLKVGV-VHQNGKRRLA 202
Query  186  LSKNKPLVVEFQK 198
            L K+ PL V F++
Sbjct  203  LVKDNPLDVSFKQ 215

Sequence Alignment: 11. Allergen Name: Sola t 3.0102 Sequence ID: P20347

 Score = 33.7 bits 118,  Expect = 0.005
 Identities = 45/161 27%, Positives = 73/161 45%, Gaps = 17/161 10%
Query  28   VLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNSTGKEICPLTVVQS---PNKHNKGIGLVF 84
            V D D +PL+ G  Y +   + G +G +   + G   CP  V+Q    P    +G  +VF
Sbjct  48   VYDQDGNPLRIGERYIINNPLLG-AGAVYLYNIGNLQCPNAVLQHMSIPQFLGEGTPVVF 106
Query  85   KSPLHALFIAERYPLSIKFDSFAV--IPLCGVMPTKWAIVEREGLQAVTLAARDTVDGWF 142
                 + +      +++ +  F V    LC V  T W + + +    VT       +  F
Sbjct  107  VRKSESDYGDVVRVMTVVYIKFFVKTTKLC-VDQTVWKVNDEQ--LVVTGGKVGNENDIF 163
Query  143  NIER---VSREYNDY-YKLVFCPQEAEDNKCEDIGIQIDNDGIRRLVLSKNKPLVVEF 196
             I +   V+   + Y YKL+ CP       C++IG    N G  RLV   +    + F
Sbjct  164  KIMKTDLVTPGGSKYVYKLLHCPSHL---GCKNIGGNFKN-GYPRLVTVDDDKDFIPF 217

This site published by Surendra Negi
Copyright © 2001-2023 The University of Texas Medical Branch. Please review our privacy policy and Internet guidelines.