The University of Texas Medical Branch
Department of Biochemistry and Molecular Biology Sealy Center for Structural Biology Computational Biology



                SDAP 2.0 - Structural Database of Allergenic Proteins

Access to SDAP is available free of charge for academic and non-profit use. Licenses for commercial use can be obtained by contacting W. Braun (webraun@utmb.edu). Secure access to SDAP is available from https://fermi.utmb.edu/SDAP


Input Sequence

MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST
GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA
IVEREGLQAVKLAARDTVDGWFNIERVSREYNDYKLVFCPQQAEDNKCEDIGIQIDDDGI
RRLVLSKNKPLVVQFQKFRSSTA
Sequence ID   Allergen Name    Length   Opt Score   Bit Score   E Value
AAB23482      Gly m TI         203      1352        364.9       9.3e-103
AAB23483      Gly m TI         204      1232        333.1       3.5e-93
CAA45777      Gly m TI         217      862         235.1       1.2e-63
AAB23464      Gly m TI         216      845         230.6       2.8e-62
CAA45778      Gly m TI         217      841         229.5       5.9e-62
P01071        Gly m TI         181      709         194.6       1.6e-51
CAA56343      Gly m TI         208      553         153.2       5.2e-39
O24383        Sola t 3.0101    186      152         47.0        4.4e-07
P20347        Sola t 3.0102    222      122         39.0        0.00014
P30941        Sola t 4         221      116         37.4        0.00041
CAA45723      Sola t 4         217      114         36.8        0.00058
P16348        Sola t 2         188      104         34.3        0.003
Please note: Alignments were made with FASTA version 36.3.8. As explained in the FASTA manual, the bit score is equivalent to the bit score reported by BLAST. A 1-bit increase in score corresponds to a 2-fold reduction in expectation, and a 10-bit increase implies a roughly 1000-fold lower expectation. Sequences with E values < 0.01 are almost always homologous. All FASTA search sequence alignments are printed in BLAST format, where Query is the input sequence and Sbjct is the sequence found in the database.
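
The arithmetic in the note above can be sketched in a few lines of Python. This is an illustrative sketch, not part of SDAP or FASTA: the function name `expectation_fold_change`, the constant `E_VALUE_CUTOFF`, and the tuple layout are assumptions made for the example, and only four rows of the hit table are copied in. It shows that a 10-bit gain is a factor of 2**10 = 1024 (the "1000-fold" of the note) and applies the E value < 0.01 rule of thumb to the listed hits.

```python
def expectation_fold_change(bit_gain: float) -> float:
    """Fold reduction in expectation for a given increase in bit score."""
    # Each additional bit halves the expectation, so the factor is 2**bits.
    return 2.0 ** bit_gain


# (sequence ID, allergen name, bit score, E value), copied from the hit table.
hits = [
    ("AAB23482", "Gly m TI", 364.9, 9.3e-103),
    ("CAA56343", "Gly m TI", 153.2, 5.2e-39),
    ("O24383", "Sola t 3.0101", 47.0, 4.4e-07),
    ("P16348", "Sola t 2", 34.3, 0.003),
]

# Rule-of-thumb threshold quoted in the note (hypothetical constant name).
E_VALUE_CUTOFF = 0.01

# Keep the IDs of hits whose E value falls below the cutoff.
likely_homologs = [
    seq_id for seq_id, _name, _bits, e_value in hits if e_value < E_VALUE_CUTOFF
]

print(expectation_fold_change(1))   # 2.0
print(expectation_fold_change(10))  # 1024.0
print(likely_homologs)
```

The reported E values also depend on how FASTA estimates its statistics for each search, so ratios of E values between rows of the table need not match 2 raised to the bit-score difference exactly.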

Sequence Alignment: 1. Allergen Name: Gly m TI Sequence ID: AAB23482

 Score = 364.9 bits 1352,  Expect = 9e-103
 Identities = 203/203 100%, Positives = 203/203 100%, Gaps = 0/203 0%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
            MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
Query  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
            GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA
Sbjct  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
Query  121  IVEREGLQAVKLAARDTVDGWFNIERVSREYNDYKLVFCPQQAEDNKCEDIGIQIDDDGI 180
            IVEREGLQAVKLAARDTVDGWFNIERVSREYNDYKLVFCPQQAEDNKCEDIGIQIDDDGI
Sbjct  121  IVEREGLQAVKLAARDTVDGWFNIERVSREYNDYKLVFCPQQAEDNKCEDIGIQIDDDGI 180
Query  181  RRLVLSKNKPLVVQFQKFRSSTA 203
            RRLVLSKNKPLVVQFQKFRSSTA
Sbjct  181  RRLVLSKNKPLVVQFQKFRSSTA 203

Sequence Alignment: 2. Allergen Name: Gly m TI Sequence ID: AAB23483

 Score = 333.1 bits 1232,  Expect = 3e-93
 Identities = 187/203 92%, Positives = 194/203 95%, Gaps = 1/203 0%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
            MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGK GGIE +ST
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
Query  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
            GKEICPLTVVQSPN+ +KGIGLVF SPLHALFIAERYPLSIKF SFAVI LC  MPT+WA
Sbjct  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
Query  121  IVEREGLQAVKLAARDTVDGWFNIERVSREYNDY-KLVFCPQQAEDNKCEDIGIQIDDDG 179
            IVEREGLQAV LAARDTVDGWFNIERVSREYNDY KLVFCPQ+AEDNKCEDIGIQID+DG
Sbjct  121  IVEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQIDNDG 180
Query  180  IRRLVLSKNKPLVVQFQKFRSSTA 203
            IRRLVLSKNKPLVV+FQKFRSSTA
Sbjct  181  IRRLVLSKNKPLVVEFQKFRSSTA 204
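
The Identities, Positives, and Gaps figures in each summary line can be cross-checked against the alignment rows. The sketch below assumes the usual BLAST pairwise convention for the middle row: a letter marks an identical residue, `+` marks a positively scoring substitution, and a blank marks neither. The function name `tally_block` is hypothetical, and the input is the last three-row block of alignment 2 above.

```python
def tally_block(query: str, midline: str, sbjct: str) -> tuple[int, int, int]:
    """Return (identities, positives, gaps) for one Query/middle/Sbjct block."""
    # Pad the middle row in case trailing blanks were trimmed during copying.
    midline = midline.ljust(len(query))
    # A letter in the middle row means Query and Sbjct carry the same residue.
    identities = sum(1 for m in midline if m.isalpha())
    # Positives are identities plus substitutions marked with '+'.
    positives = identities + midline.count("+")
    # Gaps are written as '-' in either sequence row.
    gaps = query.count("-") + sbjct.count("-")
    return identities, positives, gaps


# Residues 180-203 of the query aligned to residues 181-204 of AAB23483.
block = tally_block(
    "IRRLVLSKNKPLVVQFQKFRSSTA",
    "IRRLVLSKNKPLVV+FQKFRSSTA",
    "IRRLVLSKNKPLVVEFQKFRSSTA",
)
print(block)  # (23, 24, 0)
```

Summing these tallies over every block of an alignment should reproduce the totals in its summary line, provided the middle rows were copied with their spacing intact.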

Sequence Alignment: 3. Allergen Name: Gly m TI Sequence ID: CAA45777

 Score = 235.1 bits 862,  Expect = 1e-63
 Identities = 137/196 69%, Positives = 158/196 80%, Gaps = 7/196 3%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
            MKSTIFFALFL CAFT SYLPSA A FVLD + +PL+NGGTYY+L  +   GG I    T
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGG-IRAAPT 59
Query  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
            G E CPLTVVQS NELDKGIG + +SP    FIAE +PLS+KF SFAVI LC G+PTEW+
Sbjct  60   GNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWS 119
Query  121  IVER--EGLQAVKLAA-RDTVDGWFNIERVSR-EYNDYKLVFCPQQAEDNKCEDIGIQID 176
            +VE   EG  AVK+   +D +DGWF +ERVS  E+N+YKLVFCPQQAED+KC DIGI ID
Sbjct  120  VVEDLPEG-PAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISID 178
Query  177  -DDGIRRLVLSKNKPLVVQFQKF 198
             DDG RRLV+SKNKPLVVQFQK+
Sbjct  179  HDDGTRRLVVSKNKPLVVQFQKL 201

Sequence Alignment: 4. Allergen Name: Gly m TI Sequence ID: AAB23464

 Score = 230.6 bits 845,  Expect = 3e-62
 Identities = 136/195 69%, Positives = 157/195 80%, Gaps = 8/195 4%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
            MKSTIFF LFL CAFT SYLPSA A FVLD + +PL+NGGTYY+L  +   GG I    T
Sbjct  1    MKSTIFF-LFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGG-IRAAPT 58
Query  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
            G E CPLTVVQS NELDKGIG + +SP    FIAE +PLS+KF SFAVI LC G+PTEW+
Sbjct  59   GNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWS 118
Query  121  IVER--EGLQAVKLAA-RDTVDGWFNIERVSR-EYNDYKLVFCPQQAEDNKCEDIGIQID 176
            +VE   EG  AVK+   +D +DGWF +ERVS  E+N+YKLVFCPQQAED+KC DIGI ID
Sbjct  119  VVEDLPEG-PAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISID 177
Query  177  -DDGIRRLVLSKNKPLVVQFQKF 198
             DDG RRLV+SKNKPLVVQFQK+
Sbjct  178  HDDGTRRLVVSKNKPLVVQFQKL 200

Sequence Alignment: 5. Allergen Name: Gly m TI Sequence ID: CAA45778

 Score = 229.5 bits 841,  Expect = 6e-62
 Identities = 136/195 69%, Positives = 154/195 78%, Gaps = 7/195 3%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
            MKSTIFFALFL CAFT SYLPSA A FVLD + +PL +GGTYY+L  +   GG I    T
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLDSGGTYYILSDITAFGG-IRAAPT 59
Query  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
            G E CPLTVVQS NELDKGIG + +SP+   FIAE  PL +KF SFAVI LC G+PTEW+
Sbjct  60   GNERCPLTVVQSRNELDKGIGTIISSPFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWS 119
Query  121  IVER--EGLQAVKLAA-RDTVDGWFNIERVSR-EYNDYKLVFCPQQAEDNKCEDIGIQID 176
            +VE   EG  AVK+   +D VDGWF IERVS  E+N+YKLVFC QQAED+KC DIGI ID
Sbjct  120  VVEDLPEG-PAVKIGENKDAVDGWFRIERVSDDEFNNYKLVFCTQQAEDDKCGDIGISID 178
Query  177  -DDGIRRLVLSKNKPLVVQFQK 197
             DDG RRLV+SKNKPLVVQFQK
Sbjct  179  HDDGTRRLVVSKNKPLVVQFQK 200

Sequence Alignment: 6. Allergen Name: Gly m TI Sequence ID: P01071

 Score = 194.6 bits 709,  Expect = 2e-51
 Identities = 115/169 68%, Positives = 132/169 78%, Gaps = 7/169 4%
Query  27   FVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDSTGKEICPLTVVQSPNELDKGIGLVFTS 86
            FVLD + +PL NGGTYY+L  +   GG I    TG E CPLTVVQS NELDKGIG + +S
Sbjct  2    FVLDNEGNPLSNGGTYYILSDITAFGG-IRAAPTGNERCPLTVVQSRNELDKGIGTIISS 60
Query  87   PLHALFIAERYPLSIKFGSFAVITLCAGMPTEWAIVER--EGLQAVKLAA-RDTVDGWFN 143
            P+   FIAE  PL +KF SFAVI LC G+PTEW++VE   EG  AVK+   +D VDGWF 
Sbjct  61   PFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWSVVEDLPEG-PAVKIGENKDAVDGWFR 119
Query  144  IERVSR-EYNDYKLVFCPQQAEDNKCEDIGIQID-DDGIRRLVLSKNKPLVVQFQK 197
            IERVS  E+N+YKLVFC QQAED+KC DIGI ID DDG RRLV+SKNKPLVVQFQK
Sbjct  120  IERVSDDEFNNYKLVFCTQQAEDDKCGDIGISIDHDDGTRRLVVSKNKPLVVQFQK 175

Sequence Alignment: 7. Allergen Name: Gly m TI Sequence ID: CAA56343

 Score = 153.2 bits 553,  Expect = 5e-39
 Identities = 101/200 50%, Positives = 133/200 66%, Gaps = 11/200 5%
Query  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
            MKST  +ALFL+CA+T SY PSATA  V+DT+ +P++NGGTYY+LPV+RGKGGGIE   T
Sbjct  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
Query  61   GKEICPLTVVQSPNE-LDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEW 119
              E CPLTVVQSP E L +G+ L+ +SP+  L I E   LS+KF     ++L +     +
Sbjct  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY 120
Query  120  AI--VEREGLQAVKLAARDTVDGWFNIERVSREYNDYKLVFCPQQAEDNKCEDIGIQIDD 177
            +     R   Q   L   +    WF I+R S E N YKLVFC    +D+ C DI   ID 
Sbjct  121  SQGSARRTPCQTHWLQKHNRC--WFRIQRASSESNYYKLVFCTSN-DDSSCGDIVAPIDR 177
Query  178  DGIRRLVLS--KNKPLVVQFQK---FRSSTA 203
            +G R L+++  +N PL+VQFQK   + SSTA
Sbjct  178  EGNRPLIVTHDQNHPLLVQFQKVEAYESSTA 208

Sequence Alignment: 8. Allergen Name: Sola t 3.0101 Sequence ID: O24383

 Score = 47.0 bits 152,  Expect = 4e-07
 Identities = 51/160 31%, Positives = 76/160 47%, Gaps = 17/160 10%
Query  28   VLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDSTGKEICPLTVVQS---PNELDKGIGLVF 84
            V D D +PL+ G  Y +   + G  G + +D+ G   CP  V+Q    P  L KG  +VF
Sbjct  13   VYDQDGNPLRIGERYIIKNPLLG-AGAVYLDNIGNLQCPNAVLQHMSIPQFLGKGTPVVF 71
Query  85   TSPLHALFIAERYPLSIKFGSFAVIT--LCAGMPTEWAIVEREGLQAVKLAARDTVDGWF 142
                 + +      ++  +  F V T  LC    T W  V  E L  V        +  F
Sbjct  72   IRKSESDYGDVVRLMTAVYIKFFVKTTKLCVD-ETVWK-VNNEQL-VVTGGNVGNENDIF 128
Query  143  NIER---VSREY-NDYKLVFCPQQAEDNKCEDIGIQIDDDGIRRLVLSKNKPLVVQF 195
             I++   V R   N YKL+ CP + E   C++IG    + G  RLV   ++   + F
Sbjct  129  KIKKTDLVIRGMKNVYKLLHCPSHLE---CKNIGSNFKN-GYPRLVTVNDEKDFIPF 181

Sequence Alignment: 9. Allergen Name: Sola t 3.0102 Sequence ID: P20347

 Score = 39.0 bits 122,  Expect = 0.0001
 Identities = 41/162 25%, Positives = 74/162 45%, Gaps = 14/162 8%
Query  28   VLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDSTGKEICPLTVVQS---PNELDKGIGLVF 84
            V D D +PL+ G  Y +   + G  G + + + G   CP  V+Q    P  L +G  +VF
Sbjct  48   VYDQDGNPLRIGERYIINNPLLG-AGAVYLYNIGNLQCPNAVLQHMSIPQFLGEGTPVVF 106
Query  85   TSPLHALFIAERYPLSIKFGSFAVIT--LCAGMPTEWAIVEREGL-QAVKLAARDTVDGW 141
                 + +      +++ +  F V T  LC    T W + + + +    K+   + +   
Sbjct  107  VRKSESDYGDVVRVMTVVYIKFFVKTTKLCVDQ-TVWKVNDEQLVVTGGKVGNENDIFKI 165
Query  142  FNIERVSREYNDY--KLVFCPQQAEDNKCEDIGIQIDDDGIRRLVLSKNKPLVVQF 195
               + V+   + Y  KL+ CP +     C++IG    + G  RLV   +    + F
Sbjct  166  MKTDLVTPGGSKYVYKLLHCPSHL---GCKNIGGNFKN-GYPRLVTVDDDKDFIPF 217

Sequence Alignment: 10. Allergen Name: Sola t 4 Sequence ID: P30941

 Score = 37.4 bits 116,  Expect = 0.0004
 Identities = 47/172 27%, Positives = 86/172 50%, Gaps = 25/172 14%
Query  20   LPSATAQFVLDTDDDPLQNGGTYYMLPVMRGK-GGGIEVDST--GKEICPLTVVQSPNEL 76
            LPS  A  VLD     L +  +Y ++    G  GG + +  +      C   + +  +++
Sbjct  29   LPS-DATPVLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNSDV 87
Query  77   DKG---IGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWAIVEREG-LQAVKL 132
                  + ++ +S      I E   L+I+F + +   LC    T W + + +  L  + L
Sbjct  88   GPSGTPVRFIGSSSHFGQGIFENELLNIQF-AISTSKLCVSY-TIWKVGDYDASLGTMLL 145
Query  133  AARDTV----DGWFNIERVSREYNDYKLVFCPQ--------QAEDNKCEDIGIQIDDDGI 180
                T+      WF I + S ++  Y L++CP          ++D  C  +G+ +  +G 
Sbjct  146  ETGGTIGQADSSWFKIVK-SSQFG-YNLLYCPVTSTMSCPFSSDDQFCLKVGV-VHQNGK 202
Query  181  RRLVLSKNKPLVVQFQK 197
            RRL L K+ PL V F++
Sbjct  203  RRLALVKDNPLDVSFKQ 219

Sequence Alignment: 11. Allergen Name: Sola t 4 Sequence ID: CAA45723

 Score = 36.8 bits 114,  Expect = 0.0006
 Identities = 48/170 28%, Positives = 87/170 51%, Gaps = 25/170 14%
Query  20   LPSATAQFVLDTDDDPLQNGGTYYMLPVMRGK-GGGIEVDST--GKEICPLTVVQSPNEL 76
            LPS  A  VLD     L +  +Y ++    G  GG + +  +      C   + +  +++
Sbjct  29   LPS-DATPVLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNSDV 87
Query  77   D-KGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWAIVEREG-LQAVKLAA 134
               G  + F+   + +F  E   L+I+F + +   LC    T W + + +  L  + L  
Sbjct  88   GPSGTPVRFSHFGQGIF--ENELLNIQF-AISTSKLCVSY-TIWKVGDYDASLGTMLLET 143
Query  135  RDTV----DGWFNIERVSREYNDYKLVFCPQ--------QAEDNKCEDIGIQIDDDGIRR 182
              T+      WF I + S ++  Y L++CP          ++D  C  +G+ +  +G RR
Sbjct  144  GGTIGQADSSWFKIVK-SSQFG-YNLLYCPVTSTMSCPFSSDDQFCLKVGV-VHQNGKRR 200
Query  183  LVLSKNKPLVVQFQK 197
            L L K+ PL V F++
Sbjct  201  LALVKDNPLDVSFKQ 215

Sequence Alignment: 12. Allergen Name: Sola t 2 Sequence ID: P16348

 Score = 34.3 bits 104,  Expect = 0.003
 Identities = 50/164 30%, Positives = 81/164 49%, Gaps = 22/164 13%
Query  28   VLDTDDDPLQNGGTYYMLPVMRGK-GGGIEVDST--GKEICPLTVVQSPNELDKGIGLVF 84
            VLDT+   L    +Y ++ + RG  GG + +  +      CP  V +  +++      V 
Sbjct  8    VLDTNGKELNPNSSYRIISIGRGALGGDVYLGKSPNSDAPCPDGVFRYNSDVGPSGTPVR 67
Query  85   TSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWAIVEREG-LQAVKLAARDTV----D 139
              PL    I E   L+I+F + A + LC    T W +      ++ + L    T+     
Sbjct  68   FIPLSGG-IFEDQLLNIQF-NIATVKLCVSY-TIWKVGNLNAYFRTMLLETGGTIGQADS 124
Query  140  GWFNIERVSREYNDYKLVFCPQQA--------EDNKCEDIGIQIDDDGIRRLVLSKNKPL 191
             +F I ++S     Y L++CP           +DN C  +G+ I + G RRL L    PL
Sbjct  125  SYFKIVKLSNF--GYNLLYCPITPPFLCPFCRDDNFCAKVGVVIQN-GKRRLALVNENPL 181
Query  192  VVQFQK 197
             V FQ+
Sbjct  182  DVLFQE 187

This site published by Surendra Negi
Copyright 2001-2023 The University of Texas Medical Branch. Please review our privacy policy and Internet guidelines.