The University of Texas Medical Branch
Department of Biochemistry and Molecular Biology Sealy Center for Structural Biology Computational Biology



SDAP 2.0 - Structural Database of Allergenic Proteins

Access to SDAP is available free of charge for academic and non-profit use. Licenses for commercial use can be obtained by contacting W. Braun (webraun@utmb.edu). Secure access to SDAP is available at https://fermi.utmb.edu/SDAP


Input Sequence

MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG
NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV
VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD
DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE
Sequence Info  Allergen Name  Length  Opt   Bit Score  E Value
CAA45777       Gly m TI       217     1457  338.9      7.2e-95
AAB23464       Gly m TI       216     1440  335.0      1.1e-93
CAA45778       Gly m TI       217     1404  326.8      3.1e-91
P01071         Gly m TI       181     1178  275.4      7.9e-76
AAB23482       Gly m TI       203     862   203.3      4.5e-54
AAB23483       Gly m TI       204     795   188.0      1.8e-49
CAA56343       Gly m TI       208     521   125.6      1.2e-30
CAA45723       Sola t 4       217     180   47.8       3.1e-07
O24383         Sola t 3.0101  186     146   40.2       5.3e-05
P16348         Sola t 2       188     139   38.6       0.00016
P30941         Sola t 4       221     138   38.3       0.00024
P20347         Sola t 3.0102  222     132   36.9       0.00062
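
The hit table above can also be read programmatically. The following is a minimal Python sketch, assuming each row is whitespace-separated in the order shown (sequence ID, allergen name, length, opt score, bit score, E value). The `Hit` class and `parse_hit` function are hypothetical names introduced for illustration and are not part of SDAP or FASTA. Because allergen names contain spaces, the four numeric fields are split off from the right.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    seq_id: str
    allergen: str
    length: int
    opt: int
    bits: float
    e_value: float


def parse_hit(row: str) -> Hit:
    """Parse one row of the hit table (hypothetical helper, not an SDAP API)."""
    # The last four fields are numeric; everything before them is ID + name.
    head, length, opt, bits, e_value = row.rsplit(None, 4)
    # The ID is the first token; the rest is the allergen name (may contain spaces).
    seq_id, allergen = head.split(None, 1)
    return Hit(seq_id, allergen, int(length), int(opt), float(bits), float(e_value))


rows = [
    "CAA45777 Gly m TI 217 1457 338.9 7.2e-95",
    "O24383 Sola t 3.0101 186 146 40.2 5.3e-05",
    "P20347 Sola t 3.0102 222 132 36.9 0.00062",
]
hits = [parse_hit(r) for r in rows]
for h in hits:
    print(h.seq_id, h.allergen, h.e_value)
```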
Please note: Alignments were made with FASTA version 36.3.8. As explained in the FASTA manual, the bit score is equivalent to the bit score reported by BLAST. A 1-bit increase in score corresponds to a 2-fold reduction in expectation, and a 10-bit increase implies a roughly 1000-fold lower expectation. Sequences with E values < 0.01 are almost always homologous. All FASTA search sequence alignments are printed in BLAST format, where Query is the input sequence and Sbjct is the sequence found in the database.
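
As a worked illustration of the note above, the sketch below converts a bit-score difference into a fold change in expectation (a factor of 2 per bit) and applies the E < 0.01 rule of thumb. The function names are hypothetical, and the comparison assumes the same query and database, so that only the bit score differs.

```python
def fold_reduction(delta_bits: float) -> float:
    """Fold reduction in expectation for a bit-score increase of delta_bits."""
    return 2.0 ** delta_bits


def likely_homologous(e_value: float, threshold: float = 0.01) -> bool:
    """Rule of thumb from the note: E values below 0.01 suggest homology."""
    return e_value < threshold


print(fold_reduction(1))    # 2.0
print(fold_reduction(10))   # 1024.0, i.e. roughly 1000-fold

# Top two hits in the table: 338.9 vs 335.0 bits.
print(round(fold_reduction(338.9 - 335.0), 1))

# Weakest hit in the table (P20347, E = 0.00062) still passes the threshold.
print(likely_homologous(0.00062))  # True
```

The 3.9-bit gap between the first two hits corresponds to roughly a 15-fold difference in expectation, which is broadly in line with their reported E values (7.2e-95 versus 1.1e-93).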

Sequence Alignment: 1. Allergen Name: Gly m TI Sequence ID: CAA45777

 Score = 338.9 bits 1457,  Expect = 7e-95
 Identities = 217/217 100%, Positives = 217/217 100%, Gaps = 0/217 0%
Query  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG 60
            MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG 60
Query  61   NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV 120
            NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV
Sbjct  61   NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV 120
Query  121  VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD 180
            VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD
Sbjct  121  VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD 180
Query  181  DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE 217
            DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE
Sbjct  181  DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE 217

Sequence Alignment: 2. Allergen Name: Gly m TI Sequence ID: AAB23464

 Score = 335.0 bits 1440,  Expect = 1e-93
 Identities = 216/216 100%, Positives = 216/216 100%, Gaps = 1/216 0%
Query  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG 60
            MKSTIFF LFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG
Sbjct  1    MKSTIFF-LFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG 59
Query  61   NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV 120
            NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV
Sbjct  60   NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV 119
Query  121  VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD 180
            VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD
Sbjct  120  VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD 179
Query  181  DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE 217
            DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE
Sbjct  180  DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE 216

Sequence Alignment: 3. Allergen Name: Gly m TI Sequence ID: CAA45778

 Score = 326.8 bits 1404,  Expect = 3e-91
 Identities = 208/217 95%, Positives = 215/217 99%, Gaps = 0/217 0%
Query  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTG 60
            MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPL++GGTYYILSDITAFGGIRAAPTG
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLDSGGTYYILSDITAFGGIRAAPTG 60
Query  61   NERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV 120
            NERCPLTVVQSRNELDKGIGTIISSP+RIRFIAEG+PL LKFDSFAVIMLCVGIPTEWSV
Sbjct  61   NERCPLTVVQSRNELDKGIGTIISSPFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWSV 120
Query  121  VEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHD 180
            VEDLPEGPAVKIGENKDA+DGWFR+ERVSDDEFNNYKLVFC QQAEDDKCGDIGISIDHD
Sbjct  121  VEDLPEGPAVKIGENKDAVDGWFRIERVSDDEFNNYKLVFCTQQAEDDKCGDIGISIDHD 180
Query  181  DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE 217
            DGTRRLVVSKNKPLVVQFQK+DKESLAKKNHGLSRSE
Sbjct  181  DGTRRLVVSKNKPLVVQFQKVDKESLAKKNHGLSRSE 217
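
To make the Identities line concrete, the sketch below counts identical positions between an aligned Query and Sbjct segment, using the last block of the alignment above. The function names are hypothetical. The percentages on this page appear to be truncated rather than rounded (208/217 is 95.85% but is shown as 95%), so the sketch uses integer division; that is an inference from the displayed numbers, not documented behaviour.

```python
def count_identities(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical positions, aligned length) for two aligned strings."""
    if len(query) != len(sbjct):
        raise ValueError("aligned segments must have equal length")
    # A gap character ('-') never counts as an identity.
    same = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return same, len(query)


def percent_truncated(part: int, whole: int) -> int:
    """Integer percentage, truncated as the page appears to do."""
    return part * 100 // whole


# Last block of alignment 3 (Query 181-217 vs Sbjct 181-217).
query = "DGTRRLVVSKNKPLVVQFQKLDKESLAKKNHGLSRSE"
sbjct = "DGTRRLVVSKNKPLVVQFQKVDKESLAKKNHGLSRSE"
same, total = count_identities(query, sbjct)
print(same, total)                   # 36 37
print(percent_truncated(208, 217))   # 95
```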

Sequence Alignment: 4. Allergen Name: Gly m TI Sequence ID: P01071

 Score = 275.4 bits 1178,  Expect = 8e-76
 Identities = 173/181 95%, Positives = 178/181 98%, Gaps = 0/181 0%
Query  26   DFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTGNERCPLTVVQSRNELDKGIGTIISS 85
            DFVLDNEGNPL NGGTYYILSDITAFGGIRAAPTGNERCPLTVVQSRNELDKGIGTIISS
Sbjct  1    DFVLDNEGNPLSNGGTYYILSDITAFGGIRAAPTGNERCPLTVVQSRNELDKGIGTIISS 60
Query  86   PYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSVVEDLPEGPAVKIGENKDAMDGWFRL 145
            P+RIRFIAEG+PL LKFDSFAVIMLCVGIPTEWSVVEDLPEGPAVKIGENKDA+DGWFR+
Sbjct  61   PFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWSVVEDLPEGPAVKIGENKDAVDGWFRI 120
Query  146  ERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHDDGTRRLVVSKNKPLVVQFQKLDKES 205
            ERVSDDEFNNYKLVFC QQAEDDKCGDIGISIDHDDGTRRLVVSKNKPLVVQFQK+DKES
Sbjct  121  ERVSDDEFNNYKLVFCTQQAEDDKCGDIGISIDHDDGTRRLVVSKNKPLVVQFQKVDKES 180
Query  206  L 206
            L
Sbjct  181  L 181

Sequence Alignment: 5. Allergen Name: Gly m TI Sequence ID: AAB23482

 Score = 203.3 bits 862,  Expect = 4e-54
 Identities = 137/196 69%, Positives = 158/196 80%, Gaps = 7/196 3%
Query  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGG-IRAAPT 59
            MKSTIFFALFL CAFT SYLPSA A FVLD + +PL+NGGTYY+L  +   GG I    T
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
Query  60   GNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWS 119
            G E CPLTVVQS NELDKGIG + +SP    FIAE +PLS+KF SFAVI LC G+PTEW+
Sbjct  61   GKEICPLTVVQSPNELDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWA 120
Query  120  VVEDLPEG-PAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISID 178
            +VE   EG  AVK+   +D +DGWF +ERVS  E+N+YKLVFCPQQAED+KC DIGI ID
Sbjct  121  IVER--EGLQAVKLAA-RDTVDGWFNIERVSR-EYNDYKLVFCPQQAEDNKCEDIGIQID 176
Query  179  HDDGTRRLVVSKNKPLVVQFQKL 201
             DDG RRLV+SKNKPLVVQFQK+
Sbjct  177  -DDGIRRLVLSKNKPLVVQFQKF 198

Sequence Alignment: 6. Allergen Name: Gly m TI Sequence ID: AAB23483

 Score = 188.0 bits 795,  Expect = 2e-49
 Identities = 125/198 63%, Positives = 150/198 75%, Gaps = 4/198 2%
Query  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITA-FGGIRAAPT 59
            MKSTIFFALFL CAFT SYLPSA A FVLD + +PL+NGGTYY+L  +    GGI    T
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
Query  60   GNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWS 119
            G E CPLTVVQS N+ +KGIG +  SP    FIAE +PLS+KFDSFAVI LC  +PT+W+
Sbjct  61   GKEICPLTVVQSPNKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWA 120
Query  120  VVEDLPEGPAVKIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIGISIDH 179
            +VE   EG        +D +DGWF +ERVS +  + YKLVFCPQ+AED+KC DIGI ID 
Sbjct  121  IVER--EGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQID- 177
Query  180  DDGTRRLVVSKNKPLVVQFQKL 201
            +DG RRLV+SKNKPLVV+FQK+
Sbjct  178  NDGIRRLVLSKNKPLVVEFQKF 199

Sequence Alignment: 7. Allergen Name: Gly m TI Sequence ID: CAA56343

 Score = 125.6 bits 521,  Expect = 1e-30
 Identities = 100/193 51%, Positives = 126/193 65%, Gaps = 17/193 8%
Query  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGG-IRAAPT 59
            MKST  +ALFL+CA+T+SY PSA AD V+D EGNP+ NGGTYY+L  I   GG I  A T
Sbjct  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
Query  60   GNERCPLTVVQSRNE-LDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEW 118
              E CPLTVVQS  E L +G+  IISSP++I  I EG  LSLKF       LC  +    
Sbjct  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFH------LCTPLSLNS 114
Query  119  SVVEDLPEGPAVKIGENKDAMDG----WFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIG 174
              V+   +G A +       +      WFR++R S  E N YKLVFC    +D  CGDI 
Sbjct  115  FSVDRYSQGSARRTPCQTHWLQKHNRCWFRIQRASS-ESNYYKLVFCTSN-DDSSCGDIV 172
Query  175  ISIDHDDGTRRLVVS--KNKPLVVQFQKLD 202
              ID + G R L+V+  +N PL+VQFQK++
Sbjct  173  APIDRE-GNRPLIVTHDQNHPLLVQFQKVE 201

Sequence Alignment: 8. Allergen Name: Sola t 4 Sequence ID: CAA45723

 Score = 47.8 bits 180,  Expect = 3e-07
 Identities = 60/184 32%, Positives = 103/184 55%, Gaps = 38/184 20%
Query  9    LFLFC-------AFTTSY-------LPSAIADFVLDNEGNPLENGGTYYILSDI-TAFGG 53
            LFL+C        F++++       LPS  A  VLD  G  L++  +Y I+S    A+GG
Sbjct  4    LFLLCLCLVPIVVFSSTFTSKNPINLPSD-ATPVLDVAGKELDSRLSYRIISTFWGALGG 62
Query  54   ---IRAAPTGNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIML 110
               +  +P  +  C   + +  +++    GT +   +  + I E   L+++F + +   L
Sbjct  63   DVYLGKSPNSDAPCANGIFRYNSDVGPS-GTPVRFSHFGQGIFENELLNIQF-AISTSKL 120
Query  111  CVGIPTEWSVVE-DLPEGPAV--KIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAE- 166
            CV   T W V + D   G  +    G    A   WF++  V   +F  Y L++CP  +  
Sbjct  121  CVSY-TIWKVGDYDASLGTMLLETGGTIGQADSSWFKI--VKSSQFG-YNLLYCPVTSTM 176
Query  167  ------DDK-CGDIGISIDHDDGTRRLVVSKNKPLVVQFQKL 201
                  DD+ C  +G+   H +G RRL + K+ PL V F+++
Sbjct  177  SCPFSSDDQFCLKVGVV--HQNGKRRLALVKDNPLDVSFKQV 216

Sequence Alignment: 9. Allergen Name: Sola t 3.0101 Sequence ID: O24383

 Score = 40.2 bits 146,  Expect = 5e-05
 Identities = 47/161 29%, Positives = 73/161 45%, Gaps = 18/161 11%
Query  28   VLDNEGNPLENGGTYYILSDITAFGGIRAAPTGNERCPLTVVQSRN---ELDKGIGTIIS 84
            V D +GNPL  G  Y I + +   G +     GN +CP  V+Q  +    L KG   +  
Sbjct  13   VYDQDGNPLRIGERYIIKNPLLGAGAVYLDNIGNLQCPNAVLQHMSIPQFLGKGTPVVFI 72
Query  85   SPYRIRFIAEGHPLSLKFDSFAV--IMLCVGIPTEWSVVEDLPEGPAVKIGENKDAMDGW 142
                  +      ++  +  F V    LCV   T W V  +      V  G N    +  
Sbjct  73   RKSESDYGDVVRLMTAVYIKFFVKTTKLCVD-ETVWKVNNE----QLVVTGGNVGNENDI 127
Query  143  FRLER---VSDDEFNNYKLVFCPQQAEDDKCGDIGISIDHDDGTRRLVVSKNKPLVVQF 198
            F++++   V     N YKL+ CP + E   C +IG   +  +G  RLV   ++   + F
Sbjct  128  FKIKKTDLVIRGMKNVYKLLHCPSHLE---CKNIG--SNFKNGYPRLVTVNDEKDFIPF 181

Sequence Alignment: 10. Allergen Name: Sola t 2 Sequence ID: P16348

 Score = 38.6 bits 139,  Expect = 0.0002
 Identities = 50/166 30%, Positives = 86/166 51%, Gaps = 23/166 13%
Query  28   VLDNEGNPLENGGTYYILS-DITAFGG---IRAAPTGNERCPLTVVQSRNELDKGIGTII 83
            VLD  G  L    +Y I+S    A+GG   +  +P  +  CP  V +  +++    GT +
Sbjct  8    VLDTNGKELNPNSSYRIISIGRGALGGDVYLGKSPNSDAPCPDGVFRYNSDVGPS-GTPV 66
Query  84   SSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSV--VEDLPEGPAVKIGENKDAMDG 141
                    I E + L+++F+  A + LCV   T W V  +        ++ G      D 
Sbjct  67   RFIPLSGGIFEDQLLNIQFN-IATVKLCVSY-TIWKVGNLNAYFRTMLLETGGTIGQADS 124
Query  142  -WFRLERVSDDEFNNYKLVFCPQQA--------EDDKCGDIGISIDHDDGTRRLVVSKNK 192
             +F++ ++S+  F  Y L++CP           +D+ C  +G+ I   +G RRL +    
Sbjct  125  SYFKIVKLSN--FG-YNLLYCPITPPFLCPFCRDDNFCAKVGVVI--QNGKRRLALVNEN 179
Query  193  PLVVQFQKL 201
            PL V FQ++
Sbjct  180  PLDVLFQEV 188

Sequence Alignment: 11. Allergen Name: Sola t 4 Sequence ID: P30941

 Score = 38.3 bits 138,  Expect = 0.0002
 Identities = 62/184 33%, Positives = 105/184 57%, Gaps = 42/184 22%
Query  9    LFLFC-------AFTTSY-------LPSAIADFVLDNEGNPLENGGTYYILSDI-TAFGG 53
            LFL+C        F++++       LPS  A  VLD  G  L++  +Y I+S    A+GG
Sbjct  4    LFLLCLCLVPIVVFSSTFTSKNPINLPSD-ATPVLDVAGKELDSRLSYRIISTFWGALGG 62
Query  54   ---IRAAPTGNERCPLTVVQSRNELDKGIGTII----SSPYRIRFIAEGHPLSLKFDSFA 106
               +  +P  +  C   + +  +++    GT +    SS +  + I E   L+++F + +
Sbjct  63   DVYLGKSPNSDAPCANGIFRYNSDVGPS-GTPVRFIGSSSHFGQGIFENELLNIQF-AIS 120
Query  107  VIMLCVGIPTEWSVVE-DLPEGPAV--KIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQ 163
               LCV   T W V + D   G  +    G    A   WF++  V   +F  Y L++CP 
Sbjct  121  TSKLCVSY-TIWKVGDYDASLGTMLLETGGTIGQADSSWFKI--VKSSQFG-YNLLYCPV 176
Query  164  QAE-------DDK-CGDIGISIDHDDGTRRLVVSKNKPLVVQFQKL 201
             +        DD+ C  +G+   H +G RRL + K+ PL V F+++
Sbjct  177  TSTMSCPFSSDDQFCLKVGVV--HQNGKRRLALVKDNPLDVSFKQV 220

Sequence Alignment: 12. Allergen Name: Sola t 3.0102 Sequence ID: P20347

 Score = 36.9 bits 132,  Expect = 0.0006
 Identities = 44/161 27%, Positives = 72/161 44%, Gaps = 14/161 8%
Query  28   VLDNEGNPLENGGTYYILSDITAFGGIRAAPTGNERCPLTVVQSRN---ELDKGIGTIIS 84
            V D +GNPL  G  Y I + +   G +     GN +CP  V+Q  +    L +G   +  
Sbjct  48   VYDQDGNPLRIGERYIINNPLLGAGAVYLYNIGNLQCPNAVLQHMSIPQFLGEGTPVVFV 107
Query  85   SPYRIRFIAEGHPLSLKFDSFAV--IMLCVGIPTEWSVVEDLPEGPAVKIGENKDAMDGW 142
                  +      +++ +  F V    LCV   T W V ++       K+G   D     
Sbjct  108  RKSESDYGDVVRVMTVVYIKFFVKTTKLCVD-QTVWKVNDEQLVVTGGKVGNENDIFK-I 165
Query  143  FRLERVSDDEFNN-YKLVFCPQQAEDDKCGDIGISIDHDDGTRRLV-VSKNKPLV 195
             + + V+       YKL+ CP +     C +IG   +  +G  RLV V  +K ++
Sbjct  166  MKTDLVTPGGSKYVYKLLHCPSHL---GCKNIG--GNFKNGYPRLVTVDDDKDFI 215

This site published by Surendra Negi
Copyright 2001-2023 The University of Texas Medical Branch.