The University of Texas Medical Branch
Department of Biochemistry and Molecular Biology Sealy Center for Structural Biology Computational Biology


                SDAP 2.0 - Structural Database of Allergenic Proteins

Access to SDAP is available free of charge for academic and non-profit use. Licenses for commercial use can be obtained by contacting W. Braun (webraun@utmb.edu). Secure access to SDAP is available from https://fermi.utmb.edu/SDAP


Input Sequence

MKCLFLLCLCLVPIVVFSSTFTSKNPINLPSDATPVLDVAGKELDSRLSYRIISTFWGAL
GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFSHFGQGIFENELLNIQFAISTSKL
CVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTMSCPF
SSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFKQVQ
Sequence Info  Allergen Name  Length  Opt   Bits Score  E Value
CAA45723       Sola t 4       217     1470  341.9       9.3e-96
P30941         Sola t 4       221     1452  337.8       1.6e-94
P16348         Sola t 2       188     909   214.1       2.4e-57
P20347         Sola t 3.0102  222     206   53.7        5.2e-09
AAB23464       Gly m TI       216     182   48.3        2.2e-07
CAA45778       Gly m TI       217     181   48.1        2.6e-07
CAA45777       Gly m TI       217     180   47.8        3.1e-07
O24383         Sola t 3.0101  186     177   47.2        4e-07
P01071         Gly m TI       181     163   44.1        3.5e-06
AAB23483       Gly m TI       204     119   34.0        0.0043
CAA56343       Gly m TI       208     118   33.7        0.0052
AAB23482       Gly m TI       203     114   32.8        0.0095
Please note: Alignments were made with FASTA version 36.3.8. As explained in the FASTA manual, the bit score is equivalent to the bit score reported by BLAST. A 1-bit increase in score corresponds to a 2-fold reduction in expectation, and a 10-bit increase implies a roughly 1000-fold lower expectation. Sequences with E values < 0.01 are almost always homologous. All FASTA search sequence alignments are printed in BLAST format, where Query is the input sequence and Sbjct is the sequence found in the database.
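The relationship between bit score and expectation stated in the note can be checked with a short calculation. The following is a minimal sketch and not part of SDAP or FASTA: the function name `fold_reduction` is hypothetical, and it assumes the query and database size are held fixed, so that the expectation scales as 2 to the power of minus the bit score.

```python
def fold_reduction(delta_bits: float) -> float:
    """Factor by which the expectation drops when the bit score
    rises by delta_bits (assumes a fixed query and database size)."""
    return 2.0 ** delta_bits


print(fold_reduction(1))   # 2.0
print(fold_reduction(10))  # 1024.0

# Difference in bit score between the top two hits in the table above.
print(fold_reduction(341.9 - 337.8))
```

As a sanity check against the table, the top two hits differ by 4.1 bits, which corresponds to a factor of about 17, in line with the ratio of their reported E values (1.6e-94 / 9.3e-96, also about 17).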

Sequence Alignment: 1. Allergen Name: Sola t 4 Sequence ID: CAA45723

 Score = 341.9 bits 1470,  Expect = 9e-96
 Identities = 217/217 100%, Positives = 217/217 100%, Gaps = 0/217 0%
Query  1    MKCLFLLCLCLVPIVVFSSTFTSKNPINLPSDATPVLDVAGKELDSRLSYRIISTFWGAL 60
            MKCLFLLCLCLVPIVVFSSTFTSKNPINLPSDATPVLDVAGKELDSRLSYRIISTFWGAL
Sbjct  1    MKCLFLLCLCLVPIVVFSSTFTSKNPINLPSDATPVLDVAGKELDSRLSYRIISTFWGAL 60
Query  61   GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFSHFGQGIFENELLNIQFAISTSKL 120
            GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFSHFGQGIFENELLNIQFAISTSKL
Sbjct  61   GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFSHFGQGIFENELLNIQFAISTSKL 120
Query  121  CVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTMSCPF 180
            CVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTMSCPF
Sbjct  121  CVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTMSCPF 180
Query  181  SSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFKQVQ 217
            SSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFKQVQ
Sbjct  181  SSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFKQVQ 217

Sequence Alignment: 2. Allergen Name: Sola t 4 Sequence ID: P30941

 Score = 337.8 bits 1452,  Expect = 2e-94
 Identities = 217/217 100%, Positives = 217/217 100%, Gaps = 4/217 1%
Query  1    MKCLFLLCLCLVPIVVFSSTFTSKNPINLPSDATPVLDVAGKELDSRLSYRIISTFWGAL 60
            MKCLFLLCLCLVPIVVFSSTFTSKNPINLPSDATPVLDVAGKELDSRLSYRIISTFWGAL
Sbjct  1    MKCLFLLCLCLVPIVVFSSTFTSKNPINLPSDATPVLDVAGKELDSRLSYRIISTFWGAL 60
Query  61   GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRF----SHFGQGIFENELLNIQFAIS 116
            GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRF    SHFGQGIFENELLNIQFAIS
Sbjct  61   GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFIGSSSHFGQGIFENELLNIQFAIS 120
Query  117  TSKLCVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTM 176
            TSKLCVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTM
Sbjct  121  TSKLCVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTM 180
Query  177  SCPFSSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFKQVQ 217
            SCPFSSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFKQVQ
Sbjct  181  SCPFSSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFKQVQ 221
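To make the Query/Sbjct layout concrete, the sketch below counts identical positions and gap columns in one block of the alignment above (Query 61-116 against Sbjct 61-120). It is an illustration only: the helper name `alignment_stats` is hypothetical, it counts over alignment columns, and the denominators SDAP uses in its Identities and Gaps line may be defined differently.

```python
def alignment_stats(query: str, sbjct: str) -> dict:
    """Count alignment columns, identical residues, and gap columns
    for two equal-length aligned strings ('-' marks a gap)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have the same length")
    pairs = list(zip(query, sbjct))
    identities = sum(1 for q, s in pairs if q == s and q != "-")
    gaps = sum(1 for q, s in pairs if q == "-" or s == "-")
    return {"columns": len(pairs), "identities": identities, "gaps": gaps}


# Second block of the alignment against P30941, copied from above.
query = "GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRF----SHFGQGIFENELLNIQFAIS"
sbjct = "GGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFIGSSSHFGQGIFENELLNIQFAIS"

stats = alignment_stats(query, sbjct)
print(stats)
```

The four gap columns in this block correspond to the four extra residues (IGSS) in the database sequence, which accounts for the length difference between the 217-residue query and the 221-residue entry.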

Sequence Alignment: 3. Allergen Name: Sola t 2 Sequence ID: P16348

 Score = 214.1 bits 909,  Expect = 2e-57
 Identities = 136/182 74%, Positives = 151/182 82%, Gaps = 0/182 0%
Query  35   PVLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNSDVGPSGTPV 94
            PVLD  GKEL+   SYRIIS   GALGGDVYLGKSPNSDAPC +G+FRYNSDVGPSGTPV
Sbjct  7    PVLDTNGKELNPNSSYRIISIGRGALGGDVYLGKSPNSDAPCPDGVFRYNSDVGPSGTPV 66
Query  95   RFSHFGQGIFENELLNIQFAISTSKLCVSYTIWKVGDYDASLGTMLLETGGTIGQADSSW 154
            RF  +  GIFE++LLNIQF I+T KLCVSYTIWKVG+ +A + TMLLETGGTIGQADSS+
Sbjct  67   RFIPLSGGIFEDQLLNIQFNIATVKLCVSYTIWKVGNLNAYFRTMLLETGGTIGQADSSY 126
Query  155  FKIVKSSQFGYNLLYCPVTSTMSCPFSSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSFK 214
            FKIVK S FGYNLLYCP+T    CPF  DD FC KVGVV QNGKRRLALV +NPLDV F+
Sbjct  127  FKIVKLSNFGYNLLYCPITPPFLCPFCRDDNFCAKVGVVIQNGKRRLALVNENPLDVLFQ 186
Query  215  QV 216
            +V
Sbjct  187  EV 188

Sequence Alignment: 4. Allergen Name: Sola t 3.0102 Sequence ID: P20347

 Score = 53.7 bits 206,  Expect = 5e-09
 Identities = 75/187 40%, Positives = 109/187 58%, Gaps = 44/187 23%
Query  5    FLLCLCLVPIVVFSSTFTSKNPINLPS----DATPVL----DVAGKELDSRLSYRIISTF 56
            FLL    + +V F+ +FTS+NPI LP+    D   VL    D  G  L     Y I + +
Sbjct  9    FLLLSSTLSLVAFARSFTSENPIVLPTTCHDDDNLVLPEVYDQDGNPLRIGERYIINNPL 68
Query  57   WGALGGDVYLGKSPNSDAPCANGIFRYNS--DVGPSGTPVRF-----SHFGQGIFENELL 109
             GA  G VYL    N    C N ++++ S       GTPV F     S +G  +    ++
Sbjct  69   LGA--GAVYLYNIGNLQ--CPNAVLQHMSIPQFLGEGTPVVFVRKSESDYGDVVRVMTVV 124
Query  110  NIQFAISTSKLCVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKS-------SQ 162
             I+F + T+KLCV  T+WKV D        L+ TGG +G  ++  FKI+K+       S+
Sbjct  125  YIKFFVKTTKLCVDQTVWKVND------EQLVVTGGKVGN-ENDIFKIMKTDLVTPGGSK 177
Query  163  FGYNLLYCPVTSTMSCPFSSDDQFCLKVGVVHQNGKRRLALVKDNPLDVSF 213
            + Y LL+CP  S + C           +G   +NG  RL  V D+   + F
Sbjct  178  YVYKLLHCP--SHLGCK---------NIGGNFKNGYPRLVTVDDDKDFIPF 217

Sequence Alignment: 5. Allergen Name: Gly m TI Sequence ID: AAB23464

 Score = 48.3 bits 182,  Expect = 2e-07
 Identities = 60/184 32%, Positives = 103/184 55%, Gaps = 38/184 20%
Query  4    LFLLCLCLVPIVVFSSTFTSKNPINLPSD-ATPVLDVAGKELDSRLSYRIISTFWGALGG 62
            LFL+C        F++++       LPS  A  VLD  G  L++  +Y I+S    A+GG
Sbjct  8    LFLFC-------AFTTSY-------LPSAIADFVLDNEGNPLENGGTYYILSDI-TAFGG 52
Query  63   DVYLGKSPNSDAPCANGIFRYNSDVGPS-GTPVRFSHFGQGIFENELLNIQF-AISTSKL 120
               +  +P  +  C   + +  +++    GT +   +  + I E   L+++F + +   L
Sbjct  53   ---IRAAPTGNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIML 109
Query  121  CVSY-TIWKVGDYDASLGTMLLETGGTIGQADSSWFKI--VKSSQFG-YNLLYCPVTSTM 176
            CV   T W V + D   G  +    G    A   WF++  V   +F  Y L++CP  +  
Sbjct  110  CVGIPTEWSVVE-DLPEGPAV--KIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAE- 165
Query  177  SCPFSSDDQFCLKVGVV--HQNGKRRLALVKDNPLDVSFKQV 216
                  DD+ C  +G+   H +G RRL + K+ PL V F+++
Sbjct  166  ------DDK-CGDIGISIDHDDGTRRLVVSKNKPLVVQFQKL 200

Sequence Alignment: 6. Allergen Name: Gly m TI Sequence ID: CAA45778

 Score = 48.1 bits 181,  Expect = 3e-07
 Identities = 63/184 34%, Positives = 100/184 54%, Gaps = 38/184 20%
Query  4    LFLLCLCLVPIVVFSSTFTSKNPINLPSD-ATPVLDVAGKELDSRLSYRIISTFWGALGG 62
            LFL+C        F++++       LPS  A  VLD  G  LDS  +Y I+S    A+GG
Sbjct  9    LFLFC-------AFTTSY-------LPSAIADFVLDNEGNPLDSGGTYYILSDI-TAFGG 53
Query  63   DVYLGKSPNSDAPCANGIFRYNSDVGPS-GTPVRFSHFGQGIFENELLNIQF-AISTSKL 120
               +  +P  +  C   + +  +++    GT +      + I E   L ++F + +   L
Sbjct  54   ---IRAAPTGNERCPLTVVQSRNELDKGIGTIISSPFRIRFIAEGNPLRLKFDSFAVIML 110
Query  121  CVSY-TIWKVGDYDASLGTMLLETGGTIGQADSSWFKI--VKSSQFG-YNLLYCPVTSTM 176
            CV   T W V + D   G  +    G    A   WF+I  V   +F  Y L++C   +  
Sbjct  111  CVGIPTEWSVVE-DLPEGPAV--KIGENKDAVDGWFRIERVSDDEFNNYKLVFCTQQAE- 166
Query  177  SCPFSSDDQFCLKVGVV--HQNGKRRLALVKDNPLDVSFKQV 216
                  DD+ C  +G+   H +G RRL + K+ PL V F++V
Sbjct  167  ------DDK-CGDIGISIDHDDGTRRLVVSKNKPLVVQFQKV 201

Sequence Alignment: 7. Allergen Name: Gly m TI Sequence ID: CAA45777

 Score = 47.8 bits 180,  Expect = 3e-07
 Identities = 60/184 32%, Positives = 103/184 55%, Gaps = 38/184 20%
Query  4    LFLLCLCLVPIVVFSSTFTSKNPINLPSD-ATPVLDVAGKELDSRLSYRIISTFWGALGG 62
            LFL+C        F++++       LPS  A  VLD  G  L++  +Y I+S    A+GG
Sbjct  9    LFLFC-------AFTTSY-------LPSAIADFVLDNEGNPLENGGTYYILSDI-TAFGG 53
Query  63   DVYLGKSPNSDAPCANGIFRYNSDVGPS-GTPVRFSHFGQGIFENELLNIQF-AISTSKL 120
               +  +P  +  C   + +  +++    GT +   +  + I E   L+++F + +   L
Sbjct  54   ---IRAAPTGNERCPLTVVQSRNELDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIML 110
Query  121  CVSY-TIWKVGDYDASLGTMLLETGGTIGQADSSWFKI--VKSSQFG-YNLLYCPVTSTM 176
            CV   T W V + D   G  +    G    A   WF++  V   +F  Y L++CP  +  
Sbjct  111  CVGIPTEWSVVE-DLPEGPAV--KIGENKDAMDGWFRLERVSDDEFNNYKLVFCPQQAE- 166
Query  177  SCPFSSDDQFCLKVGVV--HQNGKRRLALVKDNPLDVSFKQV 216
                  DD+ C  +G+   H +G RRL + K+ PL V F+++
Sbjct  167  ------DDK-CGDIGISIDHDDGTRRLVVSKNKPLVVQFQKL 201

Sequence Alignment: 8. Allergen Name: Sola t 3.0101 Sequence ID: O24383

 Score = 47.2 bits 177,  Expect = 4e-07
 Identities = 55/162 33%, Positives = 80/162 49%, Gaps = 23/162 14%
Query  36   VLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNS--DVGPSGTP 93
            V D  G  L     Y I + + GA  G VYL    N    C N ++++ S       GTP
Sbjct  13   VYDQDGNPLRIGERYIIKNPLLGA--GAVYLDNIGNLQ--CPNAVLQHMSIPQFLGKGTP 68
Query  94   VRF-----SHFGQGIFENELLNIQFAISTSKLCVSYTIWKVGDYDASLGTMLLETGGTIG 148
            V F     S +G  +     + I+F + T+KLCV  T+WKV +        L+ TGG +G
Sbjct  69   VVFIRKSESDYGDVVRLMTAVYIKFFVKTTKLCVDETVWKVNN------EQLVVTGGNVG 122
Query  149  QADSSWFKIVKSSQFGYNLLYCPVTSTMSCPFSSDDQFCLKVGVVHQNGKRRLALVKDNP 208
              ++  FKI K+      +    V   + CP   +   C  +G   +NG  RL  V D  
Sbjct  123  N-ENDIFKIKKTDLVIRGMK--NVYKLLHCPSHLE---CKNIGSNFKNGYPRLVTVNDEK 176
Query  209  LDVSF 213
              + F
Sbjct  177  DFIPF 181

Sequence Alignment: 9. Allergen Name: Gly m TI Sequence ID: P01071

 Score = 44.1 bits 163,  Expect = 4e-06
 Identities = 51/166 30%, Positives = 84/166 50%, Gaps = 23/166 13%
Query  36   VLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNSDVGPS-GTPV 94
            VLD  G  L +  +Y I+S    A+GG   +  +P  +  C   + +  +++    GT +
Sbjct  3    VLDNEGNPLSNGGTYYILSDI-TAFGG---IRAAPTGNERCPLTVVQSRNELDKGIGTII 58
Query  95   RFSHFGQGIFENELLNIQF-AISTSKLCVSY-TIWKVGDYDASLGTMLLETGGTIGQADS 152
                  + I E   L ++F + +   LCV   T W V + D   G  +    G    A  
Sbjct  59   SSPFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWSVVE-DLPEGPAV--KIGENKDAVD 115
Query  153  SWFKI--VKSSQFG-YNLLYCPVTSTMSCPFSSDDQFCLKVGVV--HQNGKRRLALVKDN 207
             WF+I  V   +F  Y L++C           ++D  C  +G+   H +G RRL + K+ 
Sbjct  116  GWFRIERVSDDEFNNYKLVFCTQ--------QAEDDKCGDIGISIDHDDGTRRLVVSKNK 167
Query  208  PLDVSFKQV 216
            PL V F++V
Sbjct  168  PLVVQFQKV 176

Sequence Alignment: 10. Allergen Name: Gly m TI Sequence ID: AAB23483

 Score = 34.0 bits 119,  Expect = 0.004
 Identities = 52/173 30%, Positives = 85/173 49%, Gaps = 20/173 11%
Query  29   LPS-DATPVLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNSDV 87
            LPS  A  VLD     L +  +Y ++    G  GG    G S   +  C   + + + + 
Sbjct  20   LPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGG--IEGNSTGKEI-CPLTVVQ-SPNK 75
Query  88   GPSGTPVRFSHFGQGIF--ENELLNIQF-AISTSKLC-VSYTIWKVGDYDASLGTMLLET 143
               G  + F    + +F  E   L+I+F + +   LC V  T W + + +  L  + L  
Sbjct  76   HNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKWAIVEREG-LQAVTLAA 134
Query  144  GGTIGQADSSWFKIVKSSQFGYNLLYCPVTSTMSCPFSSDDQFCLKVGV-VHQNGKRRLA 202
              T+      WF I + S+  YN  Y      + CP  ++D  C  +G+ +  +G RRL 
Sbjct  135  RDTV----DGWFNIERVSRE-YNDYY----KLVFCPQEAEDNKCEDIGIQIDNDGIRRLV 185
Query  203  LVKDNPLDVSFKQ 215
            L K+ PL V F++
Sbjct  186  LSKNKPLVVEFQK 198

Sequence Alignment: 11. Allergen Name: Gly m TI Sequence ID: CAA56343

 Score = 33.7 bits 118,  Expect = 0.005
 Identities = 50/185 27%, Positives = 91/185 49%, Gaps = 37/185 20%
Query  4    LFLLCLCLVPIVVFSSTFTSKNPINLPS-DATPVLDVAGKELDSRLSYRIISTFWGALGG 62
            LFLLC        ++S++        PS  A  V+D  G  + +  +Y ++    G  GG
Sbjct  9    LFLLC-------ALTSSYQ-------PSATADIVFDTEGNPIRNGGTYYVLPVIRGK-GG 53
Query  63   DVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFSHFGQ--GIFENELLNIQFAISTSKL 120
             + + K+     P    + +   +    G P+  S   +   I E  +L+++F + T   
Sbjct  54   GIEFAKTETETCPLT--VVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTP-- 109
Query  121  CVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFG--YNLLYCPVTSTMSC 178
             +S   + V  Y              + + +  WF+I ++S     Y L++C        
Sbjct  110  -LSLNSFSVDRYSQGSARRTPCQTHWLQKHNRCWFRIQRASSESNYYKLVFCT------- 161
Query  179  PFSSDDQFCLK-VGVVHQNGKRRLALVKD--NPLDVSFKQVQ 217
              S+DD  C   V  + + G R L +  D  +PL V F++V+
Sbjct  162  --SNDDSSCGDIVAPIDREGNRPLIVTHDQNHPLLVQFQKVE 201

Sequence Alignment: 12. Allergen Name: Gly m TI Sequence ID: AAB23482

 Score = 32.8 bits 114,  Expect = 0.009
 Identities = 48/170 28%, Positives = 86/170 50%, Gaps = 25/170 14%
Query  29   LPS-DATPVLDVAGKELDSRLSYRIISTFWGALGGDVYLGKSPNSDAPCANGIFRYNSDV 87
            LPS  A  VLD     L +  +Y ++    G  GG + +  +      C   + +  +++
Sbjct  20   LPSATAQFVLDTDDDPLQNGGTYYMLPVMRGK-GGGIEVDST--GKEICPLTVVQSPNEL 76
Query  88   GPSGTPVRFSHFGQGIF--ENELLNIQF-AISTSKLCVSY-TIWKVGDYDASLGTMLLET 143
               G  + F+   + +F  E   L+I+F + +   LC    T W + + +  L  + L  
Sbjct  77   D-KGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEWAIVEREG-LQAVKLAA 134
Query  144  GGTIGQADSSWFKIVKSSQF--GYNLLYCPVTSTMSCPFSSDDQFCLKVGV-VHQNGKRR 200
              T+      WF I + S+    Y L++CP          ++D  C  +G+ +  +G RR
Sbjct  135  RDTV----DGWFNIERVSREYNDYKLVFCPQ--------QAEDNKCEDIGIQIDDDGIRR 182
Query  201  LALVKDNPLDVSFKQ 215
            L L K+ PL V F++
Sbjct  183  LVLSKNKPLVVQFQK 197

This site published by Surendra Negi
Copyright 2001-2023 The University of Texas Medical Branch. Please review our privacy policy and Internet guidelines.