The University of Texas Medical Branch
Department of Biochemistry and Molecular Biology | Sealy Center for Structural Biology | Computational Biology



                SDAP 2.0 - Structural Database of Allergenic Proteins

Access to SDAP is available free of charge for academic and non-profit use. Licenses for commercial use can be obtained by contacting W. Braun (webraun@utmb.edu). Secure access to SDAP is available from https://fermi.utmb.edu/SDAP


Input Sequence

MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT
ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY
SQGSARRTPCQTHWLQKHNRCWFRIQRASSESNYYKLVFCTSNDDSSCGDIVAPIDREGN
RPLIVTHDQNHPLLVQFQKVEAYESSTA
Sequence Info  Allergen Name  Length  Opt   Bits Score  E Value
CAA56343       Gly m TI       208     1396  325.1       9.9e-91
AAB23482       Gly m TI       203     553   132.9       6.7e-33
AAB23483       Gly m TI       204     527   127.0       4.1e-31
CAA45778       Gly m TI       217     521   125.6       1.2e-30
CAA45777       Gly m TI       217     521   125.6       1.2e-30
AAB23464       Gly m TI       216     504   121.7       1.7e-29
P01071         Gly m TI       181     423   103.3       4.8e-24
P30941         Sola t 4       221     123   34.9        0.0024
CAA45723       Sola t 4       217     118   33.7        0.0052
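
For readers who want to post-process the hit list above, here is a minimal Python sketch of one way to parse it. The `Hit` class, the `parse_hit` helper, and the `E_CUTOFF` constant are hypothetical names introduced for illustration and are not part of SDAP. The sketch assumes each row is whitespace-separated, starts with the sequence ID, and ends with four numeric fields (Length, Opt, Bits Score, E Value), so the allergen name may contain spaces. The 0.01 cutoff is taken from the note that accompanies the results.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Hit:
    seq_id: str
    allergen: str
    length: int
    opt: int
    bits: float
    e_value: float


def parse_hit(row: str) -> Hit:
    """Parse one summary row; the allergen name may contain spaces."""
    tokens = row.split()
    seq_id = tokens[0]
    # The last four tokens are always numeric: Length, Opt, Bits Score, E Value.
    length, opt, bits, e_value = tokens[-4:]
    allergen = " ".join(tokens[1:-4])
    return Hit(seq_id, allergen, int(length), int(opt), float(bits), float(e_value))


# Rows copied from the result table above.
ROWS = """
CAA56343 Gly m TI 208 1396 325.1 9.9e-91
AAB23482 Gly m TI 203 553 132.9 6.7e-33
AAB23483 Gly m TI 204 527 127.0 4.1e-31
CAA45778 Gly m TI 217 521 125.6 1.2e-30
CAA45777 Gly m TI 217 521 125.6 1.2e-30
AAB23464 Gly m TI 216 504 121.7 1.7e-29
P01071 Gly m TI 181 423 103.3 4.8e-24
P30941 Sola t 4 221 123 34.9 0.0024
CAA45723 Sola t 4 217 118 33.7 0.0052
"""

hits = [parse_hit(line) for line in ROWS.strip().splitlines()]

# Assumed cutoff: the results note treats E values below 0.01 as likely homologs.
E_CUTOFF = 0.01
significant = [h for h in hits if h.e_value < E_CUTOFF]

# Count how many significant hits each allergen name has.
per_allergen = Counter(h.allergen for h in significant)
print(per_allergen)
```

Splitting from the right keeps the parser independent of column widths, so it works whether the rows are single-spaced or column-aligned.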
Please note: Alignments were made with FASTA version 36.3.8. As explained in the FASTA manual, the bit score is equivalent to the bit score reported by BLAST. A 1-bit increase in score corresponds to a 2-fold reduction in expectation, and a 10-bit increase implies a roughly 1000-fold lower expectation. Sequences with E values < 0.01 are almost always homologous. All FASTA search sequence alignments are printed in BLAST format, where Query is the input sequence and Sbjct is the sequence found in the database.
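
The bit-score rule in the note can be checked with a short calculation. The sketch below is illustrative only: `fold_reduction` is a hypothetical helper, and the comparison assumes that both hits come from the same search (same query, same database), so that their E values differ only through the bit score. Because the reported scores and E values are rounded, the agreement is expected to be approximate.

```python
import math


def fold_reduction(delta_bits: float) -> float:
    """Fold reduction in expectation implied by a bit-score increase."""
    return 2.0 ** delta_bits


# One extra bit halves the expectation; ten extra bits give 2**10 = 1024,
# which is the "roughly 1000-fold" figure quoted in the note.
one_bit = fold_reduction(1)
ten_bits = fold_reduction(10)

# Top two hits from the result table (bit score, E value).
bits_1, e_1 = 325.1, 9.9e-91   # CAA56343
bits_2, e_2 = 132.9, 6.7e-33   # AAB23482

# Compare on a log10 scale to avoid very large numbers.
predicted_log10 = (bits_1 - bits_2) * math.log10(2)
observed_log10 = math.log10(e_2 / e_1)

print(f"predicted log10 fold change: {predicted_log10:.2f}")
print(f"observed  log10 fold change: {observed_log10:.2f}")
```

The two values agree to within a small fraction of a log10 unit, which is consistent with the rule once rounding of the reported numbers is taken into account.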

Sequence Alignment: 1. Allergen Name: Gly m TI Sequence ID: CAA56343

 Score = 325.1 bits 1396,  Expect = 1e-90
 Identities = 208/208 100%, Positives = 208/208 100%, Gaps = 0/208 0%
Query  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
            MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT
Sbjct  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
Query  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY 120
            ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY
Sbjct  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY 120
Query  121  SQGSARRTPCQTHWLQKHNRCWFRIQRASSESNYYKLVFCTSNDDSSCGDIVAPIDREGN 180
            SQGSARRTPCQTHWLQKHNRCWFRIQRASSESNYYKLVFCTSNDDSSCGDIVAPIDREGN
Sbjct  121  SQGSARRTPCQTHWLQKHNRCWFRIQRASSESNYYKLVFCTSNDDSSCGDIVAPIDREGN 180
Query  181  RPLIVTHDQNHPLLVQFQKVEAYESSTA 208
            RPLIVTHDQNHPLLVQFQKVEAYESSTA
Sbjct  181  RPLIVTHDQNHPLLVQFQKVEAYESSTA 208

Sequence Alignment: 2. Allergen Name: Gly m TI Sequence ID: AAB23482

 Score = 132.9 bits 553,  Expect = 7e-33
 Identities = 101/200 50%, Positives = 133/200 66%, Gaps = 11/200 5%
Query  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
            MKST  +ALFL+CA+T SY PSATA  V+DT+ +P++NGGTYY+LPV+RGKGGGIE   T
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKGGGIEVDST 60
Query  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY 120
              E CPLTVVQSP E L +G+ L+ +SP+  L I E   LS+KF     ++L +     +
Sbjct  61   GKEICPLTVVQSPNE-LDKGIGLVFTSPLHALFIAERYPLSIKFGSFAVITLCAGMPTEW 119
Query  121  SQGSARRTPCQTHWLQKHNRC--WFRIQRASSESNYYKLVFCTSN-DDSSCGDIVAPIDR 177
            +     R   Q   L   +    WF I+R S E N YKLVFC    +D+ C DI   ID 
Sbjct  120  A--IVEREGLQAVKLAARDTVDGWFNIERVSREYNDYKLVFCPQQAEDNKCEDIGIQIDD 177
Query  178  EGNRPLIVTHDQNHPLLVQFQKVEAYESSTA 208
            +G R L+++  +N PL+VQFQK   + SSTA
Sbjct  178  DGIRRLVLS--KNKPLVVQFQK---FRSSTA 203
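
The Identities and Gaps figures in each alignment header can be recounted from the aligned Query and Sbjct strings. The following Python sketch does this for the last block of the alignment above (Query 178-208, Sbjct 178-203). `alignment_stats` is a hypothetical helper, not an SDAP or FASTA function. It counts only identities and gap columns; the Positives figure depends on the scoring matrix used by FASTA and is not reproduced here.

```python
def alignment_stats(query: str, sbjct: str) -> dict:
    """Count identical columns and gap columns in one aligned block."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have the same length")
    # A column is an identity when both residues match and neither is a gap.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # A column is a gap column when either sequence has a '-' at that position.
    gaps = sum(1 for q, s in zip(query, sbjct) if q == "-" or s == "-")
    return {"identities": identities, "gaps": gaps, "length": len(query)}


# Final block of alignment 2, copied from the Query and Sbjct lines above.
QUERY = "EGNRPLIVTHDQNHPLLVQFQKVEAYESSTA"
SBJCT = "DGIRRLVLS--KNKPLVVQFQK---FRSSTA"

stats = alignment_stats(QUERY, SBJCT)
print(stats)
```

Summing these counts over every block of an alignment should give the numerators of the Identities and Gaps fractions in its header.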

Sequence Alignment: 3. Allergen Name: Gly m TI Sequence ID: AAB23483

 Score = 127.0 bits 527,  Expect = 4e-31
 Identities = 98/200 49%, Positives = 129/200 64%, Gaps = 12/200 6%
Query  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
            MKST  +ALFL+CA+T SY PSATA  V+DT+ +P++NGGTYY+LPV+RGK GGIE   T
Sbjct  1    MKSTIFFALFLVCAFTISYLPSATAQFVLDTDDDPLQNGGTYYMLPVMRGKSGGIEGNST 60
Query  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTPLSLNSFSVDRY 120
              E CPLTVVQSP     +G+ L+  SP+  L I E   LS+KF     + L      ++
Sbjct  61   GKEICPLTVVQSP-NKHNKGIGLVFKSPLHALFIAERYPLSIKFDSFAVIPLCGVMPTKW 119
Query  121  SQGSARRTPCQTHWLQKHNRC--WFRIQRASSESN-YYKLVFCTSN-DDSSCGDIVAPID 176
            +     R   Q   L   +    WF I+R S E N YYKLVFC    +D+ C DI   ID
Sbjct  120  A--IVEREGLQAVTLAARDTVDGWFNIERVSREYNDYYKLVFCPQEAEDNKCEDIGIQID 177
Query  177  REGNRPLIVTHDQNHPLLVQFQKVEAYESSTA 208
             +G R L+++  +N PL+V+FQK   + SSTA
Sbjct  178  NDGIRRLVLS--KNKPLVVEFQK---FRSSTA 204

Sequence Alignment: 4. Allergen Name: Gly m TI Sequence ID: CAA45778

 Score = 125.6 bits 521,  Expect = 1e-30
 Identities = 102/193 52%, Positives = 126/193 65%, Gaps = 17/193 8%
Query  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
            MKST  +ALFL+CA+T+SY PSA AD V+D EGNP+ +GGTYY+L  I   GG I  A T
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLDSGGTYYILSDITAFGG-IRAAPT 59
Query  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFH------LCTPLSLNS 114
              E CPLTVVQS  E L +G+  IISSPF+I  I EG  L LKF       LC  +    
Sbjct  60   GNERCPLTVVQSRNE-LDKGIGTIISSPFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEW 118
Query  115  FSVDRYSQGSARRTPCQTHWLQKHNRCWFRIQRASS-ESNYYKLVFCTSN-DDSSCGDIV 172
              V+   +G A +       +      WFRI+R S  E N YKLVFCT   +D  CGDI 
Sbjct  119  SVVEDLPEGPAVKIGENKDAVDG----WFRIERVSDDEFNNYKLVFCTQQAEDDKCGDIG 174
Query  173  APIDRE-GNRPLIVTHDQNHPLLVQFQKVE 201
              ID + G R L+V+  +N PL+VQFQKV+
Sbjct  175  ISIDHDDGTRRLVVS--KNKPLVVQFQKVD 202

Sequence Alignment: 5. Allergen Name: Gly m TI Sequence ID: CAA45777

 Score = 125.6 bits 521,  Expect = 1e-30
 Identities = 100/193 51%, Positives = 126/193 65%, Gaps = 17/193 8%
Query  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
            MKST  +ALFL+CA+T+SY PSA AD V+D EGNP+ NGGTYY+L  I   GG I  A T
Sbjct  1    MKSTIFFALFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGG-IRAAPT 59
Query  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFH------LCTPLSLNS 114
              E CPLTVVQS  E L +G+  IISSP++I  I EG  LSLKF       LC  +    
Sbjct  60   GNERCPLTVVQSRNE-LDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEW 118
Query  115  FSVDRYSQGSARRTPCQTHWLQKHNRCWFRIQRASS-ESNYYKLVFCTSN-DDSSCGDIV 172
              V+   +G A +       +      WFR++R S  E N YKLVFC    +D  CGDI 
Sbjct  119  SVVEDLPEGPAVKIGENKDAMDG----WFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIG 174
Query  173  APIDRE-GNRPLIVTHDQNHPLLVQFQKVE 201
              ID + G R L+V+  +N PL+VQFQK++
Sbjct  175  ISIDHDDGTRRLVVS--KNKPLVVQFQKLD 202

Sequence Alignment: 6. Allergen Name: Gly m TI Sequence ID: AAB23464

 Score = 121.7 bits 504,  Expect = 2e-29
 Identities = 99/192 51%, Positives = 125/192 65%, Gaps = 18/192 9%
Query  1    MKSTTSLALFLLCALTSSYQPSATADIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKT 60
            MKST  + LFL+CA+T+SY PSA AD V+D EGNP+ NGGTYY+L  I   GG I  A T
Sbjct  1    MKSTI-FFLFLFCAFTTSYLPSAIADFVLDNEGNPLENGGTYYILSDITAFGG-IRAAPT 58
Query  61   ETETCPLTVVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFH------LCTPLSLNS 114
              E CPLTVVQS  E L +G+  IISSP++I  I EG  LSLKF       LC  +    
Sbjct  59   GNERCPLTVVQSRNE-LDKGIGTIISSPYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEW 117
Query  115  FSVDRYSQGSARRTPCQTHWLQKHNRCWFRIQRASS-ESNYYKLVFCTSN-DDSSCGDIV 172
              V+   +G A +       +      WFR++R S  E N YKLVFC    +D  CGDI 
Sbjct  118  SVVEDLPEGPAVKIGENKDAMDG----WFRLERVSDDEFNNYKLVFCPQQAEDDKCGDIG 173
Query  173  APIDRE-GNRPLIVTHDQNHPLLVQFQKVE 201
              ID + G R L+V+  +N PL+VQFQK++
Sbjct  174  ISIDHDDGTRRLVVS--KNKPLVVQFQKLD 201

Sequence Alignment: 7. Allergen Name: Gly m TI Sequence ID: P01071

 Score = 103.3 bits 423,  Expect = 5e-24
 Identities = 86/168 51%, Positives = 105/168 62%, Gaps = 17/168 10%
Query  26   DIVFDTEGNPIRNGGTYYVLPVIRGKGGGIEFAKTETETCPLTVVQSPFEGLQRGLPLII 85
            D V+D EGNP+ NGGTYY+L  I   GG I  A T  E CPLTVVQS  E L +G+  II
Sbjct  1    DFVLDNEGNPLSNGGTYYILSDITAFGG-IRAAPTGNERCPLTVVQSRNE-LDKGIGTII 58
Query  86   SSPFKILDITEGLILSLKFH------LCTPLSLNSFSVDRYSQGSARRTPCQTHWLQKHN 139
            SSPF+I  I EG  L LKF       LC  +      V+   +G A +       +    
Sbjct  59   SSPFRIRFIAEGNPLRLKFDSFAVIMLCVGIPTEWSVVEDLPEGPAVKIGENKDAVDG-- 116
Query  140  RCWFRIQRASS-ESNYYKLVFCTSN-DDSSCGDIVAPIDRE-GNRPLIVTHDQNHPLLVQ 196
              WFRI+R S  E N YKLVFCT   +D  CGDI   ID + G R L+V+  +N PL+VQ
Sbjct  117  --WFRIERVSDDEFNNYKLVFCTQQAEDDKCGDIGISIDHDDGTRRLVVS--KNKPLVVQ 172
Query  197  FQKVE 201
            FQKV+
Sbjct  173  FQKVD 177

Sequence Alignment: 8. Allergen Name: Sola t 4 Sequence ID: P30941

 Score = 34.9 bits 123,  Expect = 0.002
 Identities = 52/187 27%, Positives = 92/187 49%, Gaps = 37/187 19%
Query  9    LFLLC-------ALTSSYQ-------PSATADIVFDTEGNPIRNGGTYYVLPVIRGK-GG 53
            LFLLC        ++S++        PS  A  V+D  G  + +  +Y ++    G  GG
Sbjct  4    LFLLCLCLVPIVVFSSTFTSKNPINLPS-DATPVLDVAGKELDSRLSYRIISTFWGALGG 62
Query  54   GIEFAKTETETCPLT--VVQSPFEGLQRGLPL--IISSPFKILDITEGLILSLKFHLCTP 109
             + + K+     P    + +   +    G P+  I SS      I E  +L+++F + T 
Sbjct  63   DVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFIGSSSHFGQGIFENELLNIQFAISTS 122
Query  110  ---LSLNSFSVDRYSQGSARRTPCQTHWLQKHNRCWFRIQRASSESNYYKLVFCT----- 161
               +S   + V  Y              + + +  WF+I ++S     Y L++C      
Sbjct  123  KLCVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFG--YNLLYCPVTSTM 180
Query  162  ----SNDDSSCGDIVAPIDREGNRPLIVTHDQNHPLLVQFQKVE 201
                S+DD  C   V  + + G R L +  D  +PL V F++V+
Sbjct  181  SCPFSSDDQFCLK-VGVVHQNGKRRLALVKD--NPLDVSFKQVQ 221

Sequence Alignment: 9. Allergen Name: Sola t 4 Sequence ID: CAA45723

 Score = 33.7 bits 118,  Expect = 0.005
 Identities = 51/185 27%, Positives = 91/185 49%, Gaps = 37/185 20%
Query  9    LFLLC-------ALTSSYQ-------PSATADIVFDTEGNPIRNGGTYYVLPVIRGK-GG 53
            LFLLC        ++S++        PS  A  V+D  G  + +  +Y ++    G  GG
Sbjct  4    LFLLCLCLVPIVVFSSTFTSKNPINLPS-DATPVLDVAGKELDSRLSYRIISTFWGALGG 62
Query  54   GIEFAKTETETCPLT--VVQSPFEGLQRGLPLIISSPFKILDITEGLILSLKFHLCTP-- 109
             + + K+     P    + +   +    G P+  S  F    I E  +L+++F + T   
Sbjct  63   DVYLGKSPNSDAPCANGIFRYNSDVGPSGTPVRFSH-FG-QGIFENELLNIQFAISTSKL 120
Query  110  -LSLNSFSVDRYSQGSARRTPCQTHWLQKHNRCWFRIQRASSESNYYKLVFCT------- 161
             +S   + V  Y              + + +  WF+I ++S     Y L++C        
Sbjct  121  CVSYTIWKVGDYDASLGTMLLETGGTIGQADSSWFKIVKSSQFG--YNLLYCPVTSTMSC 178
Query  162  --SNDDSSCGDIVAPIDREGNRPLIVTHDQNHPLLVQFQKVE 201
              S+DD  C   V  + + G R L +  D  +PL V F++V+
Sbjct  179  PFSSDDQFCLK-VGVVHQNGKRRLALVKD--NPLDVSFKQVQ 217

This site published by Surendra Negi
Copyright © 2001-2023 The University of Texas Medical Branch. Please review our privacy policy and Internet guidelines.