SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for WP_020458017.1.31213 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  WP_020458017.1.31213
Domain Number 1 Region: 366-522
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 2.56e-54
Family Cellulose-binding domain family III 0.0000000415
 
Domain Number 2 Region: 725-865
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 4.97e-39
Family Cellulose-binding domain family III 0.000000137
 
Domain Number 3 Region: 890-1030
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 5.68e-39
Family Cellulose-binding domain family III 0.000000137
 
Domain Number 4 Region: 1055-1195
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 7.07e-39
Family Cellulose-binding domain family III 0.000000144
 
Domain Number 5 Region: 1385-1525
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 7.07e-39
Family Cellulose-binding domain family III 0.000000144
 
Domain Number 6 Region: 1220-1360
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 9.94e-39
Family Cellulose-binding domain family III 0.000000109
 
Domain Number 7 Region: 562-702
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 2.36e-38
Family Cellulose-binding domain family III 0.000000158
 
Domain Number 8 Region: 1549-1689
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 1.2e-36
Family Cellulose-binding domain family III 0.00000115
 
Domain Number 9 Region: 184-321
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 1.36e-36
Family Cellulose-binding domain family III 0.000000127
 
Domain Number 10 Region: 30-178
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 5.76e-36
Family Cellulose-binding domain family III 0.00000273
 
Domain Number 11 Region: 1698-1791
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 9.42e-17
Family Pre-dockerin domain 0.0000031
 
Domain Number 12 Region: 1794-1850
Classification Level Classification E-value
Superfamily Type I dockerin domain 0.000000000144
Family Type I dockerin domain 0.0000353
 
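The strong hits above are listed by superfamily E-value, not by position along the protein, so the domain numbers do not follow sequence order (domain 10 is the most N-terminal hit, for example). The following is a minimal Python sketch that reorders the listed regions by start coordinate and reports the unassigned stretches between them. The names `REGIONS`, `by_position`, `linker_gaps`, and `covered_residues` are hypothetical helpers, not part of SUPERFAMILY, and the sketch assumes the region coordinates are 1-based and inclusive.

```python
# Regions copied from the strong hits listed above: (domain number, start, end).
# Assumption: coordinates are 1-based and inclusive.
REGIONS = [
    (1, 366, 522), (2, 725, 865), (3, 890, 1030), (4, 1055, 1195),
    (5, 1385, 1525), (6, 1220, 1360), (7, 562, 702), (8, 1549, 1689),
    (9, 184, 321), (10, 30, 178), (11, 1698, 1791), (12, 1794, 1850),
]


def by_position(regions):
    """Return the regions sorted by start coordinate (N- to C-terminus)."""
    return sorted(regions, key=lambda region: region[1])


def linker_gaps(regions):
    """Return (start, end) of unassigned stretches between consecutive domains."""
    ordered = by_position(regions)
    gaps = []
    for (_, _, prev_end), (_, next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps


def covered_residues(regions):
    """Count residues inside assigned regions (assumes no overlaps)."""
    return sum(end - start + 1 for _, start, end in regions)


if __name__ == "__main__":
    for number, start, end in by_position(REGIONS):
        print(f"domain {number:>2}: {start}-{end} ({end - start + 1} residues)")
    print("unassigned stretches between domains:", linker_gaps(REGIONS))
    print("residues in assigned regions:", covered_residues(REGIONS))
```

Sorting by start coordinate makes the repeat structure easier to see: most of the carbohydrate-binding hits span 141 residues each and are separated by short unassigned stretches.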

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_020458017.1.31213
Sequence length 1853
Comment cellulosomal-scaffolding protein A [Ruminiclostridium thermocellum]; AA=GCF_000015865.1; RF=representative genome; TAX=203119; STAX=1515; NAME=Ruminiclostridium thermocellum ATCC 27405; strain=ATCC 27405; AL=Complete Genome; RT=Major
Sequence
MRKVISMLLVVAMLTTIFAAMIPQTVSAATMTVEIGKVTAAVGSKVEIPITLKGVPSKGM
ANCDFVLGYDPNVLEVTEVKPGSIIKDPDPSKSFDSAIYPDRKMIVFLFAEDSGRGTYAI
TQDGVFATIVATVKSAAAAPITLLEVGAFADNDLVEISTTFVAGGVNLGSSVPTTQPNVP
SDGVVVEIGKVTGSVGTTVEIPVYFRGVPSKGIANCDFVFRYDPNVLEIIGIDPGDIIVD
PNPTKSFDTAIYPDRKIIVFLFAEDSGTGAYAITKDGVFAKIRATVKSSAPGYITFDEVG
GFADNDLVEQKVSFIDGGVNVGNATPTKGATPTNTATPTKSATATPTRPSVPTNTPTNTP
ANTPVSGNLKVEFYNSNPSDTTNSINPQFKVTNTGSSAIDLSKLTLRYYYTVDGQKDQTF
WCDHAAIIGSNGSYNGITSNVKGTFVKMSSSTNNADTYLEISFTGGTLEPGAHVQIQGRF
AKNDWSNYTQSNDYSFKSASQFVEWDQVTAYLNGVLVWGKEPGGSVVPSTQPVTTPPATT
KPPATTKPPATTIPPSDDPNAIKIKVDTVNAKPGDTVNIPVRFSGIPSKGIANCDFVYSY
DPNVLEIIEIKPGELIVDPNPDKSFDTAVYPDRKIIVFLFAEDSGTGAYAITKDGVFATI
VAKVKSGAPNGLSVIKFVEVGGFANNDLVEQRTQFFDGGVNVGDTTVPTTPTTPVTTPTD
DSNAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIV
DPNPDKSFDTAVYPDRKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKSGAPNGLSVIKF
VEVGGFANNDLVEQKTQFFDGGVNVGDTTEPATPTTPVTTPTTTDDLDAVRIKVDTVNAK
PGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIVDPNPDKSFDTAVYPD
RKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKSGAPNGLSVIKFVEVGGFANNDLVEQK
TQFFDGGVNVGDTTEPATPTTPVTTPTTTDDLDAVRIKVDTVNAKPGDTVRIPVRFSGIP
SKGIANCDFVYSYDPNVLEIIEIEPGDIIVDPNPDKSFDTAVYPDRKIIVFLFAEDSGTG
AYAITKDGVFATIVAKVKEGAPNGLSVIKFVEVGGFANNDLVEQKTQFFDGGVNVGDTTE
PATPTTPVTTPTTTDDLDAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDP
NVLEIIEIEPGELIVDPNPTKSFDTAVYPDRKMIVFLFAEDSGTGAYAITEDGVFATIVA
KVKSGAPNGLSVIKFVEVGGFANNDLVEQKTQFFDGGVNVGDTTEPATPTTPVTTPTTTD
DLDAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIV
DPNPDKSFDTAVYPDRKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKEGAPNGLSVIKF
VEVGGFANNDLVEQKTQFFDGGVNVGDTTVPTTSPTTTPPEPTITPNKLTLKIGRAEGRP
GDTVEIPVNLYGVPQKGIASGDFVVSYDPNVLEIIEIEPGELIVDPNPTKSFDTAVYPDR
KMIVFLFAEDSGTGAYAITEDGVFATIVAKVKEGAPEGFSAIEISEFGAFADNDLVEVET
DLINGGVLVTNKPVIEGYKVSGYILPDFSFDATVAPLVKAGFKVEIVGTELYAVTDANGY
FEITGVPANASGYTLKISRATYLDRVIANVVVTGDTSVSTSQAPIMMWVGDIVKDNSINL
LDVAEVIRCFNATKGSANYVEELDINRNGAINMQDIMIVHKHFGATSSDYDAQ
Identical sequences Q06851
CIPA_CLOTM 203119.Cthe_3077 gi|125975556|ref|YP_001039466.1| WP_020458017.1.31213
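To pull the residues of one assigned domain out of the sequence shown above, the wrapped lines first have to be joined into a single string. The sketch below shows one way to do that in Python. `join_sequence_lines` and `slice_region` are hypothetical helper names, and the code assumes region coordinates are 1-based and inclusive, which the page itself does not state.

```python
def join_sequence_lines(lines):
    """Concatenate wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def slice_region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(
            f"region {start}-{end} lies outside a sequence of length {len(sequence)}"
        )
    # Python slices are 0-based and end-exclusive, so shift the start only.
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Only the first two wrapped lines of the sequence are used here, for brevity.
    wrapped = [
        "MRKVISMLLVVAMLTTIFAAMIPQTVSAATMTVEIGKVTAAVGSKVEIPITLKGVPSKGM",
        "ANCDFVLGYDPNVLEVTEVKPGSIIKDPDPSKSFDSAIYPDRKMIVFLFAEDSGRGTYAI",
    ]
    sequence = join_sequence_lines(wrapped)
    print(len(sequence))
    print(slice_region(sequence, 30, 40))
```

With the full 1853-residue sequence joined the same way, `slice_region(sequence, 366, 522)` would return the region reported for domain 1. The bounds check is there so that a coordinate outside the sequence raises an error instead of silently returning a truncated string.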
