SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|385777993|ref|YP_005687158.1| from Clostridium thermocellum DSM 1313

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|385777993|ref|YP_005687158.1|
Domain Number 1 Region: 366-522
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 1.67e-54
Family Cellulose-binding domain family III 0.0000000415
Further Details:      
 
Domain Number 2 Region: 719-859
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 3.59e-39
Family Cellulose-binding domain family III 0.000000137
Further Details:      
 
Domain Number 3 Region: 884-1024
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 6.88e-39
Family Cellulose-binding domain family III 0.000000134
Further Details:      
 
Domain Number 4 Region: 556-696
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 1.54e-38
Family Cellulose-binding domain family III 0.000000158
Further Details:      
 
Domain Number 5 Region: 1048-1188
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 7.85e-37
Family Cellulose-binding domain family III 0.00000115
Further Details:      
 
Domain Number 6 Region: 184-321
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 8.9e-37
Family Cellulose-binding domain family III 0.000000127
Further Details:      
 
Domain Number 7 Region: 30-178
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 3.66e-36
Family Cellulose-binding domain family III 0.00000273
Further Details:      
 
Domain Number 8 Region: 1197-1290
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 6.28e-17
Family Pre-dockerin domain 0.0000031
Further Details:      
 
Domain Number 9 Region: 1293-1349
Classification Level Classification E-value
Superfamily Type I dockerin domain 0.000000000101
Family Type I dockerin domain 0.0000353
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|385777993|ref|YP_005687158.1|
Sequence length 1352
Comment cellulosome anchoring protein cohesin subunit [Clostridium thermocellum DSM 1313]
Sequence
MRKVISMLLVVAMLTTIFAAMIPQTVSAATMTVEIGKVTAAVGSKVEIPITLKGVPSKGM
ANCDFVLGYDPNVLEVTEVKPGSIIKDPDPSKSFDSAIYPDRKMIVFLFAEDSGRGTYAI
TQDGVFATIVATVKSAAAAPITLLEVGAFADNDLVEISTTFVAGGVNLGSSVPTTQPNVP
SDGVVVEIGKVTGSVGTTVEIPVYFRGVPSKGIANCDFVFRYDPNVLEIIGIDPGDIIVD
PNPTKSFDTAIYPDRKIIVFLFAEDSGTGAYAITKDGVFAKIRATVKSSAPGYITFDEVG
GFADNDLVEQKVSFIDGGVNVGNATPTKGATPTNTATPTKSATATPTRPSVPTNTPTNTP
ANTPVSGNLKVEFYNSNPSDTTNSINPQFKVTNTGSSAIDLSKLTLRYYYTVDGQKDQTF
WCDHAAIIGSNGSYNGITSNVKGTFVKMSSSTNNADTYLEISFTGGTLEPGAHVQIQGRF
AKNDWSNYTQSNDYSFKSASQFVEWDQVTAYLNGVLVWGKEPGGSVVPSTQPVTTPPATT
KPPATTIPPSDDPNAIKIKVDTVNAKPGDTVNIPVRFSGIPSKGIANCDFVYSYDPNVLE
IIEIKPGELIVDPNPDKSFDTAVYPDRKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKS
GAPNGLSVIKFVEVGGFANNDLVEQRTQFFDGGVNVGDTTVPTTPTTPVTTPTDDSNAVR
IKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIVDPNPDK
SFDTAVYPDRKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKSGAPNGLSVIKFVEVGGF
ANNDLVEQKTQFFDGGVNVGDTTEPATPTTPVTTPTTTDDLDAVRIKVDTVNAKPGDTVR
IPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIVDPNPTKSFDTAVYPDRKIIVF
LFAEDSGTGAYAITKDGVFATIVAKVKEGAPNGLSVIKFVEVGGFANNDLVEQKTQFFDG
GVNVGDTTVPTTSPTTTPPEPTITPNKLTLKIGRAEGRPGDTVEIPVNLYGVPQKGIASG
DFVVSYDPNVLEIIEIEPGELIVDPNPTKSFDTAVYPDRKMIVFLFAEDSGTGAYAITED
GVFATIVAKVKEGAPEGFSAIEISEFGAFADNDLVEVETDLINGGVLVTNKPVIEGYKVS
GYILPDFSFDATVAPLVKAGFKVEIVGTELYAVTDANGYFEITGVPANASGYTLKISRAT
YLDRVIANVVVTGDTSVSTSQAPIMMWVGDIVKDNSINLLDVAEVIRCFNATKGSANYVE
ELDINRNGAINMQDIMIVHKHFGATSSDYDAQ
Download sequence
Identical sequences gi|385777993|ref|YP_005687158.1| WP_014522594.1.19387

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]