SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for gi|385777994|ref|YP_005687159.1| from Clostridium thermocellum DSM 1313

Domain architecture


Domain assignment details

Strong hits

Sequence:  gi|385777994|ref|YP_005687159.1|
Domain Number 1 Region: 33-191
  Classification Level   Classification                        E-value
  Superfamily            Carbohydrate-binding domain           3.14e-56
  Family                 Cellulose-binding domain family III   0.00000164

Domain Number 2 Region: 403-565
  Classification Level   Classification                        E-value
  Superfamily            Carbohydrate-binding domain           3.66e-56
  Family                 Cellulose-binding domain family III   0.00000367

Domain Number 3 Region: 605-766
  Classification Level   Classification                        E-value
  Superfamily            Carbohydrate-binding domain           9.57e-56
  Family                 Cellulose-binding domain family III   0.00000719

Domain Number 4 Region: 202-363
  Classification Level   Classification                        E-value
  Superfamily            Carbohydrate-binding domain           3.4e-55
  Family                 Cellulose-binding domain family III   0.00000331
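
The strong hits are numbered in the order the page lists them, not by position along the protein. The following is a minimal sketch of how the four regions could be ordered by position and summarised. It assumes the region coordinates are 1-based and inclusive, which the page does not state explicitly. The names `Domain`, `by_position`, `covered_residues` and `linker_lengths` are illustrative and not part of any SUPERFAMILY tool.

```python
from typing import List, NamedTuple


class Domain(NamedTuple):
    number: int  # "Domain Number" as listed in the strong hits
    start: int   # first residue of the region (assumed 1-based, inclusive)
    end: int     # last residue of the region (assumed inclusive)


# Regions copied from the strong hits for this protein.
DOMAINS = [
    Domain(1, 33, 191),
    Domain(2, 403, 565),
    Domain(3, 605, 766),
    Domain(4, 202, 363),
]
SEQUENCE_LENGTH = 1615  # "Sequence length" reported for this protein


def by_position(domains: List[Domain]) -> List[Domain]:
    """Return the domains sorted by start coordinate."""
    return sorted(domains, key=lambda d: d.start)


def covered_residues(domains: List[Domain]) -> int:
    """Count residues inside assigned regions (assumes no overlaps)."""
    return sum(d.end - d.start + 1 for d in domains)


def linker_lengths(domains: List[Domain]) -> List[int]:
    """Lengths of the unassigned stretches between consecutive domains."""
    ordered = by_position(domains)
    return [b.start - a.end - 1 for a, b in zip(ordered, ordered[1:])]


if __name__ == "__main__":
    print([d.number for d in by_position(DOMAINS)])
    print(covered_residues(DOMAINS) / SEQUENCE_LENGTH)
    print(linker_lengths(DOMAINS))
```

Under these assumptions the four assigned regions all fall within the first 766 residues, so the remainder of the 1615-residue sequence has no strong hit on this page.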
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): gi|385777994|ref|YP_005687159.1|
Sequence length: 1615
Comment: cellulosome anchoring protein cohesin subunit [Clostridium thermocellum DSM 1313]
Sequence:
MKRKNKVLSILLTLLLIISTTSVNMSFAEATPSIEMVLDKTEVHVGDVITATIKVNNIRK
LAGYQLNIKFDPEVLQPVDPATGEEFTDKSMPVNRVLLTNSKYGPTPVAGNDIKSGIINF
ATGYNNLTAYKSSGIDEHTGIIGEIGFKVLKKQNTSIRFEDTLSMPGAISGTSLFDWDAE
TITGYEVIQPDLIVVEAEPLKDASVALELDKTKVKVGDIITATIKIENMKNFAGYQLNIK
YDPTMLEAIELETGSAIAKRTWPVTGGTVLQSDNYGKTTAVANDVGAGIINFAEAYSNLT
KYRETGVAEETGIIGKIGFRVLKAGSTAIRFEDTTAMPGAIEGTYMFDWYGENIKGYSVV
QPGEIVAEGEEPGEEPTEEPVPTETSADPTPTVTEEPVPSELPDSYVIMELDKTKVKVGD
IITATIKIENMKNFAGYQLNIKYDPTMLEAIELETGSAIAKRTWPVTGGTVLQSDNYGKT
TAVANDVGAGIINFAEAYSNLTKYRETGVAEETGIIGKIGFRVLKAGSTAIRFEDTTAMP
GAIEGTYMFDWYGENIKGYSVVQPGEIVAEGEEPGEEPTEEPVPTETPVDPTPTVTEEPV
PSELPDSYVIMELDKTKVKEGDVIIATIRVNNIKNLAGYQIGIKYDPKVLEAFNIETGDP
IDEGTWPAVGGTILKNRDYLPTGVAINNVSKGILNFAAYYVYFDDYREEGKSEDTGIIGN
IGFRVLKAEDTTIRFEELESMPGSIDGTYMLDWYLNRISGYVVIQPAPIKAASDEPIPTD
TPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDE
PTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSE
TPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTP
SDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPT
PSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEP
TPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSET
PEEPIPTDTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPS
DEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTP
SETPEEPTPSDEPTPSDEPTPSETPEEPTPSDEPTPSDEPTPSETPEEPTPSDEPTPSDE
PTPSETPEEPTPTTTPTPTPSTTPTSGSGGSGGSGGGGGGGGGTVPTSPTPTPTSKPTST
PAPTEIEEPTPSDVPGAIGGEHRAYLRGYPDGSFRPERNITRAEAAVIFAKLLGADESYG
AQSASPYSDLADTHWAAWAIKFATSQGLFKGYPDGTFKPDQNITRAEFATVVLHFLTKVK
GQEIMSKLATIDISNPKFDDCVGHWAQEFIEKLTSLGYISGYPDGTFKPQNYIKRSESVA
LINRALERGPLNGAPKLFPDVNESYWAFGDIMDGALDHSYIIEDEKEKFVKLLED
Identical sequences: gi|385777994|ref|YP_005687159.1| WP_014522595.1.19387
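
The protein identifier used throughout this record packs two cross-references into one pipe-delimited string. As a hedged sketch, the function below splits such a string into tag/value pairs, assuming it follows the alternating `tag|value|` layout visible in the identifier itself. The function name `parse_pipe_identifier` is hypothetical, and other identifier styles (such as the `WP_` entry above) are not handled.

```python
from typing import Dict


def parse_pipe_identifier(identifier: str) -> Dict[str, str]:
    """Split an identifier like 'gi|123|ref|ABC.1|' into {tag: value}.

    Assumes fields alternate between a short tag and its value.
    """
    fields = identifier.strip().strip("|").split("|")
    if len(fields) % 2 != 0:
        raise ValueError(f"expected tag/value pairs, got {fields!r}")
    # Even positions are tags, odd positions are the matching values.
    return dict(zip(fields[0::2], fields[1::2]))


if __name__ == "__main__":
    print(parse_pipe_identifier("gi|385777994|ref|YP_005687159.1|"))
```

Here the `gi` field carries the numeric identifier and the `ref` field carries the versioned accession, which can then be used separately when looking the protein up elsewhere.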
