SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|374297272|ref|YP_005047463.1| from Clostridium clariflavum DSM 19732

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|374297272|ref|YP_005047463.1|
Domain Number 1 Region: 38-75,108-518
Classification Level Classification E-value
Superfamily Six-hairpin glycosidases 4.54e-136
Family Cellulases catalytic domain 0.0000000238
Further Details:      
 
Domain Number 2 Region: 709-857
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 2.09e-41
Family Cellulose-binding domain family III 0.0001
Further Details:      
 
Domain Number 3 Region: 535-691
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 5.23e-36
Family Cellulose-binding domain family III 0.00028
Further Details:      
 
Domain Number 4 Region: 873-942
Classification Level Classification E-value
Superfamily Type I dockerin domain 1.57e-18
Family Type I dockerin domain 0.0013
Further Details:      
 
Weak hits

Sequence:  gi|374297272|ref|YP_005047463.1|
Domain Number - Region: 90-115
Classification Level Classification E-value
Superfamily EF-hand 0.0472
Family S100 proteins 0.052
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) gi|374297272|ref|YP_005047463.1|
Sequence length 943
Comment Cellulose binding domain-containing protein,dockerin-like protein [Clostridium clariflavum DSM 19732]
Sequence
MLKATIRKKVQAVVLVFSILSTLLVAPVRIAKAAEVEINYAKALQLALHFYDAEKCGRGI
TGGRLEWRGDCHLEDEKVPLIPMETKESPGTNMSQEYIDKYRDVLDPDGDGTLDLSGGFH
DAGDHVKFGLPQAYTASTLGWGFYEFRQAFIDKGLEDHMLDILKWFTDYFLRSTFLDKDG
NLVAFCYQVGNGDVDHTYWGPPELQNQVRPAFFTTPENPAADQCGDAAAALAISYLNWKD
SDPEYAEKCLTAAKALYDFGKKYRGTGYSGGYYGSAYDDDELAWGAVWLNIATGDESYID
DIVRMENGKYTGYLGKIIVNEQNHWQNIWVHCWDVVWGGVFAKLAPITNTERDWYIFRWN
LEYWSGIPHEDPKDNAFLAASPSGYRVINTWGSARYNTTAQLCAFVYRKYTGRTDFTDWA
KSQMDYLMGNNPLNRCYIVGYSENSVKHPHHRAAHGSKTNSMLDPEEHRHTLWGALAGGP
DLEDNHIDETTDYVYNEVAIDYNAGFTGALAGFCTYYGQDHQILPNFPPKEDPIDEYYAE
AKLEQENKERTQITIRLYNYSVHPPHFEEAMKVRYFFNISEMLDAGQTIDDVDLQIMYDE
NASSYGGPIKYKGPYKWDDTGVCYVEFDWSGYKVYGTRELQFALVGAQDENYKFHWDPTN
DFSRQGITNKFEKTPYIAVFLGDELVYGQQPPKGVPTPTPSGNPADLKPSIKVLYKTTDA
SDAAGDIKVTLKIENTGKKPVDLSTLKIRYWYTKDSDDAQECIFDYVKIGKEMVEAKFVD
VSPAVENADNYLEIGFKSGAGVIAPGSDSGDIQFRVTKNGAKYTVSNDYSYDKSTSFTEN
SKITAYINSELIYGEEPDGVGPVVTPTPTPNDYIFGDVNGDKEVNSIDFAIMKQFLLGMI
KEFPYEHGAKAGDLNGDGNINSIDYALLKQYILGIIKEFPIEQ
Download sequence
Identical sequences G8LUA7
WP_014256085.1.34793 gi|374297272|ref|YP_005047463.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]