SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125973254|ref|YP_001037164.1| from Clostridium thermocellum ATCC 27405

Domain architecture


Domain assignment details

Strong hits

Sequence:  gi|125973254|ref|YP_001037164.1|
Domain  Region     Superfamily                  Superfamily E-value  Family                               Family E-value
1       802-961    Carbohydrate-binding domain  2.62e-59             Cellulose-binding domain family III  7.65e-06
2       1142-1302  Carbohydrate-binding domain  3.14e-58             Cellulose-binding domain family III  5.31e-06
3       38-196     Carbohydrate-binding domain  8.9e-58              Cellulose-binding domain family III  4.58e-06
4       609-769    Carbohydrate-binding domain  1.02e-57             Cellulose-binding domain family III  7.53e-06
5       416-575    Carbohydrate-binding domain  4.19e-55             Cellulose-binding domain family III  9.35e-06
6       972-1132   Carbohydrate-binding domain  1.38e-54             Cellulose-binding domain family III  1.11e-05
7       216-375    Carbohydrate-binding domain  2.45e-54             Cellulose-binding domain family III  5.73e-06
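
The domain numbers above follow E-value rank, not position along the sequence. As a minimal sketch (not part of the SUPERFAMILY output), the regions can be put in sequence order to see each domain's length, the unassigned stretches between domains, and how much of the 1305-residue protein is covered. The names `REGIONS`, `SEQUENCE_LENGTH`, and `summarize` are illustrative, and the regions are assumed to be 1-based and inclusive.

```python
# Regions copied from the domain assignment above: domain number -> (start, end).
# Assumption: positions are 1-based and inclusive.
REGIONS = {
    1: (802, 961),
    2: (1142, 1302),
    3: (38, 196),
    4: (609, 769),
    5: (416, 575),
    6: (972, 1132),
    7: (216, 375),
}
SEQUENCE_LENGTH = 1305  # "Sequence length" reported for this protein


def summarize(regions, sequence_length):
    """Return domains in sequence order, their lengths, gaps, and coverage."""
    # Sort by start position instead of by E-value rank.
    ordered = sorted(regions.items(), key=lambda item: item[1][0])
    lengths = [end - start + 1 for _, (start, end) in ordered]
    # Unassigned residues between each pair of consecutive domains.
    gaps = [
        nxt[1][0] - cur[1][1] - 1
        for cur, nxt in zip(ordered, ordered[1:])
    ]
    coverage = sum(lengths) / sequence_length
    return ordered, lengths, gaps, coverage


ordered, lengths, gaps, coverage = summarize(REGIONS, SEQUENCE_LENGTH)
for (number, (start, end)), length in zip(ordered, lengths):
    print(f"domain {number}: {start}-{end} ({length} residues)")
print("gaps between domains:", gaps)
print(f"assigned fraction of sequence: {coverage:.3f}")
```

Under these assumptions the seven domains are each 159 to 161 residues long and together account for 1122 of the 1305 residues, with short unassigned stretches between them.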
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|125973254|ref|YP_001037164.1|
Sequence length 1305
Comment cellulosome anchoring protein cohesin subunit [Clostridium thermocellum ATCC 27405]
Sequence
MKKGISFILVIAIIMAMTSSFAVSAYPATTLSAAGVESGSISLEFDKTTAQVGDIIKAYV
KISNIKNFAGYQVNIKYDPTVLQAVNPDTKVPLNKNTMPKSGNLLSNPEYGSIYGVLNKI
EEGILNFGKAYTYLNDYKLSNSPEETGILAEIGFKVLKVQPTTVKFENTSSMPGSLSGTM
LFDWNGEVITDYTVVQSAVINSSVVNPSPSAAPSKGIVKMELNKNTAFVGDIIIAEIKVD
NFDNIAGYQFNIKYDPQVLQPIDPDTNVPYGKSTMPKDGTILVNPEFGAISAVANKVEEG
ILNFGKSYTYLAAYKASGMAEKSGTIAKIAFKALKAASSTTIKFEETLSMPGSIEGTMIF
DWNGDNVLGYQVIQAGAVSISGQTVTPSPSPTQIPVSPTPIPSQKPTPSSTPVSNASISI
EVDKNTVKVGEMVKAFVKVDGFDSLAGFQVNIKYNPDLLQAVNPDTGEPLKINSMPKSGD
LISNNEYGVISIAVNKPSEGVLNFAKTYTYVGDYKDSGKPEKSGTLAIIGFKALNEGDAT
VRFEDAISMPSSLSGTILLDWDLNRISDYKVVQPDVIKITGSTKPSPSPTSTPVGPSPTA
TPTGGPVSDGQIELKLDKEQAKVGDIIKAAINISDINNFAGYQVNIKYDPAVLQAVNPVT
GEPMSDKSMPADGTILVNTEYGIISAVANKTSEGILNFGKAYTYLDAYKLSNNPEKTGTL
AVIGFKVLKAQDTYIGFENSITMPSSVLGTYLFDWNGDTITGYKVVNPGVIKISSSTVTT
PSPTPTTTPTSTPKPTNPVSTDSYIKLELDKNTAAVGEIIKATVKVNNIKELAGYQINIK
YDPNVLQPVNPYTGAEYTSKTPLANGELIVNSEYGATSMVVHDLTKGVLNFAQIYVFMED
YRNSGKAEETGVLGVIGFKVLKNEKTTIKFEEPASMPASISGTYLIDWNGNKKTDYKVIQ
PEPVNADAVSSGSYIKLEFDKNTASEGEIIRATVKVNNVKNLAGYQICIKYDPNVLQPVN
PNTGAAYTTTTHLVDGELIVKQEYGSTSMAAHRLSNGILNFARTYLYVSDYKEDGKPEET
GILGVIGFKVLKKEKTTVSFYADEALMPNSVSGTYLIDWNSNKKTDYKVIQPEPINGGAL
PENYIALELNKNKAAVGETIKATVRVNNIKNLAGYQVNIVYDPNVLQPIDPVTGAPFTTR
STFANCELLNNDEYGPTNITAHDLTKGALNFARGYSYLNEYRKNGVPETTGVLGEITFKV
LKSQTTKIRFEEPAAMPGSISGTYLFDWYGNQISNYSVIQPDSIN
Identical sequences A3DDE2
203119.Cthe_0736 WP_011837893.1.31213 gi|125973254|ref|YP_001037164.1|
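
The identifier used throughout this page follows the legacy NCBI pipe-delimited layout, in which database tags and values alternate (`gi`, then the GI number, then `ref`, then the RefSeq accession). A small sketch of splitting it into its parts, assuming that alternating layout holds; the function name `parse_pipe_identifier` is hypothetical.

```python
def parse_pipe_identifier(identifier: str) -> dict:
    """Split a pipe-delimited identifier into {tag: value} pairs.

    Assumes tags and values strictly alternate, as in
    'gi|<number>|ref|<accession>|'. Empty fields (for example from the
    trailing '|') are ignored.
    """
    fields = [field for field in identifier.split("|") if field]
    if len(fields) % 2 != 0:
        raise ValueError(f"unpaired field in identifier: {identifier!r}")
    return dict(zip(fields[0::2], fields[1::2]))


parsed = parse_pipe_identifier("gi|125973254|ref|YP_001037164.1|")
print(parsed["gi"])   # GI number
print(parsed["ref"])  # RefSeq accession with version
```

The other entries in the identical-sequences list (such as `A3DDE2` or `203119.Cthe_0736`) use different naming schemes and are not covered by this sketch.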
