SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_003516263.1.20586 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  WP_003516263.1.20586
Domain Number 1 Region: 802-961
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          2.62e-59
Family                Cellulose-binding domain family III  0.00000765
 
Domain Number 2 Region: 1142-1302
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          3.14e-58
Family                Cellulose-binding domain family III  0.00000531
 
Domain Number 3 Region: 38-196
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          8.9e-58
Family                Cellulose-binding domain family III  0.00000458
 
Domain Number 4 Region: 609-769
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          1.02e-57
Family                Cellulose-binding domain family III  0.00000753
 
Domain Number 5 Region: 972-1132
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          3.14e-55
Family                Cellulose-binding domain family III  0.0000101
 
Domain Number 6 Region: 416-575
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          4.19e-55
Family                Cellulose-binding domain family III  0.00000935
 
Domain Number 7 Region: 216-375
Classification Level  Classification                       E-value
Superfamily           Carbohydrate-binding domain          2.45e-54
Family                Cellulose-binding domain family III  0.00000573
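
The domains above are listed in hit order rather than by position along the protein (the superfamily E-values increase from domain 1 to domain 7). As a minimal sketch, the Python below reorders the seven regions from N- to C-terminus and computes how much of the 1305-residue sequence they cover. The function name `summarize_architecture` and the returned dictionary layout are illustrative choices, not part of the SUPERFAMILY server; the regions are treated as 1-based, inclusive coordinates, which is an assumption.

```python
# Regions copied from the assignment list: (domain number, start, end).
# Coordinates are assumed to be 1-based and inclusive.
DOMAINS = [
    (1, 802, 961),
    (2, 1142, 1302),
    (3, 38, 196),
    (4, 609, 769),
    (5, 972, 1132),
    (6, 416, 575),
    (7, 216, 375),
]
SEQUENCE_LENGTH = 1305  # "Sequence length" reported for this protein


def summarize_architecture(domains, sequence_length):
    """Order domains by start position and measure coverage and linker lengths."""
    ordered = sorted(domains, key=lambda d: d[1])
    order = [number for number, _, _ in ordered]
    # Residues inside any assigned domain (the regions here do not overlap).
    covered = sum(end - start + 1 for _, start, end in ordered)
    # Unassigned residues between each pair of consecutive domains.
    linkers = [nxt[1] - cur[2] - 1 for cur, nxt in zip(ordered, ordered[1:])]
    return {
        "order": order,
        "covered": covered,
        "coverage": covered / sequence_length,
        "linkers": linkers,
    }


if __name__ == "__main__":
    summary = summarize_architecture(DOMAINS, SEQUENCE_LENGTH)
    print("N- to C-terminal order:", summary["order"])
    print("Residues in domains:", summary["covered"])
    print("Fraction covered: %.3f" % summary["coverage"])
    print("Linker lengths:", summary["linkers"])
```

Read this way, the protein is a tandem array of seven carbohydrate-binding domains of about 160 residues each, separated by short unassigned linkers.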
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_003516263.1.20586
Sequence length 1305
Comment cellulosome anchoring protein cohesin region [Ruminiclostridium thermocellum]; AA=GCF_000255615.2; RF=na; TAX=1138384; STAX=1515; NAME=Ruminiclostridium thermocellum AD2; strain=AD2; AL=Complete Genome; RT=Major
Sequence
MKKGISFILVIAIIMAMTSSFAVSAYPATTLSAAGVESGSISLEFDKTTAQVGDIIKAYV
KISNIKNFAGYQVNIKYDPTVLQAVNPDTKVPLNKNTMPKSGNLLSNPEYGSIYGVLNKI
EEGILNFGKAYTYLNDYKLSNSPEETGILAEIGFKVLKVQPTTVKFENTSSMPGSLSGTM
LFDWNGEVITDYTVVQSAVINSSVVNPSPSAAPSKGIVKMELNKNTAFVGDIIIAEIKVD
NFDNIAGYQFNIKYDPQVLQPIDPDTNVPYGKSTMPKDGTILVNPEFGAISAVANKVEEG
ILNFGKSYTYLAAYKASGMAEKSGTIAKIAFKALKAASSTTIKFEETLSMPGSIEGTMIF
DWNGDNVLGYQVIQAGAVSISGQTVTPSPSPTQIPVSPTPIPSQKPTPSSTPVSNASISI
EVDKNTVKVGEMVKAFVKVDGFDSLAGFQVNIKYNPDLLQAVNPDTGEPLKINSMPKSGD
LISNNEYGVISIAVNKPSEGVLNFAKTYTYVGDYKDSGKPEKSGTLAIIGFKALNEGDAT
VRFEDAISMPSSLSGTILLDWDLNRISDYKVVQPDVIKITGSTKPSPSPTSTPVGPSPTA
TPTGGPVSDGQIELKLDKEQAKVGDIIKAAINISDINNFAGYQVNIKYDPAVLQAVNPVT
GEPMSDKSMPADGTILVNTEYGIISAVANKTSEGILNFGKAYTYLDAYKLSNNPEKTGTL
AVIGFKVLKAQDTYIGFENSITMPSSVLGTYLFDWNGDTITGYKVVNPGVIKISSSTVTT
PSPTPTTTPTSTPKPTNPVSTDSYIKLELDKNTAAVGEIIKATVKVNNIKELAGYQINIK
YDPNVLQPVNPYTGAEYTSKTPLANGELIVNSEYGATSMVVHDLTKGVLNFAQIYVFMED
YRNSGKAEETGVLGVIGFKVLKNEKTTIKFEEPASMPASISGTYLIDWNGNKKTDYKVIQ
PEPVNADAVSSGSYIKLEFDKNTASVGEIIRATVKVNNVKNLAGYQICIKYDPNVLQPVN
PNTGAAYTTTTHLVDGELIVKQEYGSTSMAAHRLSNGILNFARTYLYVSDYKEDGKPEET
GILGVIGFKVLKKEKTTVSFYADEALMPNSVSGTYLIDWNSNKKTDYKVIQPEPINGGAL
PENYIALELNKNKAAVGETIKATVRVNNIKNLAGYQVNIVYDPNVLQPIDPVTGAPFTTR
STFANCELLNNDEYGPTNITAHDLTKGALNFARGYSYLNEYRKNGVPETTGVLGEITFKV
LKSQTTKIRFEEPAAMPGSISGTYLFDWYGNQISNYSVIQPDSIN
Identical sequences WP_003516263.1.19387 WP_003516263.1.20586 WP_003516263.1.55520 WP_003516263.1.60145 WP_003516263.1.6636 WP_003516263.1.6965 gi|385778835|ref|YP_005688000.1|
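
The Comment field above packs a free-text description and several `KEY=value` pairs into one semicolon-separated line. A minimal Python sketch for splitting it is shown here. It assumes the first segment is always the description and that every later segment contains one `=`; the helper name `parse_comment` is hypothetical. The page does not say what keys such as `AA` or `RF` mean, so the values are kept as plain strings.

```python
# Comment line copied from the protein record.
COMMENT = (
    "cellulosome anchoring protein cohesin region [Ruminiclostridium thermocellum]; "
    "AA=GCF_000255615.2; RF=na; TAX=1138384; STAX=1515; "
    "NAME=Ruminiclostridium thermocellum AD2; strain=AD2; "
    "AL=Complete Genome; RT=Major"
)


def parse_comment(comment):
    """Split a comment line into (description, {key: value}).

    Assumes the first ';'-separated segment is the description and the
    remaining segments have the form KEY=value.
    """
    parts = [part.strip() for part in comment.split(";") if part.strip()]
    description = parts[0]
    fields = {}
    for part in parts[1:]:
        key, sep, value = part.partition("=")
        if sep:  # skip any segment that has no '='
            fields[key.strip()] = value.strip()
    return description, fields


if __name__ == "__main__":
    description, fields = parse_comment(COMMENT)
    print(description)
    for key, value in fields.items():
        print(f"{key}: {value}")
```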
