SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|385778355|ref|YP_005687520.1| from Clostridium thermocellum DSM 1313

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|385778355|ref|YP_005687520.1|
Domain Number 1 Region: 37-189
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 5.23e-38
Family Cellulose-binding domain family III 0.000063
Further Details:      
 
Domain Number 2 Region: 232-391
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 4.08e-34
Family CBM4/9 0.0036
Further Details:      
 
Domain Number 3 Region: 726-825,858-927
Classification Level Classification E-value
Superfamily vWA-like 0.0000000000000241
Family Integrin A (or I) domain 0.036
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|385778355|ref|YP_005687520.1|
Sequence length 1050
Comment carbohydrate-binding CenC domain-containing protein [Clostridium thermocellum DSM 1313]
Sequence
MLRRLSYKQSALLIAVCILIQIISLASFGLNVYANSANISLEFYNGDFGASVSSISMNFR
ITNNGSSQISLSDIKLRYYFTDDGVSPITVFIDYANNNGRGINNDVTYTIKDINSSGANK
YIEFGFNAQAGSLEPNTSVLMRARAYQSEYKQSFTQTNDYSFCQSNNDFAAWNKVTGYLN
GVLFSGTEPVMYSPTPSLSTPTPQISPTPVATPSSVPTYQATPTPSGFETPVNWDGKAVP
NGDFESGSVFWSFYCDSLSGANATNLIHSEPSGNKMSKTSITNAGSNHWAIQLKHDGIVL
ENLKTYRLTFDAKSTVPRNIRVSLQNATSSMIEYFGKIVEVEPKMKTYTCEFTFNSTTGT
NVAIVFEMGKIGTETDKAHDIVLDNVHIEKIASPSVSPVPSEDPQGAGITASRSSVYEAE
LGEEVDITLSQSGEIALEGRMDTEKEIVLVLDNSGVLNSYVEDILSPLDFGIYSNHNLTV
QGKDASINGSVHANDVFTSTADSISISQTCSAASFHITSKNVNINEYKNITIPIEMPNFH
SKLIDDAMRNSMVFRPEDYFLSWFPQPMPGQEDIFIFYNLIAGRFEIFGAGTLVINSSMY
FMGNVLISLTNTNNVGEGFIVADGNIIIQGQNLYPNGPNDKLYVYSIGGNIEFQTSNSTI
NGIAYAPGNPANPNSGKIFFSGDKNTINGSIAANELDFFAGGLVVNHTEGQFDTVEEKYI
DKSTYLKLVKDAAKNFVDKFAGSKTKMAVIQYSDSANDNDFKKYDLSLPDKGAALKETID
KIKPGTSGLSNMGDGMRRAYHILNGPPPKGQISKYIVVITGSVPNRWTAVDNKKNEPKTD
NGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEEDIGDVLEEIAAESGAKPL
EGTDRHYYKANNFLELLDILNNMTLKIYYDVVLDKVLYEEILPQGVLLVEAPEWISTESV
PMGGVNRIKLTGEINNIPFTFTGTGYSFVVESFKIKVKFLKPGTIVFDGADSRLRYNFNY
VDGAGNIHSKSVDKHFDDMTVNVTMKVDIN
Download sequence
Identical sequences gi|385778355|ref|YP_005687520.1| WP_003517483.1.19387 WP_003517483.1.20586 WP_003517483.1.55520 WP_003517483.1.6636 WP_003517483.1.6965

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]