SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125975291|ref|YP_001039201.1| from Clostridium thermocellum ATCC 27405

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|125975291|ref|YP_001039201.1|
Domain Number 1 Region: 416-672
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.58e-78
Family Glycosyl hydrolases family 16 0.002
Further Details:      
 
Domain Number 2 Region: 688-849
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.98e-37
Family CBM4/9 0.0016
Further Details:      
 
Domain Number 3 Region: 1021-1173
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.36e-36
Family CBM4/9 0.0017
Further Details:      
 
Domain Number 4 Region: 870-1014
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 3.69e-34
Family CBM4/9 0.0044
Further Details:      
 
Domain Number 5 Region: 1177-1320
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 8.67e-32
Family CBM4/9 0.0013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) gi|125975291|ref|YP_001039201.1|
Sequence length 1321
Comment glycoside hydrolase family protein [Clostridium thermocellum ATCC 27405]
Sequence
MYKRLLSSVLIIMLLLSAWSPISVQASDGINDIRGHWAEEDLNKWMEKGILVGYQDGTIR
PDNNITRAEFVTLINKVFGLYELSREQFADVEDSKWYSREILKARAAGYIAGYGSNVFKP
DNYITRQEAVVIIAKVFELQSGSNYTSKFKDGSLVKEYAKDSVSALVEKGYIAGYEDGTF
RPDNYITRAETIKILNKIIPSLYNEKGDYKNEEVAGNALINTEGVILKDTVINGDLYLAQ
GIQNGDVTLDGVNVKGTVFVNGGGSDSIHFINTKINRVVVNKTGVRIVTSGNTSVESVVV
KSGAKLEEKELTGDGFKNVTVDSQLSAGNEIIFVGDFEQVDVLADDALLETKEAKMKLRI
FGQRIKVNGKAIEKSSKNYIVNGELISTEEEPGPSDAPGAEDDQNSGSPGSSTNPAPTKN
PNEEWRLVWSDEFNGSEINMANWSYDDPTNGRWNGEVQSYTQNNAYIKDGALVIEARKED
ITEPSGETYHYTSSKLITKGKKSWKYGKFEIRAKMPQGQGIWPAIWMMPEDEPFYGTWPK
CGEIDIMELLGHEPDKIYGTIHFGEPHKESQGTYTLPEGQTFADDFHVYSIEWEPGEIRW
YIDGKLYHVANDWYSRDPYLADDYTYPAPFDQNFFLILNISVGGGWPGYPDETTVFPQQM
VVDYVRVYQKDKYPHREKPAKEEVKPREPLEDGNYIYNGGFDVDDSAAVGVDGVPYTSYW
TFLTASGGAATVNVEEGVMHVQIENGGTTDYGVQLLQAPIHLEKGAKYKASFDMKAENPR
QVKLKIGGDGDRGWKDYAAIPPFTVSTEMTNYEFEFTMKDDTDVKARFEFNMGLDDNDVW
IDNVKLIKTEDAPVIDPSEIARPPLLSGNYIYNGTFDQGPNRMGFWNFVVDSTAKATYYI
GSDVNERRFETRIEKGGTSRGAIRLVQPGINIENGKTYKVSFEASAANTRTIEVEIASNL
HNSSIFATTFEISKESKIYEFEFTMDKDSDKNGELRFNLGGSNVNVYIDNVVMKRVSTDE
VEGNLILNGVFNGLAGWGYGAYEPGSADFESHEEQFRAIISSVGNEGWNVQLYQDNVPLE
QGQTYEVSFDAKSTIDRKIIVQLQRNGTSDNNWDSYFYQEVELTNELKTFKYEFTMSKPT
DSASRFNFALGNTENKTYAPHEIIIDNVVVRKVATPSALILNGTFDDGMDHWLLYWGDGE
GNCDVTDGELEINITKVGTADYMPQIKQENIALQEGVTYTLSLKARALEARSIKVDILDS
SYNWYGGTIFDLTTEDAVYTFTFTQSKSINNGVLTINLGTIEGKTSAATTVYLDDILLEQ
Q
Download sequence
Identical sequences A3DJ79
203119.Cthe_2809 WP_020457929.1.31213 gi|125975291|ref|YP_001039201.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]