SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125975242|ref|YP_001039152.1| from Clostridium thermocellum ATCC 27405

Domain architecture


Domain assignment details

Strong hits

Sequence: gi|125975242|ref|YP_001039152.1|

Domain  Region   Level        Classification                           E-value
1       56-546   Superfamily  Six-hairpin glycosidases                 1.21e-123
                 Family       Cellulases catalytic domain              0.0000000258
2       739-889  Superfamily  Carbohydrate-binding domain              2.04e-43
                 Family       Cellulose-binding domain family III      0.00019
3       564-719  Superfamily  Carbohydrate-binding domain              3.14e-39
                 Family       Cellulose-binding domain family III      0.00025
4       895-961  Superfamily  Type I dockerin domain                   6.87e-17
                 Family       Type I dockerin domain                   0.00052
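
The domain numbers above do not follow sequence order (domain 3 lies before domain 2 along the protein). As a minimal sketch, assuming the regions are 1-based and inclusive, the following Python snippet sorts the four regions from N- to C-terminus, computes each domain length, and lists the stretches with no strong hit. The function name `domain_layout` is hypothetical and not part of SUPERFAMILY; the regions and the sequence length of 961 are copied from this page.

```python
# Regions copied from the strong-hit assignments on this page.
# Assumption: coordinates are 1-based and inclusive.
domains = [
    ("Six-hairpin glycosidases", 56, 546),
    ("Carbohydrate-binding domain", 739, 889),
    ("Carbohydrate-binding domain", 564, 719),
    ("Type I dockerin domain", 895, 961),
]
SEQUENCE_LENGTH = 961


def domain_layout(domains, seq_len):
    """Return domains in sequence order with lengths, plus unassigned gaps."""
    ordered = sorted(domains, key=lambda d: d[1])
    layout = [(name, start, end, end - start + 1) for name, start, end in ordered]

    gaps = []
    cursor = 1  # first residue not yet covered by a domain
    for _, start, end, _ in layout:
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= seq_len:
        gaps.append((cursor, seq_len))
    return layout, gaps


layout, gaps = domain_layout(domains, SEQUENCE_LENGTH)
for name, start, end, length in layout:
    print(f"{start:>4}-{end:<4} {length:>4} aa  {name}")
print("Unassigned regions:", gaps)
```

Under these assumptions the order along the sequence is catalytic domain, two carbohydrate-binding domains, then the dockerin domain, with short unassigned stretches between them and at the N-terminus.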
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) gi|125975242|ref|YP_001039152.1|
Sequence length 961
Comment glycoside hydrolase family protein [Clostridium thermocellum ATCC 27405]
Sequence
MTDSPQKRILKRKNRCKGLITAVIIAAQLLTSSILFAEAPPATFTPAENWEDYDYFNFAE
ALQKSLYFYDAQKCGVEAGYDHGGRLEWRGACHEVDERIPISNTSLSEAFLAKYRHIIDP
DGDGTVDVHGGFHDAGDHVRFGLPQSYTAGTLGWGFYEFRESFRAIGEEEHMIEILRYFT
DTFLRCSFMDEEGNIVAFCYMVGEGDEDHCYWGPPELYPEEYLRSRPADFATFDDPGSDV
CASTAAALCTSYLNFKDEDPEYAEKCLTVAKALYDFAVKYRGLHKGDGYYTSDYDEDELA
WAAVWLYECTGDMKYINDIVAVDETGNYVGYMKRIIPDTFKQNVWYNSWVHCWDAVWGGT
FIKLNELFPENELFDFIARWNVEYLSGGKCPHEDPNDHNYCKPSPAGYTMINGWGSARYN
AAAQLCALVYMKNNPDRTDFGEWAKSQMEYLMGRNPMGYSYIVGYGYEKGLPFAKHPHHR
AAHGSKTNSMNDPEEHRHILWGALVGGPDLNDYHIDSTTEYAYNEVAVDYNAAFVGALAG
LYKYYGQGHEPIPNFPPLEPETDDYFCEAKIVRETKDSTQVLLRIHNESTRPPHYETGMM
ARYFFNISELIENGQSIDDVIFTIEYDEQISMQQEPVVYRGPFKWDDAGTYYFEFDWSGR
KIYGDRELQISFRVKQDSNYMTHWDSSNDYSRQGLTNEYAISKNVPVYLNGVKVYGEEPP
KLSPTPTPTIDPSQTPDANASISVSYKCGVKDGTKNTIRATINIKNTGTTPVNLSDIKVR
YWFTSDGNEQNNFVCDYAAFGTDKVKGIVKKIENSVPGADTYCEISFTEDAGRLAPGGST
GTIPFRIEGAAEYDQTDDYSYNSEMSDDFGDNTKITAYIKDKLKYGVEPVTIIDITLGDL
NYDGKVNSTDYLVLKRYLLGTIDKESDPNFLKAADLNRDGRVNSTDMSLMKRYLLGIITS
F
Identical sequences A3DJ30
gi|125975242|ref|YP_001039152.1| 203119.Cthe_2760 WP_020457910.1.31213 Cth-1914
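
The identifier gi|125975242|ref|YP_001039152.1| uses the legacy NCBI pipe-delimited form: a GI number, a database tag (`ref` denotes RefSeq), and a versioned accession. The sketch below splits such an identifier into its fields. The function name `parse_gi_identifier` is hypothetical, and the parser assumes exactly the four-field `gi|number|db|accession` layout seen on this page; other identifier styles listed above (for example A3DJ30 or 203119.Cthe_2760) are not handled.

```python
def parse_gi_identifier(identifier):
    """Split a 'gi|<number>|<db>|<accession.version>|' identifier into fields."""
    fields = identifier.strip("|").split("|")
    if len(fields) != 4 or fields[0] != "gi":
        raise ValueError(f"not a gi|number|db|accession identifier: {identifier!r}")
    _, gi_number, database, versioned = fields
    # The accession may carry a '.<version>' suffix.
    accession, _, version = versioned.partition(".")
    return {
        "gi": int(gi_number),
        "database": database,
        "accession": accession,
        "version": int(version) if version else None,
    }


print(parse_gi_identifier("gi|125975242|ref|YP_001039152.1|"))
```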
