SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125973912|ref|YP_001037822.1| from Clostridium thermocellum ATCC 27405

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|125973912|ref|YP_001037822.1|
Domain Number 1 Region: 38-443
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 6.67e-88
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.000000256
Further Details:      
 
Domain Number 2 Region: 529-758
Classification Level Classification E-value
Superfamily Oligoxyloglucan reducing end-specific cellobiohydrolase 4.84e-44
Family Oligoxyloglucan reducing end-specific cellobiohydrolase 0.000015
Further Details:      
 
Domain Number 3 Region: 774-840
Classification Level Classification E-value
Superfamily Type I dockerin domain 7.46e-19
Family Type I dockerin domain 0.00056
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) gi|125973912|ref|YP_001037822.1|
Sequence length 842
Comment dockerin type I cellulosome protein [Clostridium thermocellum ATCC 27405]
Sequence
MVKKFTSKIKAAVFAAVVAATAIFGPAISSQAVTSVPYKWDNVVIGGGGGFMPGIVFNET
EKDLIYARADIGGAYRWDPSTETWIPLLDHFQMDEYSYYGVESIATDPVDPNRVYIAAGM
YTNDWLPNMGAILRSTDRGETWEKTILPFKMGGNMPGRSMGERLAIDPNDNRILYLGTRC
GNGLWRSTDYGVTWSKVESFPNPGTYIYDPNFDYTKDIIGVVWVVFDKSSSTPGNPTKTI
YVGVADKNESIYRSTDGGVTWKAVPGQPKGLLPHHGVLASNGMLYITYGDTCGPYDGNGK
GQVWKFNTRTGEWIDITPIPYSSSDNRFCFAGLAVDRQNPDIIMVTSMNAWWPDEYIFRS
TDGGATWKNIWEWGMYPERILHYEIDISAAPWLDWGTEKQLPEINPKLGWMIGDIEIDPF
NSDRMMYVTGATIYGCDNLTDWDRGGKVKIEVKATGIEECAVLDLVSPPEGAPLVSAVGD
LVGFVHDDLKVGPKKMHVPSYSSGTGIDYAELVPNFMALVAKADLYDVKKISFSYDGGRN
WFQPPNEAPNSVGGGSVAVAADAKSVIWTPENASPAVTTDNGNSWKVCTNLGMGAVVASD
RVNGKKFYAFYNGKFYISTDGGLTFTDTKAPQLPKSVNKIKAVPGKEGHVWLAAREGGLW
RSTDGGYTFEKLSNVDTAHVVGFGKAAPGQDYMAIYITGKIDNVLGFFRSDDAGKTWVRI
NDDEHGYGAVDTAITGDPRVYGRVYIATNGRGIVYGEPASDEPVPTPPQVDKGLVGDLNG
DNRINSTDLTLMKRYILKSIEDLPVEDDLWAADINGDGKINSTDYTYLKKYLLQAIPELP
KK
Download sequence
Identical sequences A3DFA0
gi|125973912|ref|YP_001037822.1| gi|385778206|ref|YP_005687371.1| WP_003518268.1.19387 WP_003518268.1.20586 WP_003518268.1.31213 WP_003518268.1.55520 WP_003518268.1.60145 WP_003518268.1.6636 WP_003518268.1.6965 203119.Cthe_1398

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]