SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125973786|ref|YP_001037696.1| from Clostridium thermocellum ATCC 27405

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|125973786|ref|YP_001037696.1|
Domain Number 1 Region: 34-338
Classification Level Classification E-value
Superfamily Arabinanase/levansucrase/invertase 8.85e-68
Family alpha-L-arabinanase-like 0.0019
Further Details:      
 
Domain Number 2 Region: 480-606
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.5e-37
Family Family 6 carbohydrate binding module, CBM6 0.0000218
Further Details:      
 
Domain Number 3 Region: 335-462
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.19e-33
Family Family 6 carbohydrate binding module, CBM6 0.00023
Further Details:      
 
Domain Number 4 Region: 613-678
Classification Level Classification E-value
Superfamily Type I dockerin domain 9.68e-19
Family Type I dockerin domain 0.00027
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) gi|125973786|ref|YP_001037696.1|
Sequence length 679
Comment carbohydrate-binding family 6 protein [Clostridium thermocellum ATCC 27405]
Sequence
MPKKLKKIIKLCSMVFVIGILTLLLPEKGAADYPIFSQRFTADPAAVVYNGRLYIYCSHD
SDATPGQSTYNIPDITCISTDDLKNWTDHGEVFNAKRDSRWASVSWAPSIVYRNNKFYLY
YGNGGNGIGVAVSDSPTGPFKDPLPGPLVSWNTPGVQPAQNMWLFDPGVFVDDDGQAYMY
FGGNGQNNIRVIKLGNDMISTVGSAMTMSAPRFFEAAYMHKYNGKYYFSYASDFSQGASK
IEYMMSDKPTTGFQYKGVILPQPPDNYSNNNHHAIVEYKGNWYVVYHNRTVAKQRGLDPV
YQRNVCIDQMFYNADGTIKQVVPTVDGLKQLKYVDPYTKNLAVTMHKESGIETEECSEGG
RNVAFIENGDWIQVKGVDFGNVGPTSFEARVASATNGGNIEIRLDSPTGTLIGTCKVEGT
GDWQKWVTKTCSVSKVTGVHDLFFRFTGGSGYLFNFSWWKFNSDATPTPTPPPQPSTVPV
TERSAFSKIEVEDFNDIKSSTIQKIGTPNGGSGIGYIENGDWLAYKNIDFGNGATTFKAL
VASTLSPNIELRLDSPTGTLIGTLKVAATGGFNAYEEQSCNISKVTGKHDLYLVFSGAVN
IDWFTFGGSSGIIKRGDTNSDGKINSTDVTALKRHLLRVTQLTGDNLANADVNGDGNVNS
TDLLLLKRYILGEIENFPI
Download sequence
Identical sequences A3DEX4
gi|385778341|ref|YP_005687506.1| gi|125973786|ref|YP_001037696.1| 203119.Cthe_1271 WP_003517499.1.16390 WP_003517499.1.19387 WP_003517499.1.20586 WP_003517499.1.31213 WP_003517499.1.55520 WP_003517499.1.6636 WP_003517499.1.6965

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]