SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_003517663.1.20586 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  WP_003517663.1.20586
Domain Number 1 Region: 28-479
Classification Level   Classification                            E-value
Superfamily            Six-hairpin glycosidases                  1.06e-154
Family                 Cellulases catalytic domain               0.00000000000826

Domain Number 2 Region: 492-637
Classification Level   Classification                            E-value
Superfamily            Carbohydrate-binding domain               2.56e-46
Family                 Cellulose-binding domain family III       0.0000133

Domain Number 3 Region: 663-734
Classification Level   Classification                            E-value
Superfamily            Type I dockerin domain                    5.4e-17
Family                 Type I dockerin domain                    0.00036
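The three strong hits above can be checked against the sequence length with a short script. The following is a minimal Python sketch, not part of the SUPERFAMILY server: the `Domain` class and `unassigned_regions` helper are hypothetical names introduced for illustration, and the region coordinates are assumed to be 1-based, inclusive and non-overlapping.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    """One superfamily-level assignment (hypothetical helper type)."""
    superfamily: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


SEQ_LEN = 736  # "Sequence length" reported for this protein

# Regions and superfamily names copied from the strong hits listed above.
DOMAINS = [
    Domain("Six-hairpin glycosidases", 28, 479),
    Domain("Carbohydrate-binding domain", 492, 637),
    Domain("Type I dockerin domain", 663, 734),
]


def unassigned_regions(domains, seq_len):
    """Return (start, end) stretches of the sequence not covered by any domain."""
    gaps = []
    pos = 1
    for d in sorted(domains, key=lambda d: d.start):
        if d.start > pos:
            gaps.append((pos, d.start - 1))
        pos = max(pos, d.end + 1)
    if pos <= seq_len:
        gaps.append((pos, seq_len))
    return gaps


# Total residues covered, valid under the non-overlap assumption.
covered = sum(d.length for d in DOMAINS)

if __name__ == "__main__":
    for d in DOMAINS:
        print(d.superfamily, d.start, d.end, d.length)
    print("covered:", covered, "of", SEQ_LEN)
    print("unassigned:", unassigned_regions(DOMAINS, SEQ_LEN))
```

Under these assumptions the three domains span 452, 146 and 72 residues, covering 670 of the 736 residues; the unassigned stretches are the N-terminal 27 residues, two inter-domain linkers (480-491 and 638-662) and the last two residues.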
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) WP_003517663.1.20586
Sequence length 736
Comment endoglucanase [Ruminiclostridium thermocellum]; AA=GCF_000255615.2; RF=na; TAX=1138384; STAX=1515; NAME=Ruminiclostridium thermocellum AD2; strain=AD2; AL=Complete Genome; RT=Major
Sequence
MKKLIITVIVSAVLLTALIPQLPVFAADYNYGEALQKAIMFYEFQMSGKLPDNIRNNWRG
DSCLGDGSDVGLDLTGGWFDAGDHVKFNLPMAYTATMLAWAVYEYKDALQKSGQLGYLMD
QIKWASDYFIRCHPEKYVYYYQVGNGDMDHRWWVPAECIDVQAPRPSYKVDLSNPGSTVT
AGTAAALAATALVFKDTDPAYAALCIRHAKELFDFAETTMSDKGYTAALNFYTSHSGWYD
ELSWAGAWIYLADGDETYLEKAEKYVDKWPIESQTTYIAYSWGHCWDDVHYGAALLLAKI
TNKSLYKEAIERHLDYWTVGFNGQRVRYTPKGLAHLTDWGVLRHATTTAFLACVYSDWSE
CPREKANIYIDFAKKQADYALGSSGRSYVVGFGVNPPQHPHHRTAHSSWCDSQKVPEYHR
HVLYGALVGGPDASDAYVDDIGNYVTNEVACDYNAGFVGLLAKMYEKYGGNPIPNFMAIE
EKTNEEIYVEATANSNNGVELKTYLYNKSGWPARVCDKLSFRYFMDLTEYVSAGYNPNDI
TVSIIYSAAPTAKISKPILYDASKNIYYCEIDLSGTKIFPGSNSDHQKETQFRIQPPAGA
PWDNTNDFSYQGIKKNGEVVKEMPVYEDGVLIFGVEPNGTGPATPTPKPSVNPSPSPTPT
SDILYGDINLDGKINSSDVTLLKRYIVKSIDVFPTADPERSLIASDVNGDGRVNSTDYSY
LKRYVLKIIPTIPGNS
Identical sequences gi|385779002|ref|YP_005688167.1| WP_003517663.1.16390 WP_003517663.1.19387 WP_003517663.1.20586 WP_003517663.1.55520 WP_003517663.1.6636 WP_003517663.1.6965
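
The Comment field of this record is a free-text description followed by semicolon-separated `KEY=value` pairs. The sketch below shows one way to split it into a dictionary; `parse_comment` is a hypothetical helper rather than a SUPERFAMILY or NCBI API, and it assumes that no value contains a semicolon. The meaning of the individual keys is not documented on this page, so the code treats them as opaque strings.

```python
def parse_comment(comment: str) -> dict:
    """Split a Comment line into a description plus KEY=value fields.

    Assumes fields are separated by ';' and that the first field is the
    free-text description (hypothetical convention inferred from this record).
    """
    fields = [f.strip() for f in comment.split(";")]
    record = {"description": fields[0]}
    for field in fields[1:]:
        key, sep, value = field.partition("=")
        if sep:  # skip fragments that are not KEY=value pairs
            record[key] = value
    return record


# Comment text copied verbatim from the record above.
COMMENT = (
    "endoglucanase [Ruminiclostridium thermocellum]; AA=GCF_000255615.2; "
    "RF=na; TAX=1138384; STAX=1515; NAME=Ruminiclostridium thermocellum AD2; "
    "strain=AD2; AL=Complete Genome; RT=Major"
)

record = parse_comment(COMMENT)

if __name__ == "__main__":
    for key, value in record.items():
        print(f"{key}: {value}")
```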
