SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_003517663.1.16390 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_003517663.1.16390
Domain Number 1 Region: 28-479
Classification Level Classification E-value
Superfamily Six-hairpin glycosidases 1.06e-154
Family Cellulases catalytic domain 0.00000000000826
Further Details:      
 
Domain Number 2 Region: 492-637
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 2.56e-46
Family Cellulose-binding domain family III 0.0000133
Further Details:      
 
Domain Number 3 Region: 663-734
Classification Level Classification E-value
Superfamily Type I dockerin domain 5.4e-17
Family Type I dockerin domain 0.00036
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_003517663.1.16390
Sequence length 736
Comment endoglucanase [Ruminiclostridium thermocellum]; AA=GCF_000493655.1; RF=na; TAX=1349417; STAX=1515; NAME=Ruminiclostridium thermocellum BC1; AL=Contig; RT=Major
Sequence
MKKLIITVIVSAVLLTALIPQLPVFAADYNYGEALQKAIMFYEFQMSGKLPDNIRNNWRG
DSCLGDGSDVGLDLTGGWFDAGDHVKFNLPMAYTATMLAWAVYEYKDALQKSGQLGYLMD
QIKWASDYFIRCHPEKYVYYYQVGNGDMDHRWWVPAECIDVQAPRPSYKVDLSNPGSTVT
AGTAAALAATALVFKDTDPAYAALCIRHAKELFDFAETTMSDKGYTAALNFYTSHSGWYD
ELSWAGAWIYLADGDETYLEKAEKYVDKWPIESQTTYIAYSWGHCWDDVHYGAALLLAKI
TNKSLYKEAIERHLDYWTVGFNGQRVRYTPKGLAHLTDWGVLRHATTTAFLACVYSDWSE
CPREKANIYIDFAKKQADYALGSSGRSYVVGFGVNPPQHPHHRTAHSSWCDSQKVPEYHR
HVLYGALVGGPDASDAYVDDIGNYVTNEVACDYNAGFVGLLAKMYEKYGGNPIPNFMAIE
EKTNEEIYVEATANSNNGVELKTYLYNKSGWPARVCDKLSFRYFMDLTEYVSAGYNPNDI
TVSIIYSAAPTAKISKPILYDASKNIYYCEIDLSGTKIFPGSNSDHQKETQFRIQPPAGA
PWDNTNDFSYQGIKKNGEVVKEMPVYEDGVLIFGVEPNGTGPATPTPKPSVNPSPSPTPT
SDILYGDINLDGKINSSDVTLLKRYIVKSIDVFPTADPERSLIASDVNGDGRVNSTDYSY
LKRYVLKIIPTIPGNS
Download sequence
Identical sequences gi|385779002|ref|YP_005688167.1| WP_003517663.1.16390 WP_003517663.1.19387 WP_003517663.1.20586 WP_003517663.1.55520 WP_003517663.1.6636 WP_003517663.1.6965

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]