SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_020457720.1.31213 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_020457720.1.31213
Domain Number 1 Region: 199-569
Classification Level Classification E-value
Superfamily (Trans)glycosidases 3.9e-78
Family beta-glycanases 0.000000724
Further Details:      
 
Domain Number 2 Region: 43-184
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 1.99e-37
Family Cellulose-binding domain family III 0.00049
Further Details:      
 
Domain Number 3 Region: 586-656
Classification Level Classification E-value
Superfamily Type I dockerin domain 2.35e-17
Family Type I dockerin domain 0.00096
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_020457720.1.31213
Sequence length 660
Comment glycoside hydrolase [Ruminiclostridium thermocellum]; AA=GCF_000015865.1; RF=representative genome; TAX=203119; STAX=1515; NAME=Ruminiclostridium thermocellum ATCC 27405; strain=ATCC 27405; AL=Complete Genome; RT=Major
Sequence
MGQKHFKRSLLSVLTISALIISCLFSFIFVNADDTSEEPALEGLSIHYMDGTLDVKYQSM
RPYIIIHNNSGMDVDMADLRVRYYYEKEGVTEEVLTCFYTAIGADKIFAEFHPELGYAEI
GFTSDAGIIKSGGNSGQLQLVLKKISNGYYDQSNDYSYDPSYTDYAEYDKITLYYKGKLV
WGKEGPPPPPEPTPPPNNDDWLHVEGNLIKDAQGNTVYLTGINWFGFETDGANGFYGLNK
CNLEDSLDLMAKLGFNILRIPISAEIILQWKNGERVETSFVNTYENPRLDGLSSLEILDY
TINHMKKNGMKAMLDMHSSTKDSYQENLWYNKDITMEEFIEAWKWIVERYKDDDTVIAVD
LKNEPHGKYSGPNIAKWDDSDDPNNWKRAAEIIAEEILAINPNLLIVVEGVEAYPMEGYD
YTNCGEFTTYCNWWGGNLRGVADHPVVISAPDKLVYSVHDYGPDIYMQPWFKKDFDINTL
YEECWYPNWYYIVEQNIAPMLIGEWGGKLINENNRKWLECLATFISEKKLHHTFWAFNPN
SADTGGLMLEDWKTVDEEKYAIIEPTLWKKGLDHVIPLGGITEDTFKYGDVNGDFAVNSN
DLTLIKRYVLKNIDEFPSSHGLKAADVDGDEKITSSDAALVKRYVLRAITSFPVEENQNE
Download sequence
Identical sequences A3DHC1
gi|125974633|ref|YP_001038543.1| 203119.Cthe_2147 WP_020457720.1.31213

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]