SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_003516552.1.55520 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  WP_003516552.1.55520
Domain Number 1 Region: 314-751
Classification Level  Classification               E-value
Superfamily           Six-hairpin glycosidases     1.3e-103
Family                Cellulases catalytic domain  0.00000843
 
Domain Number 2 Region: 35-228
Classification Level  Classification                                             E-value
Superfamily           Galactose-binding domain-like                              1.64e-57
Family                Family 30 carbohydrate binding module, CBM30 (PKD repeat)  0.00000000534
 
Domain Number 3 Region: 804-883,920-1054,1081-1177
Classification Level  Classification       E-value
Superfamily           (Trans)glycosidases  5.68e-25
Family                beta-glycanases      0.019
 
Domain Number 4 Region: 1356-1444
Classification Level  Classification  E-value
Superfamily           PKD domain      3.27e-21
Family                PKD domain      0.003
 
Domain Number 5 Region: 204-311
Classification Level  Classification                            E-value
Superfamily           E set domains                             1.4e-18
Family                E-set domains of sugar-utilizing enzymes  0.0019
 
Domain Number 6 Region: 1288-1352
Classification Level  Classification          E-value
Superfamily           Type I dockerin domain  1.44e-18
Family                Type I dockerin domain  0.00038
 
Domain Number 7 Region: 1188-1282
Classification Level  Classification                                                    E-value
Superfamily           Glycosyl hydrolase domain                                         0.0000228
Family                Composite domain of glycosyl hydrolase families 5, 30, 39 and 51  0.079
 
Weak hits

Sequence:  WP_003516552.1.55520
Domain Number - Region: 1452-1598
Classification Level  Classification                                E-value
Superfamily           Galactose-binding domain-like                 0.000655
Family                Family 28 carbohydrate binding module, CBM28  0.07
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.



Protein sequence

External link(s): WP_003516552.1.55520
Sequence length: 1601
Comment: glycoside hydrolase [Ruminiclostridium thermocellum]; AA=GCF_000255575.1; RF=na; TAX=1094188; STAX=1515; NAME=Ruminiclostridium thermocellum YS; strain=YS; AL=Contig; RT=Major
Sequence:
MAKRRLSLLLVLAIMFTMVVPQISASAETVAPEGYRKLLDVQIFKDSPVVGWSGSGMGEL
ETIGDTLPVDTTVTYNGLPTLRLNVQTTVQSGWWISLLTLRGWNTHDLSQYVENGYLEFD
IKGKEGGEDFVIGFRDKVYERVYGLEIDVTTVISNYVTVTTDWQHVKIPLRDLMKINNGF
DPSSVTCLVFSKRYADPFTVWFSDIKITSEDNEKSAPAIKVNQLGFIPEAEKYALVTGFA
EELAVSEGDEFAVINAADNSVAYTGKLTLVTEYEPLDSGEKILKADFSDLTVPGKYYISI
EGLDNSPKFEIGEGIYGPLVVDAARYFYYQRQGIELEEPYAQGYPRKDVTPQDAYAVFAS
GKKDPIDITKGWYDAGDFGKYVNAGATGVSDLFWAYEMFPSQFVDGQFNIPESGNGVPDI
LDEARWELEWMLKMQDKESGGFYPRVQSDNDENIKSRIIRDQNGCTTDDTACAAGILAHA
YLIYKDIDPDFAQECLDAAINAWKFLEKNPENIVSPPGPYNVYDDSGDRLWAAASLYRAT
GEEVYHTYFKQNYKSFAQKFESPTAYAHTWGDMWLTAFLSYLKAENKDQEVVDWIDTEFG
IWLENILTRYENNPWKNAIVPGNYFWGINMQVMNVPMDAIIGSQLLGKYSDRIEKLGFGS
LNWLLGTNPLRFSFVSGYGEDSVKGVFSNIYNTDGKQGIPKGYMPGGPNAYEGAGLSRFA
AKCYTRSTGDWVANEHTVYWNSALVFMAAFANQGSEVNPGPAPEPGVTPNPTEPAKVVDI
RIDTSAERKPISPYIYGSNQELDATVTAKRFGGNRTTGYNWENNFSNAGSDWLHYSDTYL
LEDGGVPKGEWSTPASVVTTFHDKALSKNVPYTLITLQAAGYVSADGNGPVSQEETAPSS
RWKEVKFEKGAPFSLTPDTEDDYVYMDEFVNYLVNKYGNASTPTGIKGYSIDNEPALWSH
THPRIHPDNVTAKELIEKSVALSKAVKKVDPYAEIFGPALYGFAAYETLQSAPDWGTEGE
GYRWFIDYYLDKMKKASDEEGKRLLDVLDVHWYPEARGGGERICFGADPRNIETNKARLQ
APRTLWDPTYIEDSWIGQWKKDFLPILPNLLDSIEKYYPGTKLAITEYDYGGGNHITGGI
AQADVLGIFGKYGVYLATFWGDASNNYTEAGINLYTNYDGKGGKFGDTSVKCETSDIEVS
SAYASIVGEDDSKLHIILLNKNYDQPTTFNFSIDSSKNYTIGNVWAFDRGSSNITQRTPI
VNIKDNTFTYTVPALTACHIVLEAAEPVVYGDLNNDSKVNAVDIMMLKRYILGIIDNINL
TAADIYFDGVVNSSDYNIMKRYLLKAIEDIPYVPENQAPKAIFTFSPEDPVTDENVVFNA
SNSIDEDGTIAYYVWDFGDGYEGTSTTPTITYKYKNPGTYKVKLIVTDNQGASSSFTATI
KVTSATGDNSKFNFEDGTLGGFTTSGTNATGVVVNTTEKAFKGERGLKWTVTSEGEGTAE
LKLDGGTIVVPGTTMTFRIWIPSGAPIAAIQPYIMPHTPDWSEVLWNSTWKGYTMVKTDD
WNEITLTLPEDVDPTWPQQMGIQVQTIDEGEFTIYVDAIDW
Identical sequences: A3DD30
203119.Cthe_0624 gi|385778951|ref|YP_005688116.1| gi|125973142|ref|YP_001037052.1| WP_003516552.1.16390 WP_003516552.1.19387 WP_003516552.1.20586 WP_003516552.1.31213 WP_003516552.1.55520 WP_003516552.1.60145 WP_003516552.1.6636 WP_003516552.1.6965
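
The Region values in the domain assignment details use a compact notation: a single start-end range for a contiguous domain, or several comma-separated ranges for a domain assembled from discontiguous segments (Domain Number 3). The following is a minimal Python sketch of how such strings could be parsed and compared. It is not part of SUPERFAMILY; the function names and the STRONG_HITS dictionary are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# Illustrative helpers only; names are hypothetical, not a SUPERFAMILY API.
# Assumption: region coordinates are 1-based and inclusive.

def parse_region(region):
    """Turn '804-883,920-1054' into [(804, 883), (920, 1054)]."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Number of residues covered by all segments of a region string."""
    return sum(end - start + 1 for start, end in parse_region(region))


def overlap_length(region_a, region_b):
    """Number of residue positions shared by two region strings."""
    shared = 0
    for a_start, a_end in parse_region(region_a):
        for b_start, b_end in parse_region(region_b):
            shared += max(0, min(a_end, b_end) - max(a_start, b_start) + 1)
    return shared


# Strong-hit regions copied from the assignment details, keyed by domain number.
STRONG_HITS = {
    1: "314-751",
    2: "35-228",
    3: "804-883,920-1054,1081-1177",
    4: "1356-1444",
    5: "204-311",
    6: "1288-1352",
    7: "1188-1282",
}

if __name__ == "__main__":
    for number, region in sorted(STRONG_HITS.items()):
        print(number, region, region_length(region))
    print("overlap of domains 2 and 5:", overlap_length(STRONG_HITS[2], STRONG_HITS[5]))
```

Under these assumptions, the three segments of Domain Number 3 cover 312 residues in total, and the regions of Domain Number 2 (35-228) and Domain Number 5 (204-311) share 25 positions (204-228).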
