SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_014522595.1.19387 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_014522595.1.19387
Domain Number 1 Region: 33-191
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 3.14e-56
Family Cellulose-binding domain family III 0.00000164
Further Details:      
 
Domain Number 2 Region: 403-565
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 3.66e-56
Family Cellulose-binding domain family III 0.00000367
Further Details:      
 
Domain Number 3 Region: 605-766
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 9.57e-56
Family Cellulose-binding domain family III 0.00000719
Further Details:      
 
Domain Number 4 Region: 202-363
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 3.4e-55
Family Cellulose-binding domain family III 0.00000331
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_014522595.1.19387
Sequence length 1615
Comment cellulosome anchoring protein cohesin subunit [Ruminiclostridium thermocellum]; AA=GCF_000184925.1; RF=na; TAX=637887; STAX=1515; NAME=Ruminiclostridium thermocellum DSM 1313; strain=DSM 1313; AL=Complete Genome; RT=Major
Sequence
MKRKNKVLSILLTLLLIISTTSVNMSFAEATPSIEMVLDKTEVHVGDVITATIKVNNIRK
LAGYQLNIKFDPEVLQPVDPATGEEFTDKSMPVNRVLLTNSKYGPTPVAGNDIKSGIINF
ATGYNNLTAYKSSGIDEHTGIIGEIGFKVLKKQNTSIRFEDTLSMPGAISGTSLFDWDAE
TITGYEVIQPDLIVVEAEPLKDASVALELDKTKVKVGDIITATIKIENMKNFAGYQLNIK
YDPTMLEAIELETGSAIAKRTWPVTGGTVLQSDNYGKTTAVANDVGAGIINFAEAYSNLT
KYRETGVAEETGIIGKIGFRVLKAGSTAIRFEDTTAMPGAIEGTYMFDWYGENIKGYSVV
QPGEIVAEGEEPGEEPTEEPVPTETSADPTPTVTEEPVPSELPDSYVIMELDKTKVKVGD
IITATIKIENMKNFAGYQLNIKYDPTMLEAIELETGSAIAKRTWPVTGGTVLQSDNYGKT
TAVANDVGAGIINFAEAYSNLTKYRETGVAEETGIIGKIGFRVLKAGSTAIRFEDTTAMP
GAIEGTYMFDWYGENIKGYSVVQPGEIVAEGEEPGEEPTEEPVPTETPVDPTPTVTEEPV
PSELPDSYVIMELDKTKVKEGDVIIATIRVNNIKNLAGYQIGIKYDPKVLEAFNIETGDP
IDEGTWPAVGGTILKNRDYLPTGVAINNVSKGILNFAAYYVYFDDYREEGKSEDTGIIGN
IGFRVLKAEDTTIRFEELESMPGSIDGTYMLDWYLNRISGYVVIQPAPIKAASDEPIPTD
TPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDE
PTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSE
TPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTP
SDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPT
PSDEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEP
TPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTPSET
PEEPIPTDTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPS
DEPTPSDEPTPSDEPTPSETPEEPIPTDTPSDEPTPSDEPTPSDEPTPSDEPTPSDEPTP
SETPEEPTPSDEPTPSDEPTPSETPEEPTPSDEPTPSDEPTPSETPEEPTPSDEPTPSDE
PTPSETPEEPTPTTTPTPTPSTTPTSGSGGSGGSGGGGGGGGGTVPTSPTPTPTSKPTST
PAPTEIEEPTPSDVPGAIGGEHRAYLRGYPDGSFRPERNITRAEAAVIFAKLLGADESYG
AQSASPYSDLADTHWAAWAIKFATSQGLFKGYPDGTFKPDQNITRAEFATVVLHFLTKVK
GQEIMSKLATIDISNPKFDDCVGHWAQEFIEKLTSLGYISGYPDGTFKPQNYIKRSESVA
LINRALERGPLNGAPKLFPDVNESYWAFGDIMDGALDHSYIIEDEKEKFVKLLED
Download sequence
Identical sequences WP_014522595.1.19387 gi|385777994|ref|YP_005687159.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]