SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_003519375.1.31213 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_003519375.1.31213
Domain Number 1 Region: 204-364
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 1.17e-54
Family Cellulose-binding domain family III 0.00000708
Further Details:      
 
Domain Number 2 Region: 33-194
Classification Level Classification E-value
Superfamily Carbohydrate-binding domain 9.27e-53
Family Cellulose-binding domain family III 0.00000655
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) WP_003519375.1.31213
Sequence length 688
Comment Cell surface glycoprotein 2 [Ruminiclostridium thermocellum]; AA=GCF_000015865.1; RF=representative genome; TAX=203119; STAX=1515; NAME=Ruminiclostridium thermocellum ATCC 27405; strain=ATCC 27405; AL=Complete Genome; RT=Major
Sequence
MKKNNVLTIAAMIALLLTSLLTSITFGETSSIPSRISMELDKTKANIGDIIIATIRIDNI
NNFSGYQLNIKYDPSYLQAVNPLTGEPIKKRTMPAVNGTVLLKGDQYSITEVVENNVDEG
ILNFGKGYANLTEYRKSGKPETTGIIGKIGFKALKLGKTEIKFENTPVMPGAKEGTLLFD
WDAETITEYNVIQPKELAITLPDDAHIALELDKTKVKVGDVIVATVKAKNMTSMAGIQVN
IKYDPEVLQAIDPATGKPFTKETLLVDPELLSNREYNPLLTAVNDINSGIINYASCYVYW
DSYRESGVSESTGIIGKVGFKVLKAANTTVKLEETRFTPNSIDGTLVIDWYGQQIVGYKV
IQPDKITVISEPEVPTQTPTQTPPTTTAPSQTPTQTPPTTTAPSQTPTQTPAVTPTQSAT
PSDPGGGGGGLPGGGGGAVNPSASPTPTPTSKPTPTATKKPEPTEIEEPEPEIPGTVGIH
YSYLTGYPDKMFRPEKSITRAEAAVIFAKLLGANENTKINYNVSYTDVDSSHWASWAIKF
VSYKKLFTGYPDGSFKPNQNITRAEFSTVVFKLLVSEKGLKEEKIEKSKFGDTKGHWAQQ
FIEQLSDLGYINGYPDGTFKPNNNIKRSESVALINRAMGRGPLHGAPQVFEDVPQTHWAF
KDIAEGVLNHRYKLDNEGKEQLLEIIDN
Download sequence
Identical sequences Q06853
gi|385777995|ref|YP_005687160.1| 203119.Cthe_3079 gi|125975558|ref|YP_001039468.1| WP_003519375.1.19387 WP_003519375.1.20586 WP_003519375.1.31213 WP_003519375.1.55520 WP_003519375.1.6636 WP_003519375.1.6965

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]