SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_020457846.1.31213 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

Strong hits

Sequence:  WP_020457846.1.31213
Domain Number 1 Region: 223-438
Classification Level  Classification                                   E-value
Superfamily           Concanavalin A-like lectins/glucanases           5.4e-39
Family                Clostridium neurotoxins, the second last domain  0.077
 
Domain Number 2 Region: 51-214
Classification Level  Classification                        E-value
Superfamily           Carbohydrate-binding domain           5.76e-39
Family                Cellulose-binding domain family III   0.0000889
 
Weak hits

Sequence:  WP_020457846.1.31213
Domain Number - Region: 494-553
Classification Level  Classification                               E-value
Superfamily           Galactose-binding domain-like                0.0133
Family                Family 6 carbohydrate binding module, CBM6   0.026
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): WP_020457846.1.31213
Sequence length: 1013
Comment: hypothetical protein [Ruminiclostridium thermocellum]; AA=GCF_000015865.1; RF=representative genome; TAX=203119; STAX=1515; NAME=Ruminiclostridium thermocellum ATCC 27405; strain=ATCC 27405; AL=Complete Genome; RT=Major
Sequence
MKEKIGKVLRQNKSIISVVVITAILFVYNTGMLFTGIWEGALYNVKAEEVPLKLEFFNNV
KDDNVTLISPYFRVINNSSSDEIYLQHVKIRYYFTLDSSDSEETMNYEIYYAGKSNIDGT
GAVEDIKPNTIVKIAKMDIPTDMADHYLEIGFDESCGTIGPDKKVEVMVSISKEKYKKFI
QTNDYSYNDSAENYVSWEKVTLYLDGELISGIEPNMYASRETGAWYMFDEAVEGSTNEFK
DYKGNHGNAVLYSANGVVPGLNGNSVSLDGVDDYVALPDGIAGTFYNFTIAFWVRLDTIG
EQPIFDFFDSGSNNKYMRLTAESDGKIKFAMTQSGYYGEKTITSGSALTEGVWKHVAVTL
SGDTGTLYINGENVGENNTLSLRPLTFLGETSKGYIGKSHQTDSSEDPYYNSYLHGMIDD
FRIFDRALSADEIKTLASVATRVNDSDPGIHYSSGWSHSQERDKGDYLNDVHELDSPDGE
NCFEYTFTGTGVNVIAPQCSDNGDAEIYIDGKLMKSVAMSVYSGYNSQAVVYSKLGLSLG
THTIKVVFKNGIGIIDALDIMTGEIVSPSPTPTPSPTPSPTPTPSPTPSPTPTPTPTPSS
TPTSTPTPEPSPTSTPTPEPSPTSTPTPEPEPEPTSTPTPELSPSSTPVPTPTPTPTPAP
NPAPEPVPISTPVPEPILIPTPTPTMTPMPTPTPTLEVKSDPYLSDLVVTGAKLKPAFVP
DILNYEAVAEEDVRFVCIVAYARDDGAEITLNGVPVKSGSISHAVELKEGKNELIVKVVA
EDGITSRTYRISVLLEALQLPTPTPDKSGNPFFSSLEDLLKENEVSPDGTKGGIFDDVPR
GYWAEEYIQKLYEKGIISGIDEKTFMPGRPITRAEFTQIIVNSLKIPYREAGLHFNDVTE
KDWYYKSVSSAAAFGIVVGRPDGSFAPNEFITRQDMAVVIAKFLEKKHDGNLEGMGKGLV
FADSGNISEYARDSVAAVVSQGLMVGKPGNMFDPKGLTTRAEAVTVICKLMKY
Identical sequences: A3DIC9, gi|125974991|ref|YP_001038901.1|, WP_020457846.1.31213, 203119.Cthe_2506
