SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|125974991|ref|YP_001038901.1| from Clostridium thermocellum ATCC 27405

Domain assignment details

Strong hits

Sequence:  gi|125974991|ref|YP_001038901.1|
Domain Number 1 Region: 223-438
Classification Level   Classification                                     E-value
Superfamily            Concanavalin A-like lectins/glucanases             5.4e-39
Family                 Clostridium neurotoxins, the second last domain    0.077
Domain Number 2 Region: 51-214
Classification Level   Classification                             E-value
Superfamily            Carbohydrate-binding domain                5.76e-39
Family                 Cellulose-binding domain family III        0.0000889
Weak hits

Sequence:  gi|125974991|ref|YP_001038901.1|
Domain Number - Region: 494-553
Classification Level   Classification                                 E-value
Superfamily            Galactose-binding domain-like                  0.0133
Family                 Family 6 carbohydrate binding module, CBM6     0.026
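
The region coordinates listed above can be checked against the reported sequence length of 1013. The following is a minimal sketch, not part of the SUPERFAMILY server: the `DomainHit` class and the helper functions are hypothetical names introduced for illustration, and the code assumes the regions are 1-based and inclusive, which the page does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One superfamily-level assignment copied from the record above."""
    superfamily: str
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    evalue: float
    strong: bool    # True for "Strong hits", False for "Weak hits"

    @property
    def length(self) -> int:
        return self.end - self.start + 1


SEQUENCE_LENGTH = 1013  # "Sequence length" reported in the record

HITS = [
    DomainHit("Concanavalin A-like lectins/glucanases", 223, 438, 5.4e-39, True),
    DomainHit("Carbohydrate-binding domain", 51, 214, 5.76e-39, True),
    DomainHit("Galactose-binding domain-like", 494, 553, 0.0133, False),
]


def overlaps(a: DomainHit, b: DomainHit) -> bool:
    """Return True if the two inclusive regions share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def slice_region(sequence: str, hit: DomainHit) -> str:
    """Extract the residues of a hit from a full-length sequence string."""
    return sequence[hit.start - 1:hit.end]


# Order the hits along the sequence, N-terminus first.
ordered = sorted(HITS, key=lambda h: h.start)

for hit in ordered:
    kind = "strong" if hit.strong else "weak"
    print(f"{hit.start:>4}-{hit.end:<4} {hit.length:>4} aa  {kind:<6} {hit.superfamily}")
```

Under the stated coordinate assumption, the three regions span 164, 216 and 60 residues, do not overlap, and all fall within the 1013-residue sequence. `slice_region` can be applied to the protein sequence given further down the page once its lines are joined into a single string.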

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): gi|125974991|ref|YP_001038901.1|
Sequence length: 1013
Comment: S-layer-like domain-containing protein [Clostridium thermocellum ATCC 27405]
Sequence
MKEKIGKVLRQNKSIISVVVITAILFVYNTGMLFTGIWEGALYNVKAEEVPLKLEFFNNV
KDDNVTLISPYFRVINNSSSDEIYLQHVKIRYYFTLDSSDSEETMNYEIYYAGKSNIDGT
GAVEDIKPNTIVKIAKMDIPTDMADHYLEIGFDESCGTIGPDKKVEVMVSISKEKYKKFI
QTNDYSYNDSAENYVSWEKVTLYLDGELISGIEPNMYASRETGAWYMFDEAVEGSTNEFK
DYKGNHGNAVLYSANGVVPGLNGNSVSLDGVDDYVALPDGIAGTFYNFTIAFWVRLDTIG
EQPIFDFFDSGSNNKYMRLTAESDGKIKFAMTQSGYYGEKTITSGSALTEGVWKHVAVTL
SGDTGTLYINGENVGENNTLSLRPLTFLGETSKGYIGKSHQTDSSEDPYYNSYLHGMIDD
FRIFDRALSADEIKTLASVATRVNDSDPGIHYSSGWSHSQERDKGDYLNDVHELDSPDGE
NCFEYTFTGTGVNVIAPQCSDNGDAEIYIDGKLMKSVAMSVYSGYNSQAVVYSKLGLSLG
THTIKVVFKNGIGIIDALDIMTGEIVSPSPTPTPSPTPSPTPTPSPTPSPTPTPTPTPSS
TPTSTPTPEPSPTSTPTPEPSPTSTPTPEPEPEPTSTPTPELSPSSTPVPTPTPTPTPAP
NPAPEPVPISTPVPEPILIPTPTPTMTPMPTPTPTLEVKSDPYLSDLVVTGAKLKPAFVP
DILNYEAVAEEDVRFVCIVAYARDDGAEITLNGVPVKSGSISHAVELKEGKNELIVKVVA
EDGITSRTYRISVLLEALQLPTPTPDKSGNPFFSSLEDLLKENEVSPDGTKGGIFDDVPR
GYWAEEYIQKLYEKGIISGIDEKTFMPGRPITRAEFTQIIVNSLKIPYREAGLHFNDVTE
KDWYYKSVSSAAAFGIVVGRPDGSFAPNEFITRQDMAVVIAKFLEKKHDGNLEGMGKGLV
FADSGNISEYARDSVAAVVSQGLMVGKPGNMFDPKGLTTRAEAVTVICKLMKY
Identical sequences A3DIC9
gi|125974991|ref|YP_001038901.1| 203119.Cthe_2506 WP_020457846.1.31213
