SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|302875920|ref|YP_003844553.1| from Clostridium cellulovorans 743B

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|302875920|ref|YP_003844553.1|
Domain Number 1 Region: 40-392
Classification Level Classification E-value
Superfamily (Trans)glycosidases 1.56e-65
Family beta-glycanases 0.00000139
Further Details:      
 
Domain Number 2 Region: 618-947
Classification Level Classification E-value
Superfamily (Trans)glycosidases 6.73e-59
Family beta-glycanases 0.0000137
Further Details:      
 
Domain Number 3 Region: 524-614
Classification Level Classification E-value
Superfamily E set domains 2.52e-22
Family Cellulosomal scaffoldin protein CipC, module x2.1 0.0015
Further Details:      
 
Domain Number 4 Region: 1169-1333
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.87e-21
Family CBM11 0.021
Further Details:      
 
Domain Number 5 Region: 1073-1166
Classification Level Classification E-value
Superfamily E set domains 3.64e-19
Family Cellulosomal scaffoldin protein CipC, module x2.1 0.0077
Further Details:      
 
Domain Number 6 Region: 1338-1397
Classification Level Classification E-value
Superfamily Type I dockerin domain 0.0000000000000144
Family Type I dockerin domain 0.00087
Further Details:      
 
Domain Number 7 Region: 431-522
Classification Level Classification E-value
Superfamily E set domains 0.0000000000000322
Family Cellulosomal scaffoldin protein CipC, module x2.1 0.0068
Further Details:      
 
Domain Number 8 Region: 979-1075
Classification Level Classification E-value
Superfamily E set domains 0.000000000000098
Family Cellulosomal scaffoldin protein CipC, module x2.1 0.009
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) gi|302875920|ref|YP_003844553.1|
Sequence length 1398
Comment hypothetical protein Clocel_3099 [Clostridium cellulovorans 743B]
Sequence
MKKLLSLILTITMVLGMLVPVYGETNAQTAMESITKVPSGFVYREGTKFMLDGNTFYYAG
TNNYYLNFKSKAEVDDVIDDAADMGLKVIRTWGFLDVGTLNADGTLTNNVDGSGSKDGVY
FQYWDTKTNAPAYNTGDNGLKVLDYAIYKASQKGIRLLIPFTNNWEAFGGMMQYCKWLGL
SQKDMFYTNPTIKQYYKNYVNMLLNRTNAYSGIKYKDDPTIFSWELANEPRCGTDTTGDT
VVNWSKEMSEYVKTVDPYHMVCLGDEGFYNYAYNTAGIDGTWPYHGSEGIDWNRIVALPT
IDFGTIHIYCDQWGTNAAWGTEWIRKHAEDAKALNKPAILEEFGWKDRSTRDQVFTDWLN
VIEGNKYSGLELAGDNYWMLAGLQKDGSVYPDYDQYTVYWDVPNNPTATTAKLIQNHATN
MTNKNLANIRNKITPSTATYDKNVGKGQSISVTVDPKEGTFAAVKNGTRALVAGADYTVL
GNNIIIEKAYLDTLELGSTLLTFDISAGYDPVMAISVIDSNIVVVDSTITPTSVTFDKTV
PNDITVTITPNGNLLKTLFNLYTPLVQGTDYVISGNNVTIKQSYLSKLENGTTALTFDFN
QGTDPTLIVAVKKSVLPTGFVKADGTKFVVDGHPFYFAGANSYDLFTYGDGSSTSTTTDI
ETKFMYKSQIDNIMSQMASDGVTVLRTWGFSNETWHGFETAKGVYNEAEFMLFDYIMDSA
RRNGIKVIITLENYWEAYGGIDKKLQWEGLSGGSHTARAQFFTNENCKAHYKVYAEHFIN
RVNHYTGVAYKDDPTIFAWDLMNEPRYQDAKVNENATGVTLRKWVDEMGGYIKSIDPNHM
VCAGIEGHESRYGFGGDEGNPFIYIQQSPGVDFCSSHPYPDESWANLTPSQNASLMEKWI
SDAHNIVGKPFVAGEFNTHDNKEAYWVSVFGEIEEHNAAGGLFWEYNFRRLSNFTVVHGD
SILEYFKAHSARMSVKNVPENYVTPIKATFDRKADKQEDIALTMKLLEGNTLVAIKNGET
QLKQGIDYIVEEEQVVISKNYLGNLPFEDVTFTFDISSGYDPIVTIAITDSGILNSTVTP
EVAIFDKNIRKQADITISMTTNGNAFVALKNGATSLIGGTDYTVLGENVVIKKSYLQSQN
LGELSLKFDFNQGIDPMLKITVKDSTGEDIIDNFESYSATNPNLQTAYVKNSSGNVVAVM
LESAIKGEGSYSMSYAYTIGSPNYAGITKDLSGKSFVGYKGVAFWISPDGSNRDFTIQIK
ETSGEYWEATYKLSSLVATNVSLEFSKFTHPSWYSGGNGVFDLESITEYSMYIGQGLGTT
GGGTIYIDNIKLIDSAIAVIKGDVNGDKIVNAIDLAYLKKYILDNTILINMECADMNNDS
KVNAIDLAMLKKVILGVA
Download sequence
Identical sequences A0A173N029 D9STQ5
WP_010075885.1.47838 gi|302875920|ref|YP_003844553.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]