SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for gi|257791565|ref|YP_003182171.1| from Eggerthella lenta DSM 2243

Domain assignment details

Strong hits

Sequence: gi|257791565|ref|YP_003182171.1|

Domain Number 1 Region: 1505-1624
Classification Level  Classification                 E-value
Superfamily           Galactose-binding domain-like  0.0000000000000113
Family                Galactose-binding domain       0.0082

Domain Number 2 Region: 450-621
Classification Level  Classification        E-value
Superfamily           Fibronectin type III  0.0000000000000226
Family                Fibronectin type III  0.0037

Domain Number 3 Region: 208-255
Classification Level  Classification          E-value
Superfamily           Type I dockerin domain  0.0000196
Family                Type I dockerin domain  0.004
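
The strong hits above are numbered by the server, not by their position along the protein. As a minimal sketch (not part of the SUPERFAMILY server or its API), the following Python orders them from N- to C-terminus and counts how many residues they cover. The `DomainHit` class and the helper names are hypothetical, introduced only for illustration; the regions are assumed to be 1-based and inclusive, and the hits are assumed not to overlap.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One strong hit, copied from the listing above (hypothetical container)."""
    number: int        # "Domain Number" as shown on the page
    start: int         # first residue of the region (assumed 1-based, inclusive)
    end: int           # last residue of the region (assumed inclusive)
    superfamily: str
    evalue: float      # superfamily-level E-value

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


# Values transcribed from the "Strong hits" listing.
STRONG_HITS = [
    DomainHit(1, 1505, 1624, "Galactose-binding domain-like", 0.0000000000000113),
    DomainHit(2, 450, 621, "Fibronectin type III", 0.0000000000000226),
    DomainHit(3, 208, 255, "Type I dockerin domain", 0.0000196),
]

SEQUENCE_LENGTH = 1787  # "Sequence length" reported for this protein


def n_to_c_order(hits):
    """Return the hits sorted by start position (N-terminus first)."""
    return sorted(hits, key=lambda hit: hit.start)


def covered_residues(hits):
    """Total residues covered, assuming the regions do not overlap."""
    return sum(hit.length for hit in hits)


if __name__ == "__main__":
    for hit in n_to_c_order(STRONG_HITS):
        print(f"{hit.start}-{hit.end}\t{hit.superfamily}\t{hit.evalue:.3g}")
    covered = covered_residues(STRONG_HITS)
    print(f"{covered} of {SEQUENCE_LENGTH} residues fall in strong hits")
```

Sorting by start position places the dockerin domain first, then the fibronectin type III domain, then the galactose-binding domain, which is the order in which the domains occur along the sequence.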
 
Weak hits

Sequence: gi|257791565|ref|YP_003182171.1|

Domain Number - Region: 1660-1701
Classification Level  Classification                                                     E-value
Superfamily           Histone H3 K4-specific methyltransferase SET7/9 N-terminal domain  0.0628
Family                Histone H3 K4-specific methyltransferase SET7/9 N-terminal domain  0.006

Domain Number - Region: 153-185
Classification Level  Classification                              E-value
Superfamily           Starch-binding domain-like                  0.0804
Family                Rhamnogalacturonase B, RhgB, middle domain  0.038

Domain Number - Region: 1120-1186
Classification Level  Classification                                E-value
Superfamily           Metalloproteases ("zincins"), catalytic domain  0.0833
Family                Matrix metalloproteases, catalytic domain       0.037

Domain Number - Region: 671-793
Classification Level  Classification                              E-value
Superfamily           Galactose-binding domain-like               0.0957
Family                Family 6 carbohydrate binding module, CBM6  0.055
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

External link(s): gi|257791565|ref|YP_003182171.1|
Sequence length: 1787
Comment: coagulation factor 5/8 type domain-containing protein [Eggerthella lenta DSM 2243]
Sequence:
MKRTHRALSAALAVVLASSLCTIPAYAVESDAEPAPPGRVQADETAPNQSPEADHGPGNE
VPDASGTAEASGSKSQTRTENATEPTASPLQAENASRSAVAPPAGELRVTLTEGRPAATA
GTAYEVALADSDGVQVGAQQLNLSLEGDSWSVARFDGLADGSYTLSVRAPGIAPYSQKID
VKGDSSAVELYVGDLAADPQAPARIGVLVPGDVNGDGTVDDADASAIIDDIEAGAASPAC
DANGDGTVSLADLETAVSSFDRVAVEATVARSLPPAAVEAAPGEGTEVASGSLDEVVGAD
AKPVSLRASEAISDEHPVEVSFDLAKDSEQAPKLGGMVINVPASGDHASEAGRLLVELTD
GTVREIGISRAEARAAFFRSSAPTAVIDENGTVVVDFGGQIAVKKVTLVITKTQGGTNLA
EVSKVEFLNNMEERIPEPELNIPGELSAVAGSKQFSVSWKAETNVTGYELSISANGREEV
KRATGTSLSVTQFDEQKIKNGTVFTLKVRSTNGAWRSDWSGSVQVTPKAEKVPDAPEGIS
LSGAYRGFKASWKNMEDTDSYNLFYREKADEGGDFTKVPDLTKTSCSVDGLKDDTEYQVY
LTGANEIGESAPSVMAVVRTANVKPAQLPAYRLVNTENADGDYLNRIVSATYGRGSMIDS
PLDETSGEAKSAYGLFDDDYASYLKVADWDEGGFYPALGKGVTVTFDQPQTLGMVSFADV
QDGVPYGRVSVSYVDDAGAWQTVQANVQMRTGENGRKYVLAKLPQPVTSSKVRIGMGHSW
SSGNVVVAEMRFHAYDSLESDIMALYADDLHLELKDDVTSAAVDELQQRLDTPDPASGEF
NPYRVELQVELDNARKLLATQGLEGTVRVHNGISSARDNRSLGISGLNAWQPLGAVVAEG
DQIVVYTGAKGAVTGKEAPLRLVVSQQHPESSNVSKTIATLKVGRNEITIPSLSSLDVEH
GGQLYVEYTGDNDAADWGVRVSGAQAVPVLDLYQVDDPAERLARTTAYVQALEAYVPALE
ESHGKLHGAGGNAAVRYGYDPKNCVLNATDVMLDQMMYSVPAQQMLAGAGSGTADERAAR
LLASFDAMDQMMELFYQHKGLADSFDAGTDAAVIKSNLLPSQHLNIRYTRMFAGAFMYAA
GNHIGIEWDSVPGLGKGSPVAVDGDGEKASGSYFGWGIAHEIGHNINQAQYAYSEVTNNY
FAQLSQTDGTSASARFSYDEVYDRVTSGDEGRTGSVFTQLAMYWQLRLAYDAGGAYQLYD
TYQQAFDNRFFARVDSYARAPKTAPAPEGTELVLGGGEKQNIIRLASAAAERDLTDFFQR
WGFTADEATKAYVSQFPAEDRALCYANDDARAYARSHEEAGAVLDKDVAQAAAEANGSEV
ALSLGADASAGDSVLGYEIARLTTVDGAQQRQVVGFAQAAADGTASYVDNASSLGNRVVS
YEVTVVDKFLNRSKALVLDPIKLTGNGLQDKSGWTVSTNMSSAQDSVPPADDGDPDAPAP
KPASELMVDGKADTVYTGASDGEDPVITLDMGKPTEVTSLRYTLGAGAEGSAIGDYRIET
SLDGENYTLIKEGALSLDKDGRASLYFDNGKDPWICTYDARYLRITAVGQAGRQLSVAEL
DVFGPSGDDVFFLDANGGAAGILKSDFVYQKADENLDKQFIPKGSLVFTGSYKGNPAYNV
VVLYDENGNVVGGVNADGSTVASQIIMAPEPGDAMLGDVSEGSWVYWIEPGDLASMKLPK
QVRAELYRVDNALTNEGQRLVSDSLPGDVPDNLPDVELGGNATVAAD
Identical sequences: C8WI41, WP_015760799.1.14381, 479437.Elen_1817, gi|257791565|ref|YP_003182171.1|
