SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for gi|374296670|ref|YP_005046861.1| from Clostridium clariflavum DSM 19732

Domain assignment details

Strong hits

Sequence: gi|374296670|ref|YP_005046861.1|

Domain | Region | Superfamily | Superfamily E-value | Family | Family E-value
1 | 419-500 | Invasin/intimin cell-adhesion fragments | 5.1e-18 | Invasin/intimin cell-adhesion fragments | 0.008
2 | 254-334 | Invasin/intimin cell-adhesion fragments | 4.71e-17 | Invasin/intimin cell-adhesion fragments | 0.0073
3 | 70-146 | Type I dockerin domain | 3.92e-16 | Type I dockerin domain | 0.0014
4 | 338-417 | Invasin/intimin cell-adhesion fragments | 7.14e-16 | Invasin/intimin cell-adhesion fragments | 0.0061
5 | 168-248 | Invasin/intimin cell-adhesion fragments | 0.00000000000000102 | Invasin/intimin cell-adhesion fragments | 0.0075
6 | 509-673 | Fibronectin type III | 0.0000000000128 | Fibronectin type III | 0.0027
7 | 892-947,975-1074 | vWA-like | 0.00000089 | Integrin A (or I) domain | 0.032
8 | 685-748 | TSP type-3 repeat | 0.00000235 | TSP type-3 repeat | 0.0035
 
Weak hits

Sequence: gi|374296670|ref|YP_005046861.1|

Domain | Region | Superfamily | Superfamily E-value | Family | Family E-value
- | 1251-1492 | NHL repeat | 0.000759 | NHL repeat | 0.015
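
The region strings listed above can be turned into residue ranges for further analysis. The following is a minimal sketch, not part of the SUPERFAMILY server: the helper names `parse_region` and `region_length` are hypothetical, and it assumes that region coordinates are 1-based with both endpoints inclusive and that a comma separates the segments of a discontinuous domain (as in domain 7). The region values are copied from the strong hits listed above.

```python
def parse_region(region: str) -> list[tuple[int, int]]:
    """Parse a region string such as '892-947,975-1074' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region: str) -> int:
    """Count residues covered by a region (assumption: endpoints are inclusive)."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Strong-hit regions keyed by domain number, copied from the listing above.
strong_hits = {
    1: "419-500",
    2: "254-334",
    3: "70-146",
    4: "338-417",
    5: "168-248",
    6: "509-673",
    7: "892-947,975-1074",
    8: "685-748",
}

# Order the domains from N-terminus to C-terminus by their first start position.
n_to_c_order = sorted(strong_hits, key=lambda d: parse_region(strong_hits[d])[0][0])

if __name__ == "__main__":
    for domain in n_to_c_order:
        region = strong_hits[domain]
        print(f"domain {domain}: {region} ({region_length(region)} residues)")
```

Sorting by start position gives the order in which the domains occur along the protein, which differs from the domain numbering (the listing above is ordered by E-value).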
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s): gi|374296670|ref|YP_005046861.1|
Sequence length: 2436
Comment: Rhs family protein [Clostridium clariflavum DSM 19732]
Sequence:
MKKRRHFKKFLSMFLVLSLIISNFVVSDLYALDEYQTKTEKSENFDVKTVTEDVYAESPL
GFVRMSVQTTKVIYGDINGDSYCNSIDLAIIRSYLLGKIKSFDDIAPSGYDALKAADVNG
DGEINSIDYAFMRKYILGIIREFPAESKEPEPEEPENPEEPEEPEEPEYVEVTGINIRDK
VLKLNVGDSTTISATVLPTNASNKEIKWSSDNDEIVIVNQSGTVTAVKDGTANIIAETVE
GGFKTYCNVIVSQPASGMTLDKSTYSMKTGESIKLNAIFTPENTTNKKIKWSSSNSDVAS
VGQGGVVTAFSEGTAIITAEAEDGGYTSSCTIIVTQKTFGVRLSERVITINAGEKKELKA
LYLPENAGNKKIKWSSSDESIAKVSESGVVTAVKAGEAYINVEPEEGIYSDSCKVIVIQP
VKGISLDKSSLNLKVGFGYSLKVEFTPSDATERNVKWTSSNEKVAKVNEKGSVVALSAGT
ATITATSESGGFTAKCTVTVVPEAMSAPVITGERTSSGVKLSWNAVSGAKSYTIKRGEYI
GDLVDIKTDYTQTSFTDATAESDTTYYYVVCANSDSGISRNSNLLIIKSEPKAPKLFGIK
SAGNARLSWTNANGADRYEIYRSTTKGGQYTLLSKNLLSHSYEDKNIGGETYYYVVKAIN
EKGESEYSNEVEINDSAVKPLDFISNEDSDGDGISNIDELLYCTNPTQTDTDGDGLSDGY
EIILGTDPLSPDTDKDGLYDGAEVLLGTGPLVANSNVSEITSKRQAISRDGRISVDVLGD
GNFIIAPLQIFTSENPKFKDINGNYISGIAGEPIDIEAGGFVIIKADITCNYDKTNLNGI
SESDLGILYFNNSTNQFATISGSPDTANSIIKGQTEVLGSFVIGNRSLAPSSQPVDIIMF
IDESAQAKSNDPASIREIAAYVLGQYLTSNPSFENYVRVGILIYDDSISLNVANGDFMVE
EVIVDGGSEFTGDAGTIMQRLDYVLSNDLYARRNSGYPLPPRPGDVPSKGSILDRYFSSA
TNKKIIIGFSSGPLNRFNYITQAVRDLAPKGFVVDTVAVGANAQYNQLAIIAQNGAVPGK
SFWINQNNNMTEDELISQLSDMCAQLSEQLALQSYVEGTYRPQNSVNIEFSDEYKGMENS
YSNEWITGSGTNLLTGSYMETHKDIQIQSNGYDIVFERTYNSNSNKEDSIVGKGFRTNFD
AKLEEKVSSAKVTASLLNVRSGPSTNHSIIGKVSKGTNLEILENGAGGSGWHKINYNGNT
NAYVSATYVDEISTIEITYPTGTKVSFEVKGDGSYKAPFWSNDTLEKQGNEYIVTSDDMS
KYVFDVSTKRLIRLEDRVGNALRIIYDSKGQIDYVIDDVGRKLDFTFNANGKVESIEDPL
TNRSVKYNYSNGLLTKVIDSELKETTYEYDSNDRIVKVIDANNNTAVKIDYDVFGRIVRQ
YDAEGNVTYQVYSDATNERYIIDARGNESRVRFNLDMRVVEEVDALGNKVVYEYSYYDPG
SNKWEVIPDVDIRTLEDNKDPNTKAYTKYLEAGKDKRLKIKETMYDKRGNATTNEYDENN
NLVGTVDPLKQTTSMVYDPVYKHNLISKTDKKGNTTTYVYDNEGKYGPKGALLVKEIDPL
GNELIYDYYTNESGIKIKGLVKTVTEKKRVDQKDPNSELIEFKVTEYKYDDLYNNRTQII
DTLGNSTYEEYDAAGRLQKVTNARGYTTKYTYDKNDRIIVEEIFPQTNERKILKRTESIY
DNVGNKTFVIEERFAADRPDYPKEDLVTETVYDRNNRPVQIYDAEGYRVSYTYDEAGNKV
TETDKRGFTTIYKYDELNRVTEVIDPLNNTTTYEYDANGNVIKITDAKNRVTFIDYDELD
RKWKERIQYNEDGEEKEAVYEYLYDENDNPYREIDPNGKIIEYEYDALNRITKEVDGLGL
KDKNGNSMEKIVTYSYNYESVVEDGKTVKYEVMTEKDYLTSTKIRPIVIKNDALGRMRIK
IETNNGDKTIQEYDEVGNLKSVKDARGNVTSYEYDGLNNIIKVIDATRINYSEAVFDSVG
NVIEKIDRRNNKTEYRYNKLNQVIKTTTWYTDENGVKQEVVSAILYDEAGNKKIATDAEM
NSTIFEYDELGRLVAETNPMYNTRYYGYDAVGNQVWVTDWKEHDSSESNVHPIINSKGET
VYRLKTTYEYDDFDRLKTVISTEGEITSYTYDVIGNIKTVTVDGVRKNTYSYDKMYRLEK
MLDGEDRAETYDEYDLFGRLLQKTDRNGQVHIYTYDEYDNVEIHTVTRKVKENGVEKVVK
DVRETYYDALGNVVRTVDETGETIYNYNELNLLDNKVLPDGKTVEYEYDEEGNIKEIKDP
SGNVTTYLFDEMNRMKTVTTKDGTTTYAYTKNGNRKSLKLPNNVLTTYEYDARNVLIKLV
NQVGSSVDIYEYEYDENSLQTAKIEPKGKQALNTIT
Identical sequences: G8LYG8, gi|374296670|ref|YP_005046861.1|