SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for D4KLK8 from Uniprot 2018_03 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  D4KLK8
Domain Number 1 Region: 317-423
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000549
Family Collagen-binding domain 0.0092
Further Details:      
 
Domain Number 2 Region: 972-1151
Classification Level Classification E-value
Superfamily Fibronectin type III 0.000000000195
Family Fibronectin type III 0.0032
Further Details:      
 
Domain Number 3 Region: 454-566
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000157
Family Collagen-binding domain 0.0023
Further Details:      
 
Domain Number 4 Region: 716-827
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000262
Family Collagen-binding domain 0.003
Further Details:      
 
Domain Number 5 Region: 838-959
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000085
Family Collagen-binding domain 0.0051
Further Details:      
 
Domain Number 6 Region: 579-690
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000824
Family Collagen-binding domain 0.0057
Further Details:      
 
Weak hits

Sequence:  D4KLK8
Domain Number - Region: 196-297
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000222
Family Collagen-binding domain 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) D4KLK8
Sequence length 1153
Comment (tr|D4KLK8|D4KLK8_9FIRM) Uncharacterized protein {ECO:0000313|EMBL:CBL07760.1} KW=Complete proteome OX=657315 OS=Roseburia intestinalis M50/1. GN=ROI_04650 OC=Roseburia.
Sequence
MKKRHQAGKRIAAWLMALSLCMTAVPAEQVVYAAGGQNETEVTELTELTEVPESAEAMKD
SETTEKTGAGEKAETEEAGTESKAEVTETTESEETSEIPADTESTESSEIPADTESTESS
EIPAGTESTENSEMPTDTESTEIPETTENTESTQATEDSEMTDDTESTEALIGDTEAADT
EKEESEDILKKDGNVTPGSTYAAATSIGLNTTYNATTSRTSVTHWYKFTMSKAGMVQLKF
AHVNLSSASNSTAWRIDFIADQVGGLVTVTSGMQDTQNQTAKIGLDAGTYYVQITGTGVL
TVGSAAYSFSVQYEDTYCETEQNNTIATADNYDRLGQTITGSISSASDTDYYKITSTTKG
YLSFQLQHDKVSGRLSTDIYSVAVCDAAGNTMYTMTSKKDEEKTESVNFGLDAGTYYLKV
SGIQYMDASGSLTVQGANGETYKLKASWTNADNWESESNDDINTADTMTSGKAVYGSLYG
VSDSDYYGFQTTKDGYIVINLQHSKVTGRQNKAIYAVTVCDTSGNSIYEMTSKAEDESTD
SIKLGLSAGKYYIKVAGQNAYYGGNYVIKTTFKACSTWEHESNDTYDTANTAVSGTTYSG
DIRTYSDVDYFKTSLSANGYINVKLTHPVVSGQETTNMFVLSVIRKVDKDQYTEVYTTKI
RGGDTSISTPNLGLPKGEYYIKIAGTGNTTGTLLSGTSYPVNYDVCIIAKTASDREVESN
DSAATANTVKNGKTYYGSTSSSSDKDYYKIKMSKAGYLQIKFGHKNSQSTASCYNVVLYN
KDNSEIYKFTNTGTETSYTSCKLGLDAGEYYVCVSQASTLYTGDYTICMTQKAASGWETE
NNGDWASADNIKVGKAVNGVITGYTSDEDCYRFTLTKAQYINFSLAHEKINDAGRSWYVT
LYNANGKRVSRKDDDHIYSYAGSTYTESKAVKLSKGTYYLKVQAFAKNAVEKEYTLCVNK
IENRKTSVTSVKSTAYNKLKVSWKVVPAATSYQIYRSTAKDGDYQNIKTINSVGTSSWTD
GSVKTGKTYYYKIKTVVKTQNGEQTSGFSNVKSAKAVPAKTTLKAKASDAKNVKLTWSKV
KGASGYEIYRSNSKDGKCSKVKTISKGSTTSYKNGKLKKSTTYYYKIRAYRKVNGKKVYG
SYSSVVSVKTKAK
Download sequence
Identical sequences D4KLK8
gi|479202210|ref|YP_007830801.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]