SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for WP_006856673.1.8661 from NCBI 2017_08 genome

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  WP_006856673.1.8661
Domain Number 1 Region: 317-423
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000000549
Family Collagen-binding domain 0.0092
Further Details:      
 
Domain Number 2 Region: 972-1151
Classification Level Classification E-value
Superfamily Fibronectin type III 0.000000000195
Family Fibronectin type III 0.0032
Further Details:      
 
Domain Number 3 Region: 454-566
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000085
Family Collagen-binding domain 0.0028
Further Details:      
 
Domain Number 4 Region: 716-827
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.00000000262
Family Collagen-binding domain 0.003
Further Details:      
 
Domain Number 5 Region: 838-959
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000085
Family Collagen-binding domain 0.0051
Further Details:      
 
Domain Number 6 Region: 579-690
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000000824
Family Collagen-binding domain 0.0057
Further Details:      
 
Weak hits

Sequence:  WP_006856673.1.8661
Domain Number - Region: 196-297
Classification Level Classification E-value
Superfamily Collagen-binding domain 0.000275
Family Collagen-binding domain 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) WP_006856673.1.8661
Sequence length 1153
Comment hypothetical protein [Roseburia intestinalis]; AA=GCF_000156535.1; RF=representative genome; TAX=536231; STAX=166486; NAME=Roseburia intestinalis L1-82; strain=L1-82; AL=Scaffold; RT=Major
Sequence
MKKRHQAGKRIAAWLMALSLCMTAVPAEQVVYAAGGQNETEVTELTELTEVPESAEAMKD
SETTEKTGAGEKAEAEEAGTESKAEVTETTESEETSEIPADTESTESSEIPADTESTESS
EIPADTESTESSEMPADTESTEIPETTENTESTQATEDSEMTEDTESTEALIGDTEAADT
EKEESEDILKKDGNVTPGSTYAAATSIGLNTTYNATTSRTSVTHWYKFTMSKAGMVQLKF
AHANLSSASNSTAWRIDFIADQVGGLVTVTSGMQDTQNQTAKIGLDAGTYYVQITGTGVL
TVGSAAYSFSVQYEDTYCETEQNNTIATADNYDRLGQTITGSISSASDTDYYKITSTTKG
YLSFQLQHDKVSGRLSTDIYSVAVCDAAGNTMYTMTSKKDEEKTESVNFGLDAGTYYLKV
SGIQYMDASGSLTVQGANGETYKLKASWTNADNWESESNDDINTADTMTSGKAVYGSLYG
VSDSDYYGFQTTKDGYIVINLQHSKVTGWQNKAIYAVTVCDTSGNSIYEMTSKAEDESTD
SIKLGLSAGKYYIKVAGQNAYYGGNYVIKTTFKACSTWEHESNDTYDTANTAVSGTTYSG
DIRTYSDVDYFKTSLSANGYINVKLTHPVVSGQETTNMFVLSVIRKVDKDQYTEVYTTKI
RGGDTSISTPNLGLPKGEYYIKIAGTGNTTGTLLSGTSYPVNYDVCIIAKTASDREVESN
DSAATANTVKNGKTYYGSTSSSSDKDYYKIKMSKAGYLQIKFGHKNSQSTASCYNVVLYN
KDNSEIYKFTNTGTETSYTSCKLGLDAGEYYVCVSQASTLYTGDYTICMTQKAASGWETE
NNGDWASADNIKVGKAVNGVITGYTSDEDCYRFTLTKAQYINFSLAHEKINDAGRSWYVT
LYNANGKRVSRKDDDHIYSYAGSTYTESKAVKLSKGTYYLKVQAFAKNAVEKEYTLCVNK
IENRKTSVTSVKSTAYNKLKVSWKVVPAATSYQIYRSTAKDGDYQNIKTINSVGTSSWTD
GSVKTGKTYYYKIKTVVKTQNGEQTSGFSNVKSAKAVPAKTTLKAKASDAKNVKLTWSKV
KGASGYEIYRSNSKDGKCSKVKTISKGSTTSYKNGKLKKSTTYYYKIRAYRKVNGKKVYG
SYSSVVSVKTKAK
Download sequence
Identical sequences C7G9H9
WP_006856673.1.8661

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]