SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000004140 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGACP00000004140
Domain Number 1 Region: 697-864
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 2.29e-79
Family | Thrombospondin C-terminal domain | 0.000000573
 
Domain Number 2 Region: 574-695
Classification Level | Classification | E-value
Superfamily | TSP type-3 repeat | 1.96e-34
Family | TSP type-3 repeat | 0.00000514
 
Domain Number 3 Region: 493-576
Classification Level | Classification | E-value
Superfamily | TSP type-3 repeat | 1.28e-18
Family | TSP type-3 repeat | 0.0000655
 
Domain Number 4 Region: 198-243
Classification Level | Classification | E-value
Superfamily | Assembly domain of cartilage oligomeric matrix protein | 0.00000000000000248
Family | Assembly domain of cartilage oligomeric matrix protein | 0.0021
 
Domain Number 5 Region: 418-517
Classification Level | Classification | E-value
Superfamily | TSP type-3 repeat | 0.000000000034
Family | TSP type-3 repeat | 0.00017
 
Domain Number 6 Region: 282-321
Classification Level | Classification | E-value
Superfamily | EGF/Laminin | 0.00000028
Family | EGF-type module | 0.017
 
Domain Number 7 Region: 335-378
Classification Level | Classification | E-value
Superfamily | EGF/Laminin | 0.00000488
Family | EGF-type module | 0.011
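The strong hits above are numbered by E-value rather than by position along the sequence. As a minimal sketch (not part of the SUPERFAMILY output), the regions can be re-sorted into N-to-C-terminal order and checked for overlapping boundaries. The names `strong_hits`, `by_position`, and `overlaps` are hypothetical helpers introduced for illustration; the coordinates are copied from the listing above.

```python
# Strong-hit regions copied from the assignment details:
# (domain number, start, end, superfamily)
strong_hits = [
    (1, 697, 864, "Concanavalin A-like lectins/glucanases"),
    (2, 574, 695, "TSP type-3 repeat"),
    (3, 493, 576, "TSP type-3 repeat"),
    (4, 198, 243, "Assembly domain of cartilage oligomeric matrix protein"),
    (5, 418, 517, "TSP type-3 repeat"),
    (6, 282, 321, "EGF/Laminin"),
    (7, 335, 378, "EGF/Laminin"),
]


def by_position(hits):
    """Return the hits ordered by start residue (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def overlaps(hits):
    """Return (domain_a, domain_b, shared_residues) for neighbouring hits
    whose regions share residues. Only neighbours in positional order are
    compared, which is an assumption that suffices for this listing."""
    ordered = by_position(hits)
    shared = []
    for left, right in zip(ordered, ordered[1:]):
        n_shared = left[2] - right[1] + 1  # inclusive coordinates
        if n_shared > 0:
            shared.append((left[0], right[0], n_shared))
    return shared


for number, start, end, superfamily in by_position(strong_hits):
    print(f"{start:>4}-{end:<4} domain {number}: {superfamily}")

for a, b, n in overlaps(strong_hits):
    print(f"domains {a} and {b} share {n} residue(s)")
```

Comparing only positional neighbours keeps the sketch short; a general overlap check would compare every pair of regions.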
 
Weak hits

Sequence:  ENSGACP00000004140
Domain Number - Region: 24-162
Classification Level | Classification | E-value
Superfamily | Concanavalin A-like lectins/glucanases | 0.00554
Family | Laminin G-like module | 0.0096
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000004140   Gene: ENSGACG00000003156   Transcript: ENSGACT00000004154
Sequence length 864
Comment pep:known_by_projection group:BROADS1:groupXX:446124:451932:1 gene:ENSGACG00000003156 transcript:ENSGACT00000004154 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
VVDVLGLQDSKQMAAAVERLSVGLSALSDVYVASTFRLPAKLGGVLLGVYSKQDNRKYLE
VAAMGKISKVVVRYVRADGRIHTVNLQSAHLSEGRTTSIILRLGGLRRDNMNMELYVNCR
LADSSQGLPPLVALPREAEQVEFRHGQKAYGRMQGAVESLRLALGGSVATAGALMDCPFQ
GDASAYNAVSGNINSILGDHTKALIGQLIIFNQILGELRQDIREQVKEMSLIRNTILECQ
VCGFHEPRSRCSPHPCYKGVSCTESLNYPGFTCGPCPPGTTGNGTHCQDMDECELQPCFS
PDSCVNTVGGFICHPCPPGLWGAPLSGTGMDYAKTHRQDCVDIDECLDLPDACVMNSVCI
NTLGSYKCGGCKPGFLGNQTSGCFPRQSCAALTFNPCDSNAHCTMERNGEVACRGETPSP
CNVGWAGNGNTCGPDTDIDGYPDRPLPCMDNHKHCKQDNCVSTPNSGQEDADNDGIGDQC
DEDADGDRIKNVEDNCRLVPNKDQQNSDTDSFGDACDNCPSVPNIDQKDTDSNGQGDACD
QDIDGDGIPNVLDNCPNVPNPMQTDRDRDGVGDACDSCPELSNPMQTDVDNDLVGDVCDT
NQDTDGDGLQDSRDNCPDIPNSSQLDSDNDGLGDDCDHDDDNDGVLDDYDNCRLIVNPNQ
KDSDVNGVGDVCENDFDNDAVMDLVDVCPESAEVTLTDFRAYQTVILDPEGDAQIDPTWV
VLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTITDDDYAGFIFGYQDSSSFYVV
MWKQTEQTYWQSVPFRATAQPALQLKAVKSRTGPGEFLRNALWHTGDTAGEVKLLWKDPR
NVGWKDKTSYRWHLSHRPQVGYIR
Identical sequences G3NFN5
69293.ENSGACP00000004140 ENSGACP00000004140 ENSGACP00000004140
