SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000010458 from Gasterosteus aculeatus 69_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000010458
Domain Number 1 Region: 871-1084
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.32e-106
Family Thrombospondin C-terminal domain 0.00000000312
Further Details:      
 
Domain Number 2 Region: 6-209
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.81e-38
Family Laminin G-like module 0.000000704
Further Details:      
 
Domain Number 3 Region: 748-869
Classification Level Classification E-value
Superfamily TSP type-3 repeat 6.28e-36
Family TSP type-3 repeat 0.00000028
Further Details:      
 
Domain Number 4 Region: 408-458
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.00000000000000196
Family TSP-1 type 1 repeat 0.00013
Further Details:      
 
Domain Number 5 Region: 607-691
Classification Level Classification E-value
Superfamily TSP type-3 repeat 0.000000000000144
Family TSP type-3 repeat 0.00011
Further Details:      
 
Domain Number 6 Region: 680-750
Classification Level Classification E-value
Superfamily TSP type-3 repeat 0.000000000000262
Family TSP type-3 repeat 0.00021
Further Details:      
 
Domain Number 7 Region: 289-347
Classification Level Classification E-value
Superfamily FnI-like domain 0.000000000000324
Family VWC domain 0.0041
Further Details:      
 
Domain Number 8 Region: 354-403
Classification Level Classification E-value
Superfamily TSP-1 type 1 repeat 0.000000000262
Family TSP-1 type 1 repeat 0.00084
Further Details:      
 
Domain Number 9 Region: 463-516
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000954
Family EGF-type module 0.024
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000010458   Gene: ENSGACG00000007883   Transcript: ENSGACT00000010480
Sequence length 1086
Comment pep:novel group:BROADS1:groupXVIII:6757557:6766662:-1 gene:ENSGACG00000007883 transcript:ENSGACT00000010480 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
SRDDNSVFDLFEHVQVPKKSQAVTLVKGDDPYSPAYKILNPDLIPPVSESAFRDLIDSIH
AERGFLLLLNFKQFKRTRGSLLTVEKRDGSGPVFEIVSNGKANTLDIVFSTENKQQVVSI
EDVDLATGHWKNITLFVQEDRAQLFAGCDEVNTAELDAPIQSILTLETPDSAQLRIGKGA
VRDRFMGVLQNVRFVFGASLEAVLRNKGCQSSLTADSMILENLNGSSAIRTEYTGHKTKG
EDLQMVCGFSCEDLVSMFKELKGLGVVVKELSSELRQMTDENKLIKNRVGVCLHNGIVHK
NKDEWTVDDCTECTCQNSATVCRKISCPLIPCANATVPDGECCPRCGTPSDYAEDGWSPW
SEWTHCSVSCGRGIQQRGRSCDRINNNCEGTSVQTRDCYLQECDKRFNGNWGPWSPWDAC
TLTCGSGVQTRKRMCNDPAAKYGGKECVGNAKDTQMCNKKACPVDGCLSNPCFAGAKCIS
FPDGSWKCGKCPVGYTGNGIKCKDVDECKEVPDACFEFNGVHRCENTDPGYNCLPCPPRY
AGPQPFGRGVAQAAAKKQVCTARNPCLDGSHDCNKNARCNYLGHFADPMYRCECKPGYAG
NGHICGEDTDLDGWPNADLVCVENATYHCKKDNCPNLPNSGQEDYDKDGIGDACDNDDDN
DGIPDDRDNCPFVYNPRQYDYDRDDVGDRCDNCPYNSNPDQTDTDNNGEGDACAVDIDGD
GIQNEKDNCPFVYNVDQKDTDLDGVGDMCDNCPLEHNPDQVDSDDDRVGDTCDSNQDIDE
DGHQNNLDNCPYIPNANQADHDKDGKGDACDHDDDNDGIPDDKDNCRLAFNPDQLDADGD
GRGDACKDDFDQDNVPDIYDVCPENFDISETDFRKFQMVPLDPKGTSQIDPNWVVRHQGK
ELVQTVNCDPGIAVGYHEFNSVDFSGTFFINTERDDDYAGFVFGYQSSSRFYVVMWKQIT
QTYWSNKPTKAQGYSGLSIKMVNSTTGPGEHLRNALWHTGNTPGQVRTLWHDPKNVGWKD
FTAYRWHLIHRPRTGLIRVVMYEGKKIMADSGNIYDKTYAGGRLGLFVFSQEMVYFSDLK
YECRGK
Download sequence
Identical sequences G3NYN7
ENSGACP00000010458 ENSGACP00000010458 69293.ENSGACP00000010458

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]