SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000000338 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000000338
Domain Number 1 Region: 573-817
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 1.3e-65
Family Eukaryotic proteases 0.000016
Further Details:      
 
Domain Number 2 Region: 313-418
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 4.97e-28
Family Spermadhesin, CUB domain 0.00064
Further Details:      
 
Domain Number 3 Region: 200-306
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 1.57e-21
Family Spermadhesin, CUB domain 0.00095
Further Details:      
 
Domain Number 4 Region: 58-157
Classification Level Classification E-value
Superfamily SEA domain 0.00000000000562
Family SEA domain 0.011
Further Details:      
 
Domain Number 5 Region: 496-532
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.000000000288
Family LDL receptor-like module 0.0012
Further Details:      
 
Domain Number 6 Region: 459-494
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000223
Family LDL receptor-like module 0.00098
Further Details:      
 
Domain Number 7 Region: 420-460
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000236
Family LDL receptor-like module 0.0014
Further Details:      
 
Domain Number 8 Region: 539-585
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000000301
Family LDL receptor-like module 0.0012
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000000338   Gene: ENSGACG00000000267   Transcript: ENSGACT00000000338
Sequence length 818
Comment pep:known_by_projection scaffold:BROADS1:scaffold_114:165945:176030:1 gene:ENSGACG00000000267 transcript:ENSGACT00000000338 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MQFLPASDPQRLQKKPGRRRKTWIGVGLLLAATSVALLTGLLVWHFHREIRSDIRVKKLY
IGSMEIQNHVYVEDYEDPESSGFSHLASLVGQQLKLIYSKNSVLAKHFKGSSVQAFSEGG
GVGGGVVAYYQSEFDVHVLQQASLDAAVESLEVGGQQRRGRLLLRPSDALEVNNVVSTAI
DPRMTRKSLSVRKPFNVHVGGGGEVQSPGFPDSSYPPNVYLQWRLRAKPGHRVRLDFHTL
ILEDDCQKDFVQIYDSLAPLRERVLTEQCGYPHESLSFISSGNVLLLLFITSEQQNFPGF
RANFSQIPAGGLECGGTLREHQGSFSSPFFPSNYPPKTECVWNIQAPKEMFLKLHFKKFF
LGNSSSQCSNDYVEVNGQRLCGRKPESTVVTSRSNKMSIMFTSDSSYVDQGFTAEYEAFV
PTNPCPGRFQCSNNLCINQTLQCDGWNDCGDDSDEDDCKCKASQMKCRNGRCKPKFWECD
GFDDCGDGSDEENCGKCKAGEFLCRNGRCVPQKSKCNGKDDCSDGSDESRCEKSLVLQQC
SEFTFRCRNGRCISKLNPECDGELDCEDGSDEKDCTCGMRPYQSSRIVGGEASREGEWPW
QVSLHVAGTGHVCGGSVLSNRWLLTAAHCVQDNGPNKYSQADQWEALLGLHMQSQTNEWT
VRRKVRRIIAHPDYNSFTYDNDLAVMELDASVTLNQNIWPICLPSATYDFPAGRMLVGPS
VLHKEGGGRLVCQKSTAKTQMDLARNQDGGDHEAEPRTEVQKLTQGDSGGPLSVTAPGGR
VYLAGVVSWGDGCGRRNRPGVYTRITEYRGWITEQTGV
Download sequence
Identical sequences G3N4W8
ENSGACP00000000338 ENSGACP00000000338 69293.ENSGACP00000000338

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]