SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000005240 from Gasterosteus aculeatus 69_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000005240
Domain Number 1 Region: 1023-1116
Classification Level Classification E-value
Superfamily C-type lectin-like 3.67e-31
Family Link domain 0.00018
Further Details:      
 
Domain Number 2 Region: 561-704
Classification Level Classification E-value
Superfamily FAS1 domain 7.98e-21
Family FAS1 domain 0.0052
Further Details:      
 
Domain Number 3 Region: 1114-1275
Classification Level Classification E-value
Superfamily FAS1 domain 7.46e-19
Family FAS1 domain 0.0082
Further Details:      
 
Domain Number 4 Region: 358-403,437-546
Classification Level Classification E-value
Superfamily FAS1 domain 3.27e-17
Family FAS1 domain 0.0048
Further Details:      
 
Domain Number 5 Region: 3-130
Classification Level Classification E-value
Superfamily FAS1 domain 0.00000000000275
Family FAS1 domain 0.0041
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000005240
Domain Number - Region: 954-990
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000209
Family Merozoite surface protein 1 (MSP-1) 0.071
Further Details:      
 
Domain Number - Region: 255-382
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.00022
Family Growth factor receptor domain 0.012
Further Details:      
 
Domain Number - Region: 889-1036
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.00121
Family Growth factor receptor domain 0.02
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000005240   Gene: ENSGACG00000003984   Transcript: ENSGACT00000005255
Sequence length 1276
Comment pep:novel group:BROADS1:groupXVII:1458284:1472606:-1 gene:ENSGACG00000003984 transcript:ENSGACT00000005255 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
EPPTLMAFLNSSTFTLFAKHALMYNLSADLSGLDFTLLLPTDEAIRQHLSTTNSSLLDAD
VFKYHVILNELLFPDHLSDGALKSTLLGADYQVQFHLNHNNQTAVNEVPLDGGVTETQRG
VILVLPQVLMIHRNRCSQEVNLQVQGRCAACEGPPRCLFNYKPIKKQFPDNMKSNCKYRK
RVGSRRVSVPGCVIKCLRLTKDHSCCPGYYGHECFKCPGGIGSWCSNHGECQDGNLGNGE
CRCFEGFHGTACEDCEPGRYGATCSSKCACDHGKCEDGLAGSGRCVCYKGWRGSSCSIEI
KDDACGGGCDENGNCVTGPKGTAAACVCVAGYEGNGTYCTELDLCSRSNGGCSEFAVCLK
VSAGERTCTCGEGYTGDGVVCLERRPTGGVPPTREFLLPSNGVVTCWVLSLVKGVSSERP
LCGLQRISSLCFLCVSQKSNSRVLYGDGPFTAFIPLEETNRNFSFEEWDQSGRLSELVRY
HILSCETLTLSDLKTTKVAVATSGHTLHFSLREGSVWVNNGTRIVRSDYVTSNGVIHHLD
ALLTPYRLQDKPRLPADSVTMNYTSAAAFYGYSRFYQLVQDVGLIPVLQMSIHQPLTMFW
PTDEALDSLPAARKHWLSSPDHQEQVASLVKAHIIRNSRVLGIGQPDKISKLRSMHGSTI
KYSCDRTLVQGAVLINDNAAKVVERYMNFKEGVAYGIDQLLEPPGLGAFCDSPENRTTSG
RCGRCPFPPACPFRHLDTGKTELCYNRFSSFGRRMMGCKKTCQFSSWVQKCCKNHYGRDC
QVCPGGLEAPCGDHGVCDDGMKGSGRCLCHEGFVGKACNLCSLRHYGPNCTACECGQMGR
CDGGMEGSGKCVCAPGWGGERCQIDIGSIPEECRRCHAQADCVPGSGCRCKPGFQGNGTF
CDPEPPPDLCSEYNGGCHQNADCNQTGLLVNCTCRRGYRGDGYSCEPINRCIEEQNGGCS
DFASCKFTGPNERRCECLPGYLGNGVQCLEKVVPPVDRCLEDNGGCDPAADCKDLHYHAN
TAGVFHLRSPDGKYKMNFSQADAACRAEGAALATFKQLGDAQQLGMHLCVAGWMEAGKVG
YPIRFPSVRCGDNHVGLVIYKEPVDQSSKYDAYCYRLKDVSCVCPAGYIGNGDFCNGVLT
SVLASYSNFSIFYKCLLDYSGWSSEGKQLVDFMSHRKSEVTLFVPHNAGFGPNQNLSGRD
LEYHISNNHSVRPFKELRHQEVIVSRLGFNLTVTHGNNEVSDQSIKMVNRQMLLEWDIPA
VNGIIHVIEAPLTAPP
Download sequence
Identical sequences G3NIT0
ENSGACP00000005240 ENSGACP00000005240 69293.ENSGACP00000005240

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]