SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000015829 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000015829
Domain Number 1 Region: 10-149
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 1.42e-43
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.00039
Further Details:      
 
Domain Number 2 Region: 675-691,727-889
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.06e-34
Family Laminin G-like module 0.0026
Further Details:      
 
Domain Number 3 Region: 293-477
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.65e-29
Family Laminin G-like module 0.0038
Further Details:      
 
Domain Number 4 Region: 920-1124
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.28e-26
Family Laminin G-like module 0.0074
Further Details:      
 
Domain Number 5 Region: 122-293
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 2.49e-22
Family Laminin G-like module 0.0071
Further Details:      
 
Domain Number 6 Region: 536-594
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 0.0000000000774
Family Fibrinogen C-terminal domain-like 0.0056
Further Details:      
 
Domain Number 7 Region: 508-544
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000152
Family EGF-type module 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000015829   Gene: ENSGACG00000011956   Transcript: ENSGACT00000015860
Sequence length 1253
Comment pep:known_by_projection group:BROADS1:groupVI:16324890:16344210:1 gene:ENSGACG00000011956 transcript:ENSGACT00000015860 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
CRRPLVSGLPPPAFSSSSRASGSHGPAFAKLNRREGAGGWSPLSSDPHQWLEVDLGGRTR
ITAVATQGRYGSSDWLTSYQLMFSDAGHNWKQYRQEDSRGALPGNTNADSVVLHKLQHPV
IARHLRLLPLDWNPRGRIGLRLETYGCSYGSHVVSLGGRGGLLFRLSPGPRRTSGASVSL
TFKTLRNSGVLLRAEGRSEHGLVLQLEEGKLLLLLLRKGRTASPDGRRLVSLGSLLDDQH
WHHVSMELGVDGLNLTVDGSSLWVRLPPRLAHWDHQQVSVGHFHGCLENLVYNGVNLLEL
AEKKDQRVTAEGDVIFSCAESVPVAMTFTSAQSFLRLPLATEAPPSTRTSAGLQFRTWNE
AGLLLTFDLPEQEGAAWLYLSGARLHLRIHKAGRAPLELSAGSALNDGQWHSAELTSRRG
HLSVSVDGGEGATAHASPPFLVAMGGQLFFGGCPADGGRQDCVNPFKAFQGCMRLLTVHN
QPVDPIPLQQRLMGNYSHLQIDMCGILDRCSQSHCEHGGSCRQSWSSFHCNCSGTGYSGA
TCHSSIYEQSCEAYKHRGNASGYYHIDVDGSGPIRARLMYCNMTEDHTWTVIQHNNTALT
RISAHFAYASDEEQLAAVIGQSERCEQELSYDCRKSRLLNTADGPLLSWWVGGPGDGQVQ
TYWGGAPPGSQRCSCGLQQNCVDPRHACNCDADRSEWAKDSGLLTHKETLPVRSLVLGDV
QRSDSEGAYRVGPLRCHGDKNVWNAALFDQETSYLHFPTFHGELSADISFLFKTSSSSGV
FLENLGIKDFIRIELRSSTEVLFSFDVGDGPTEVAVTTGFPLDDDRWHRVRAERNVRAAS
LRLDELPAATREAPADGHVHLQLNSQLFIGGTASRQRGFRGCMRSLQLNGVPLDLEARAR
ITPGVQAGCPGHCSSYGSLCQNRGRCVERTSSFSCDCSSSAHTGAFCDRAEVSASFKSET
SVSYAFNELDRGSDGSPSPAVSSDLDLRAEDLSLSFRSLQSPALLLFVGSSRREYLALLL
NQHDMLEVRYRLDSSRAADVLRSKVTNLADGRLHAVVVSRRAAAAWVQIDQNSKEDFNLT
SDVEFNGVRSLVLGRVHDSAELDPGLSRLASLGFTGCLSGVLFNSVSPLKAALLHPDTSP
VVVTGPLVQSICGSTSADPHAAGTRHHLSGQSGSVGTGQPVENAVRRDSALIGGGIAVAI
FVTVSALAVTARVLYRGKGTCRRPAGKTAKPDDGDGGAGSRGGPSENQREYFI
Download sequence
Identical sequences G3PE05
ENSGACP00000015829 ENSGACP00000015829 69293.ENSGACP00000015829

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]