SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000023258 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000023258
Domain Number 1 Region: 564-580,617-780
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.36e-34
Family Laminin G-like module 0.0031
Further Details:      
 
Domain Number 2 Region: 194-357
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 4.53e-27
Family Laminin G-like module 0.0052
Further Details:      
 
Domain Number 3 Region: 811-1016
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 4.17e-26
Family Laminin G-like module 0.012
Further Details:      
 
Domain Number 4 Region: 41-186
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.8e-18
Family Laminin G-like module 0.0068
Further Details:      
 
Domain Number 5 Region: 418-474
Classification Level Classification E-value
Superfamily Fibrinogen C-terminal domain-like 0.0000000658
Family Fibrinogen C-terminal domain-like 0.0064
Further Details:      
 
Domain Number 6 Region: 390-425
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000639
Family EGF-type module 0.015
Further Details:      
 
Weak hits

Sequence:  ENSGACP00000023258
Domain Number - Region: 2-54
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.000907
Family Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain) 0.0038
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000023258   Gene: ENSGACG00000017595   Transcript: ENSGACT00000023304
Sequence length 1151
Comment pep:known_by_projection group:BROADS1:groupIII:14763856:14820222:-1 gene:ENSGACG00000017595 transcript:ENSGACT00000023304 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MFSDTGHNWKQHRQEDSIGSFPGNSNADSVVQYKLQQPAVARFLRLLPLRWNPSGRMGLR
LEAYGCPHTSDVLSLRGGGLAYRLSPGPRRSSGEVVSLEFKTARSSGTLLQAEGEGGLGL
SLELERGKLLLLTRAGPPSSEPRRVASLGSLLDDQQWHRLAVERRGSQLNVTVDEHAAER
LRLPAEFPGWEAEQGNVTFSCAESVSVAVTFPGPRSFLRLPGAAPSLSEGVSVGLQFRTW
NDAGLLLTFDLPEEGGVAWLYLSEARLRLQIHKAGGALLELSAGSALNDGLWHSVELTSG
RGRLTIAVDTEERGVANAGHPVAAGSQLFFGGCPAAEDTQDCKNPFGVFQGCMRLLRLED
QPVDLIKVQQRLLGNYSHLQIDMCGIIDSRCSPSRCEHGGRCTQSWTAFRCNCSASGYSG
ATCHSSVYEQSCEAYKHNGNTSGHFYIDVDGSGPIRPQLVYCNMTENTWMEIRHNNTELT
GVRPSAGVDQHSVHFDYSAEEEQLLAAISQSEHCEQELSYRCRKSRLLNTPEGSPFSWWL
GGPGPGRVQTYWGGAQPGSRQCACGLRGDCVDPQHYCNCDADRTDWYSPEDSGLLTHEES
LPVRSLVLGDIQRAGSEAAYRVGPLRCYGDKNFWNAAFFDKETSYLHFPTFHGELSADIS
FLFKTTASSGVFLENLGIKDFIRIELSSSTRVVFSLDVGDGPLEVRVESSVPLNDDRWHR
VRAERNVREASLRLDALPAARRGAPADGHLHLQLNSQLFIGGTASRQKGFRGCIRALQLN
GVTLDLEERARITPGVRAGCPGHCGSYGSLCRNRGRCAERANGFLCDCGLSAHTGAFCHT
EVSASFLPGTTVSYTFKQPHESGRNSSARPSSVYSDTTLRGEDVSLSFRTNQSPALLLYV
SSHHGESLALLINKHEALEVRYKLDGSRAAEVLRSTARSLADGRLHAVSVRRRTDAVSLQ
IDQRPKEDFNLTADGEFNAIKSLVLGRVHGSEDVDPELSPLASLGFTGCLSVVRFNPVSP
LKAALLHPHSSPVVITGPLVQSTCGSSASANPHAAEDTHHLSDQSGSAGSGQPLVNAVRT
DSALIGGVIAVAIFLIASGLALTARFLYRRKETYGNQEAGGVKQEDSGDFTFTTQRDSQS
VSTENPKEYFI
Download sequence
Identical sequences G3Q068
ENSGACP00000023258

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]