SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000023260 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSGACP00000023260

Domain Number 1   Region: 9-148
Classification Level   Classification                                                       E-value
Superfamily            Galactose-binding domain-like                                        1.28e-41
Family                 Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)   0.00045

Domain Number 2   Region: 698-714,751-914
Classification Level   Classification                           E-value
Superfamily            Concanavalin A-like lectins/glucanases   1.7e-34
Family                 Laminin G-like module                    0.0031

Domain Number 3   Region: 322-491
Classification Level   Classification                           E-value
Superfamily            Concanavalin A-like lectins/glucanases   1.1e-27
Family                 Laminin G-like module                    0.0052

Domain Number 4   Region: 945-1150
Classification Level   Classification                           E-value
Superfamily            Concanavalin A-like lectins/glucanases   4.89e-26
Family                 Laminin G-like module                    0.012

Domain Number 5   Region: 122-310
Classification Level   Classification                           E-value
Superfamily            Concanavalin A-like lectins/glucanases   7.18e-23
Family                 Laminin G-like module                    0.0068

Domain Number 6   Region: 552-608
Classification Level   Classification                       E-value
Superfamily            Fibrinogen C-terminal domain-like    0.0000000748
Family                 Fibrinogen C-terminal domain-like    0.0064

Domain Number 7   Region: 524-559
Classification Level   Classification     E-value
Superfamily            EGF/Laminin        0.0000718
Family                 EGF-type module    0.015
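
The Region values above can be read as comma-separated residue ranges, so a region such as 698-714,751-914 describes one domain made of two segments. The following is a minimal Python sketch of how such strings might be parsed to get domain lengths and to find domains whose assigned regions overlap. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names are hypothetical and not part of any SUPERFAMILY tool.

```python
def parse_region(region):
    """Turn a string like '698-714,751-914' into [(698, 714), (751, 914)]."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Total residues covered, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


def regions_overlap(a, b):
    """True if any segment of region a shares a residue with any segment of b."""
    return any(
        s1 <= e2 and s2 <= e1
        for s1, e1 in parse_region(a)
        for s2, e2 in parse_region(b)
    )


# Region strings copied from the domain assignments listed above.
regions = {
    1: "9-148",
    2: "698-714,751-914",
    3: "322-491",
    4: "945-1150",
    5: "122-310",
    6: "552-608",
    7: "524-559",
}

lengths = {number: region_length(region) for number, region in regions.items()}

# Pairs of domain numbers whose regions share at least one residue.
overlapping = [
    (i, j)
    for i in regions
    for j in regions
    if i < j and regions_overlap(regions[i], regions[j])
]

print(lengths)
print(overlapping)
```

Under these assumptions, the overlap check is a quick way to spot neighbouring assignments that share residues at their boundaries.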
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000023260   Gene: ENSGACG00000017595   Transcript: ENSGACT00000023306
Sequence length 1285
Comment pep:known_by_projection group:BROADS1:groupIII:14763856:14803558:-1 gene:ENSGACG00000017595 transcript:ENSGACT00000023306 gene_biotype:protein_coding transcript_biotype:protein_coding
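
The comment line above looks like an Ensembl-style FASTA description: space-separated tokens, each a key followed by a colon and a value. Below is a minimal Python sketch of how it might be split into fields. Reading the group token as assembly, sequence region name, start, end and strand is an assumption based on that convention, not something this page states, and the function names are hypothetical.

```python
COMMENT = (
    "pep:known_by_projection group:BROADS1:groupIII:14763856:14803558:-1 "
    "gene:ENSGACG00000017595 transcript:ENSGACT00000023306 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split each token at its first colon into a key and a value."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Assumed layout: assembly:name:start:end:strand."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


fields = parse_comment(COMMENT)
location = parse_location(fields["group"])

# Genomic span of the record, assuming inclusive start and end coordinates.
span = location["end"] - location["start"] + 1

print(fields["gene"], fields["transcript"])
print(location, span)
```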
Sequence
CDGPLVSNLPPASFRSSSQLSSSHAPGFAKLNRRDGAGGWSPLTSDGYQWLEVDLGQRTK
IAAVATQGRYGSSDWLTSYLLMFSDTGHNWKQHRQEDSIGSFPGNSNADSVVQYKLQQPA
VARFLRLLPLRWNPSGRMGLRLEAYGCPHTSDVLSLRGGGLAYRLSPGPRRSSGEVVSLE
FKTARSSGTLLQAEGEGGLGLSLELERGKLLLLTRAGPPSSEPRRVASLGSLLDDQQWHR
LAVERRGSQLNVTVDEHAAERLRLPAEFPGWEAEQRASLAPPPVQLSVAAARGLVSERNF
DGCLENLVYNGLDLVELFKSKDRRVTVVGNVTFSCAESVSVAVTFPGPRSFLRLPGAAPS
LSEGVSVGLQFRTWNDAGLLLTFDLPEEGGVAWLYLSEARLRLQIHKAGGALLELSAGSA
LNDGLWHSVELTSGRGRLTIAVDTEERGVANAGHPVAAGSQLFFGGCPAAEDTQDCKNPF
GVFQGCMRLLRLEDQPVDLIKVQQRLLGNYSHLQIDMCGIIDSRCSPSRCEHGGRCTQSW
TAFRCNCSASGYSGATCHSSVYEQSCEAYKHNGNTSGHFYIDVDGSGPIRPQLVYCNMTE
NTWMEIRHNNTELTGVRPSAGVDQHSVHFDYSAEEEQLLAAISQSEHCEQELSYRCRKSR
LLNTPEGSPFSWWLGGPGPGRVQTYWGGAQPGSRQCACGLRGDCVDPQHYCNCDADRTDW
YSPEDSGLLTHEESLPVRSLVLGDIQRAGSEAAYRVGPLRCYGDKNFWNAAFFDKETSYL
HFPTFHGELSADISFLFKTTASSGVFLENLGIKDFIRIELSSSTRVVFSLDVGDGPLEVR
VESSVPLNDDRWHRVRAERNVREASLRLDALPAARRGAPADGHLHLQLNSQLFIGGTASR
QKGFRGCIRALQLNGVTLDLEERARITPGVRAGCPGHCGSYGSLCRNRGRCAERANGFLC
DCGLSAHTGAFCHTEVSASFLPGTTVSYTFKQPHESGRNSSARPSSVYSDTTLRGEDVSL
SFRTNQSPALLLYVSSHHGESLALLINKHEALEVRYKLDGSRAAEVLRSTARSLADGRLH
AVSVRRRTDAVSLQIDQRPKEDFNLTADGEFNAIKSLVLGRVHGSEDVDPELSPLASLGF
TGCLSVVRFNPVSPLKAALLHPHSSPVVITGPLVQSTCGSSASANPHAAEDTHHLSDQSG
SAGSGQPLVNAVRTDSALIGGVIAVAIFLIASGLALTARFLYRRKETYGNQEAGGVKQED
SGDFTFTTQRDSQSVSTENPKEYFI
Identical sequences G3Q070
ENSGACP00000023260 69293.ENSGACP00000023260 ENSGACP00000023260
