SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000023319 from Gasterosteus aculeatus 76_1


Domain assignment details

Strong hits

Sequence: ENSGACP00000023319

Domain Number 1 Region: 39-178
Classification Level  Classification                                                      E-value
Superfamily           Galactose-binding domain-like                                       1.35e-41
Family                Discoidin domain (FA58C, coagulation factor 5/8 C-terminal domain)  0.00045

Domain Number 2 Region: 732-748,783-936
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  4.01e-33
Family                Laminin G-like module                   0.0033

Domain Number 3 Region: 342-525
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  1.12e-30
Family                Laminin G-like module                   0.0061

Domain Number 4 Region: 152-340
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  1e-26
Family                Laminin G-like module                   0.0092

Domain Number 5 Region: 967-1172
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  1.85e-26
Family                Laminin G-like module                   0.01

Domain Number 6 Region: 583-642
Classification Level  Classification                     E-value
Superfamily           Fibrinogen C-terminal domain-like  0.000000000921
Family                Fibrinogen C-terminal domain-like  0.0051

Domain Number 7 Region: 553-590
Classification Level  Classification   E-value
Superfamily           EGF/Laminin      0.0000147
Family                EGF-type module  0.012

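The region strings listed above can be read programmatically. The following is a minimal Python sketch, not part of the SUPERFAMILY server or its tooling. It assumes that region coordinates are 1-based and inclusive and that a comma separates the segments of a discontinuous domain, as in 732-748,783-936. The function names parse_region and covered_residues are hypothetical helpers introduced only for illustration.

```python
def parse_region(region):
    """Parse a region string such as '732-748,783-936' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def covered_residues(region):
    """Count residues in a region, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the strong hits listed on this page.
regions = {
    1: "39-178",
    2: "732-748,783-936",
    3: "342-525",
    4: "152-340",
    5: "967-1172",
    6: "583-642",
    7: "553-590",
}

# Sequence length as reported in the protein sequence section.
SEQUENCE_LENGTH = 1307

# List the domains in order of their first residue along the sequence.
for number, region in sorted(regions.items(), key=lambda item: parse_region(item[1])[0][0]):
    segments = parse_region(region)
    # Every segment should end inside the reported sequence length.
    assert all(end <= SEQUENCE_LENGTH for _, end in segments)
    print(number, segments, covered_residues(region))
```

Sorting by the first segment start lists the domains in sequence order rather than in the E-value order used on the page.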

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000023319   Gene: ENSGACG00000017644   Transcript: ENSGACT00000023365
Sequence length 1307
Comment pep:known_by_projection group:BROADS1:groupIII:14929551:14958963:1 gene:ENSGACG00000017644 transcript:ENSGACT00000023365 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MLHLIVHTHISCCRVSQQQRYNILFLLTEVCDSPLVSNLPPASFRSSSQLSASHAPSFAK
LNRRDGAGGWSPLTSDGYQWLEVDLGQRTKIAAVATQGRYGSSDWLTSYLLMFSDTGHNW
KQHRQEDSIGSFPGNSNADSVVQYKLQQPAVARFLRLLPLHWNPSGRMGLRLEAYGCPHT
SHVLGLDGSSGLLYRPSPGTRGSVREVVTLQFKTLRNSGTLLRAEGGGGGLGLGLELERG
RLLLLTRAGFCSPPSSEPRRVASLGSLLDDQQWHRLAVERRGSQLNVTVDEHAERLRLPA
EFAGWEAEQLSVGTVPSRGSQSPDVSKGNFHGCLENLKFNSVNVVELAKNGDLRVAVRGN
VTFSCAESVSVAVTFPGPRSFLRLPGATPSSSGGVSVGFQIRTWNKAGLLLTFDLPQQGG
VVWLYLSEARLRLQIHKAGRALLELSAGSALNDGQWHSVALTSSRGRLSIGVDGEGGGSA
QAAPPYAVAVESHLFFGGCPAEDNEPRCRNPFNVFLGCMRLLSLNHLMVDLMMVQKKQLG
IFSHLQIDMCGIIDRCSPSRCEHGGRCTQSWTAFRCNCSASGYSGATCHSSVYEQSCEAY
KHNGNTSGHFYIDVDGSGPIRPQLVYCNMTGEEENTWMEIQHNNTEVTRVRPSPGARQRS
LHFDYSTGDEQLSAAIGQSEHCEQELSYRCRKSRLLNTPEGSPFSWWLGGPGPGRVQTYW
GGAQPGSRQCACGLRGDCVDPQHYCNCDADRTEWAEDSGLITHKESLPVRSLVLGDVQRP
ESEAAYRVGPLRCHGDRNFWNAAFFDKETSYLHFPTFHGELSADISFLFKTTASSGVFLE
NLGIKDFIRIELSSSTRVVFSLDVGDGPLEVRVESSVPLNDDRWHRVRAERNVREASLRL
DADGHLHLQLNSQLFIGGTASRQKGFRGCIRALQLNGVTLDLEERARITPGVRAGCPGHC
GSYGSLCRNRGRCAERANGFLCDCGLSAHTGAFYLREVSASFKSGTTVSYTFKEPHESGR
NSSARPSSVHSDTTLRGEDVSLSFRTNQSPALLLYVSSRHGESLALLINKHDKLEVRYKL
DGRRGAEVLRSAARSLADGRLHAVSVRRRADGVSLQIDQHAREDFNLTSDGELNGIKSLV
LGRVHGSEDVDPELSRLASLGFTGCLSVVRFNSVSPLKAALLHPHSSPVVITGPLVQSTC
GSSASANPRAAEDTHHLSDQSGSVGSGQPLVKSIRTGSALIGGVIAVAIFLIASGLALTA
RFLYRRRETHGNQEAGGVKREDSGDFTFTSQRDSRSVSTENPKEYFI
Identical sequences G3Q0C9
ENSGACP00000023319 ENSGACP00000023319 69293.ENSGACP00000023319
