SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000020030 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000020030
Domain Number 1 Region: 935-1087
Classification Level Classification E-value
Superfamily C-type lectin-like 2.1e-41
Family C-type lectin domain 0.00000512
Further Details:      
 
Domain Number 2 Region: 148-256
Classification Level Classification E-value
Superfamily C-type lectin-like 2.27e-33
Family Link domain 0.0017
Further Details:      
 
Domain Number 3 Region: 280-395
Classification Level Classification E-value
Superfamily C-type lectin-like 5.5e-25
Family Link domain 0.0032
Further Details:      
 
Domain Number 4 Region: 883-925
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000011
Family EGF-type module 0.01
Further Details:      
 
Domain Number 5 Region: 1088-1148
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.000000000111
Family Complement control module/SCR domain 0.003
Further Details:      
 
Domain Number 6 Region: 42-149
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000215
Family V set domains (antibody variable domain-like) 0.029
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000020030   Gene: ENSGACG00000015182   Transcript: ENSGACT00000020069
Sequence length 1180
Comment pep:known_by_projection group:BROADS1:groupIII:7108954:7139661:-1 gene:ENSGACG00000015182 transcript:ENSGACT00000020069 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MALCRCSAERQTLCALVLLLSLGLGAASSVVNMRRITLPPVHQPLAGTAVLPCVYTLQTG
SSSQAPHLLWTHARVPAEGEQVVLAAKGDAIKVNKAFSGRVTMPGYAANPLNATMEISGL
RTNDSGTYRCQVVMGNEYERDAAPLVVSGVVFHYQAAGTRYALSFADAQRACQENSAEMA
TPAQLWAAHHDGFASCAAGWLDDQTVRYSVQLPGLGCYGHKESSTGVMNYGKRDPKELFD
VYCFAKELNGTHKTYTRAHAQTSQPAGHPTDVVMTTSCLCPPGEVFHSAVPGRLSLSSAS
DRCVSLGGQLATAGQLYLAWKDGLDSCAPGWLSDGSVRYPVTRPRPDCAGRQPGVHTVAP
NSTEDNSTALYDAYCYREQFPQPIVLIPITNVNRKHPTLCFPGKVEKSGSISQIYTSLWK
PWSYLAGMSQADPTGADGPDVTTPHATTPGGTTDASDVSPSNWTGLVDLQEEASGHSTSA
CRGDDRLSWTGESRWSPAPTLRSFSARRSTPLPPTMISKIVKSIWKPWNYLAGTEDGTQP
PSTQAGEEETATKVTGEGSNSPNAASPSTNSPPLQSSEPSPSLSTSPSPPRSDVPFQTAG
SGDPATSTHAESQDSSTRTSTSAHWVPVEEAVLNSTLDRTPLLSGVPDVEEPAWSHAVGS
GALLPGNMEEESSRGVVDVSNITLSSTGWISPFTMTALEISTVYQRELKFSWNILLRLSN
TLPQTDCQMLLWVDEIKLSEVGTAICMNGFNSKDGFSFPPSFSPLLEMPAPSPRINVLIQ
KSYFQTAFASVQKRDSPLCQLRPPTCIDFSAPLALMNGQSHRVYLSHTRARIQTAVNLLV
ITVGVCDFSLIVLCSHAAHYAWWSEPRSRFPYGNHDNCGWDDEVEPCVTNPCLHGGKCLP
QGTGYSCYCPQGYTGENCEIDVDDCLSEPCENGGTCIDKIDSFLCLCLPSYAGDTCEKDV
EGCEHGWRKFHGHCYRYFTHRHTWEDAEKDCREHSAHLSSVVSATEQEFINGLGHDNAWI
GLNDRTVEEDFQWTDSNELVYENWRESQPDNFFAGGEDCVVTIAHEDGKWNDVPCNYNLP
YICKKGTVLCGTPPAVEDAHLIGRRRSHYDIHAVVRYQCSEGFFQRHIPTARCRADGSWE
RPRIICTKSRRSQRYRRHHHNQHNERRGHRRHGGEGHKAR
Download sequence
Identical sequences G3PQZ5
ENSGACP00000020030 69293.ENSGACP00000020030 ENSGACP00000020030

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]