SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000020026 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGACP00000020026
Domain Number 1 Region: 918-1071
Classification Level Classification E-value
Superfamily C-type lectin-like 2.1e-41
Family C-type lectin domain 0.00000512
Further Details:      
 
Domain Number 2 Region: 109-219
Classification Level Classification E-value
Superfamily C-type lectin-like 7.37e-35
Family Link domain 0.0017
Further Details:      
 
Domain Number 3 Region: 221-306
Classification Level Classification E-value
Superfamily C-type lectin-like 9.66e-25
Family Link domain 0.002
Further Details:      
 
Domain Number 4 Region: 867-909
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000108
Family EGF-type module 0.01
Further Details:      
 
Domain Number 5 Region: 1072-1132
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000000011
Family Complement control module/SCR domain 0.003
Further Details:      
 
Domain Number 6 Region: 6-110
Classification Level Classification E-value
Superfamily Immunoglobulin 0.000000000289
Family V set domains (antibody variable domain-like) 0.029
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGACP00000020026   Gene: ENSGACG00000015182   Transcript: ENSGACT00000020064
Sequence length 1169
Comment pep:known_by_projection group:BROADS1:groupIII:7108939:7127624:-1 gene:ENSGACG00000015182 transcript:ENSGACT00000020064 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
PVHQPLAGTAVLPCVYTLQTGSSSQAPHLLWTHARVPAEGEQVVLAAKGDAIKVNKAFSG
RVTMPGYAANPLNATMEISGLRTNDSGTYRCQVVMGNEYERDAAPLVVSGVVFHYQAAGT
RYALSFADAQRACQENSAEMATPAQLWAAHHDGFASCAAGWLDDQTVRYSVQLPGLGCYG
HKESSTGVMNYGKRDPKELFDVYCFAKELNGEVFHSAVPGRLSLSSASDRCVSLGGQLAT
AGQLYLAWKDGLDSCAPGWLSDGSVRYPVTRPRPDCAGRQPGVHTVAPNSTEDNSTALYD
AYCYRETSYTMTHNRGKVEKSGSISQIYTSLWKPWSYLAGMSQADPTGADGPDVTTPHAT
TPGGDVTSVCESSGSFVTLQLIPGQTLLDWGEPLEPGPDSEEFLRPPVDATPADKKVGPD
STFHLEIVKSIWKPWNYLAGTEDGTQPPSTQAGEEETATKVTGEGSNSPNAASPSELIYS
ASNVSQAGGAKHKTDKMRKAPMPISAIQAQRSTPKPKLIHLNKKVDSIKEELPLLKRAGD
LWTPCPSLPYTSELLNPDNRPLCSSNKNLLVMPNAKIVKLLNAHHFKSPPGSPDTADEVP
DALSLALTLPLCNHRNPPSLSTSPSPPRSDVPFQTAGSGDPATSTHAESQDSSTRTSTSA
HWVPVEEAVLNSTLDRTPLLSGVPDVEEPAWSHAVGSGALLPGNMEEESSRGVVDVSNIT
LSSTERTQILLEHFAEVIQHLTPDRLPDAAVGRKLSEVGTAICMNGFNSKGVVPTEGHKH
TVAPLLHKQEKRMDALASHCRLRLSPSWFGFLVAVRIFSLTGLGGTSSVTEIDEDRFPPS
FSPLLEMPAPSPRINVLIQKSYFQIDEVEPCVTNPCLHGGKCLPQGTGYSCYCPQGYTGE
NCEIDVDDCLSEPCENGGTCIDKIDSFLCLCLPSYAGDTCEKDVEGCEHGWRKFHGHCYR
YFTHRHTWEDAEKDCREHSAHLSSVVSATEQEFINGLGHDNAWIGLNDRTVEEDFQWTDS
NELVYENWRESQPDNFFAGGEDCVVTIAHEDGKWNDVPCNYNLPYICKKGTVLCGTPPAV
EDAHLIGRRRSHYDIHAVVRYQCSEGFFQRHIPTARCRADGSWERPRIICTKSRRSQRYR
RHHHNQHNERRGHRRHGGEGHKAREDAHS
Download sequence
Identical sequences G3PQZ1
ENSGACP00000020026

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]