SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000005155 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000005155
Domain Number 1 Region: 2122-2266
Classification Level Classification E-value
Superfamily C-type lectin-like 2.1e-42
Family C-type lectin domain 0.000000351
Further Details:      
 
Domain Number 2 Region: 468-575
Classification Level Classification E-value
Superfamily C-type lectin-like 5.74e-42
Family Link domain 0.0019
Further Details:      
 
Domain Number 3 Region: 150-267
Classification Level Classification E-value
Superfamily C-type lectin-like 4.08e-40
Family Link domain 0.0015
Further Details:      
 
Domain Number 4 Region: 579-692
Classification Level Classification E-value
Superfamily C-type lectin-like 3.37e-30
Family Link domain 0.0045
Further Details:      
 
Domain Number 5 Region: 33-151
Classification Level Classification E-value
Superfamily Immunoglobulin 0.00000000000074
Family V set domains (antibody variable domain-like) 0.034
Further Details:      
 
Domain Number 6 Region: 2267-2327
Classification Level Classification E-value
Superfamily Complement control module/SCR domain 0.00000000000639
Family Complement control module/SCR domain 0.0026
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000005155   Gene: ENSGGOG00000005262   Transcript: ENSGGOT00000005289
Sequence length 2352
Comment pep:known_by_projection chromosome:gorGor3.1:15:68614009:68653110:1 gene:ENSGGOG00000005262 transcript:ENSGGOT00000005289 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MTTLLWVFVTLRVITAAVTVETSDHDNSLSVSIPQPSPLRVLLGTSLTIPCYFIDPMHPV
TTAPSTAPLAPRIKWSRVSKEKEVVLLVATEGRVRVNSAYQDKVSLPNYPAIPTDATLEI
QSLRSNDSGVYRCEVMHGIEDSEATLEVVVKGIVFHYRAISTRYTLDFDRAQRACLQNSA
IIATPEQLQAAYEDGFHQCDAGWLADQTVRYPIHTPREGCYGDKDEFPGVRTYGIRDTNE
TYDVYCFAEEMEGELPCPLGGTQLPSQGHLPLPISPLTLEITQAASSGPDSLCPPLPKIR
PKCSRRAVGVGTDLYEPSAHSSSPSSRSQAERLKLVFAPPLGEDFVDIPENFFGVGGEED
ITVQTVTWPDMELPLPRNITEGEARGSVILTVKPIFEVSPSPLEPEEPFTFAPEIGATAF
PEVENETGEATRPWGFPTPGLGPATAFTSEDLVVQVTAVPGQPHLPGGVVFHYRPGPTRY
SLTFEEAQQACLRTGAVIASPEQLQAAYEAGYEQCDAGWLQDQTVRYPIVSPRTPCVGDK
NSSPGVRTYGVRPSTETYDVYCFVDRLEGEVFFATRLEQFTFQEALEFCESHNATLATTG
QLYAAWSRGLDKCYAGWLADGSLRYPIVTPRPACGGDKPGVRTVYLYPNQTGLPDPLSRH
HAFCFRGISVVPSPGEEEGGTPTSPSGVEEWIATQVVPGVAAVPAEEETTAVPSRETTAI
LEFTTEPENQTEWEPAYTPVGTSPLPGILPTWPPTGAATEESTEGPSATEVPSASEEPSP
SEVPFPSEEPSPSEEPFPSVRPFPSVELFPSEEPFPSKEPSPSEEPSASEEPYTPLPPVP
SWTELPSSGEESGAPDVSGDFIGSGDVSGHLDFSGQLSGDRASELPSGDLDSSGLTSTVG
SGLPVESGLASGDEERIEWSSTPTVGELPSGAEILEGSASGVGDLSGLPSGEVLETSAFG
VGDLSGLPSGEVLETTAPGVEDISGLPSGEVLETTAPGVEDISGLPSGEVLETTAPGVED
ISGLPSGEVLETTAPGVEEISGLPSGEVLETTAPGVEDISGLPSGEVLETTAPGVEEISG
LPSGEVLETTAPGVEEISGLPSGEVLETTAPGVEEISGLPSGEVLETTAPGVEEISGLPS
GEVLETTAPGAEEISGLPSGEVLETTAPGVEDISGLPSGEVLETTAPGVEDISGLPSGEV
LETTAPGVEEISGLPSGEVLETTAPGVEEISGLPSGEVLETTAPGVEEISGLPSGEVLET
TAPGVEDISRLPSGEVLETSTSAVGDLSGLPSGGEVLEISASGVEDISISGLPSGEVVET
SASGIEDVSELPSGEGLETSASGVEDLSRLPSGEEVLEISASGVGDLSGLPSGGEGLETS
ASEVGTDLSGLPSGREGLETSASGAEDLSGLPSGKEDLVGSASGDLDLGKLPSGTLGSGQ
APETSDLPSGFSGEYSGVDLGSGPPSGLPDFSGLPSGFPTVSLVDSTLVEVVTASTASEL
EGRGTIGISGAGEISGLPSSELDISGRASGLPSGTELSGQASGSPDVSGEIPGLFGVSGQ
PSGFPDTSGETSGVTELSGLSSGQPGVSGEASGVLYGTSQPFGITGLSGETSGVPDLSGQ
PSGLPGFSGATSGVPDLVSGATSGSGESSGITFVDTSLVEVAPTTFKEEEGLGSVELSGL
PSGEADLSGKPGMVDVSGQFSGTVDSSGFTSQTPEFSGLPSGIAEVSGESSRAETGSSLP
SGAYYGSGTPSSFPTVSLVDRTLVESVTQAPTAQEAGEGPSGILELSGAHSGAPDMSGEH
SGFLDLSGLQSGLVEPSGEPPGTPYFSGDFASTTNVSGESSVAMGTSGEASGLPEVTLIT
SEFVEGVTEPTISQELGQRPPVTHTPQLFESSGEVSTAGDVSGATPVLPGSGVEVSSVPE
SSSETSAYPEAGFGASAAPEASREDSGSPDLSETTSAFHKADLERSSGLGVSGSTLTFQE
GEASAAPEVSGESTTTNDMGTEAPGLPSATPTASGDRTEISGDLSGHSSRLGVVISTSIP
ESEWTQQTQRPAETHLEIESSSLLNSGEETHKVETATSPTDASIPASPEWKRESESTAAA
PARSCAEEPCGAGTCKETEGHVICLCPPGYTGEHCNIDQEVCEEGWNKYQGHCYRHFPDR
ETWVDAERRCREQQSHLSSIVTPEEQEFVNNNAQDYQWIGLNDRTIEGDFRWSDGHPMQF
ENWRPNQPDNFFAAGEDCVVMIWHEKGEWNDVPCNYHLPFTCKKGTVACGEPPVVEHART
FGQKKDRYEINSLVRYQCTEGFVQRHMPTIRCQPSGHWEEPRITCTDPTTYKRRLQKRSS
RHPRRSRPSTAH
Download sequence
Identical sequences ENSGGOP00000005155 ENSGGOP00000005155

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]