SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOPRP00000006836 from Ochotona princeps 76

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSOPRP00000006836
Domain Number 1 Region: 1825-1968
Classification Level  Classification        E-value
Superfamily           C-type lectin-like    1.92e-42
Family                C-type lectin domain  0.000000351
 
Domain Number 2 Region: 150-260
Classification Level  Classification      E-value
Superfamily           C-type lectin-like  4.39e-38
Family                Link domain         0.0021
 
Domain Number 3 Region: 261-355
Classification Level  Classification      E-value
Superfamily           C-type lectin-like  3.65e-27
Family                Link domain         0.0027
 
Domain Number 4 Region: 33-151
Classification Level  Classification                                    E-value
Superfamily           Immunoglobulin                                    0.00000000000386
Family                V set domains (antibody variable domain-like)     0.025
 
Domain Number 5 Region: 608-685
Classification Level  Classification      E-value
Superfamily           C-type lectin-like  0.0000000000113
Family                Link domain         0.012
 
Domain Number 6 Region: 1969-2027
Classification Level  Classification                         E-value
Superfamily           Complement control module/SCR domain   0.000000000445
Family                Complement control module/SCR domain   0.0026
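
The six strong hits above are numbered by superfamily E-value rather than by position, so the order of domains along the protein is not obvious at a glance. The following is a minimal sketch of how the listed regions could be sorted by start coordinate and checked for overlaps. The `Domain` type and the helper names `architecture` and `overlaps` are illustrative assumptions, not part of SUPERFAMILY; only the region coordinates and superfamily names are taken from the record.

```python
from typing import List, NamedTuple, Tuple


class Domain(NamedTuple):
    """One strong hit: hit number, 1-based inclusive region, superfamily name."""
    number: int
    start: int
    end: int
    superfamily: str


# Regions and superfamily names copied from the strong hits listed above.
DOMAINS = [
    Domain(1, 1825, 1968, "C-type lectin-like"),
    Domain(2, 150, 260, "C-type lectin-like"),
    Domain(3, 261, 355, "C-type lectin-like"),
    Domain(4, 33, 151, "Immunoglobulin"),
    Domain(5, 608, 685, "C-type lectin-like"),
    Domain(6, 1969, 2027, "Complement control module/SCR domain"),
]


def architecture(domains: List[Domain]) -> List[Domain]:
    """Return the domains ordered from N-terminus to C-terminus."""
    return sorted(domains, key=lambda d: d.start)


def overlaps(domains: List[Domain]) -> List[Tuple[int, int, int]]:
    """Return (left hit, right hit, shared residues) for adjacent overlapping hits."""
    ordered = architecture(domains)
    found = []
    for left, right in zip(ordered, ordered[1:]):
        shared = left.end - right.start + 1  # inclusive coordinates
        if shared > 0:
            found.append((left.number, right.number, shared))
    return found


if __name__ == "__main__":
    for d in architecture(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} hit {d.number}: {d.superfamily}")
    print("Overlapping neighbours:", overlaps(DOMAINS))
```

Read this way, the hits run Immunoglobulin, three C-type lectin-like regions, a further C-type lectin-like region near the C-terminus, and finally the Complement control module/SCR domain. The only overlap is between hit 4 (33-151) and hit 2 (150-260), which share residues 150 and 151.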
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOPRP00000006836   Gene: ENSOPRG00000007438   Transcript: ENSOPRT00000007464
Sequence length 2053
Comment pep:known_by_projection genescaffold:pika:GeneScaffold_3240:8219:124023:1 gene:ENSOPRG00000007438 transcript:ENSOPRT00000007464 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MATLLLMLMALCVTTAADSGDVLDPGSALSVSIPQPSPLKALLGTSLTIPCYFLDPVHPV
TTAPSTAPLDPRIKWSRVSRDKEVVLLVATEGRVRVNSAYQDRVSLPNYPAIPSDATLEV
QSLRSNDSGIYRCEVMHGLEDSEATLEVVVKGIVFHYRAISTRYTLDFDRAQRACLQNSA
IIATPEQLQAAYEDGFHQCDAGWLADQTVRSPIHTPREGCYGDKDDFPSVRIYVIRDTNE
TYDVYCFAEEIEGEVFYATSPEKFTFQEAASECRRLGARLATTGQLYLAWQAGMDMCSAG
WLADRSVRYPISKARPNCGGNLLGVRTVYLHANQTGYPDPSSHYDAICYTGEDFVDIPEN
FFGVGGEEDITVQTVTWPDMELLLPQNVTEGEARGSAILTAKPELDTSPKAPEPEEPLIL
VPHTGTSAFPNAETRTEGAAMAFTSEDLVVQVTAVPGLPRSPGXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXTYDVYCYVDKLQGEVFFATRLEFTFQGAADLSYHDPLLVSPLRVHFFT
LSLLRGFPGWLADGSLRYPIVTPRPACGGDKPGVRTVYLYPNQTGLPDPLSRHHAFCFRG
TSEAPTAGEEEGGTPTSSYDLEEQIVTQVGPGVAAVPFGEETTTVLDLTLEPGNQTGWET
ASHLVGTSLWSGIPPTWSPTGTATEENTEGLSTAEVPSASEELFTPSSLVPLGTELPGSG
EAPGAPDISGDFTGSGEASGHFDTPGQSLGGSASGLPSGEVDSSGLTSLVGSGLPVGSGL
ASGDEDRIELSSTPKVGGMTSGGEDLEGSASGAGDLSQFPSGEVPETSTSGAGGLSELPS
GEVPETYASGVGDLGELPSGEVSEISASGAGGLGELSSGEVPETAASGEGSLIVHGAIRI
PDTSGEMSGGEAVETSASGVGDLGELPSGEVPETSTSGAEGLGELPSRGEAVETSASGVG
DLGELPSGEVPETSTSGAEGLGELPSGGEAPETSASGAEGLGELPSGGEVPETSASGGSL
GELSSGEALETSASGAGGLGELPSGEEAVVTSASGVGSLVELPSGEVLETLVSGAEDLSE
LSSGKEELVASASGALDFDRTLGSGQVPETSGLPSGNSEEYSEVDLGSGPSSGLPDFSGL
PSGFPTVSLVDTTLVEVITTTSASELEGRGIIDISGAGEISERPFSGLDISGGVSGLPSG
AEFSGQTSGSPDTSGETSGLIVHGQPSGFPDTSGEVSGVTELSGQASGSPDTSGDTYGVT
EVSGRASGSPDTNGEISGLIVSGQPSGFSGTSEGMSGVTELXXXXXXXXXXXXSIPFVDT
SLVEVTTTTFEEEGLGSVELSGLPSGETDVLGSSQLIDVSGQSSGTIDSSGFTPLAPGFS
GFTSGAAEISGESSATEVGSSWPSGAFDSSGLSSSFPTVSFIDRTLVESETQTPTAQEAG
EGPSGILELSGARSGVPDMYVDHSGSPDLSGLQSGLVEPSGEPLSSPYSSGDFPSTTDLS
GEFSAVTSTSGEVSGLPQVTLIMSKFMEGVTEPTASQELGQRPPVTHVPPLFESSGEALA
SGDMSGIAPTFPGPGIGASSVPESHSELSTHPELGTRAPDAPEAGVEGAYGLPGSSNTLA
FPEGSTKGSASPEMSAESTLTYTAGTETSGLPPATPTTSRDITEVSGDLSGHTSGLVDVT
TTVQEAEWIEQTQHPAEAHLDIKSSSPSYLGEDTAKTATSPTDAPIRSSPDGAGESRPAA
AAPARSCAEEPCGAGTCQETEGHVTCLCPPGRVGEHCDIDQEVCEEGWIKFQGHCYRHFP
DRETWVDAESRCREQQSHLSSIVTPEEQEFVNNNAQDYQWIGLNDRTIEGDFRWSDGHLL
QFENWRPNQPDNFFATGEDCVVMIWHEKGEWNDVPCNYHLPFTCKKGTVACGEPPVEHAR
TFGRKKERYEINSLVRYQCVEGFIQRHVPTIRCQPSGHWEEPRITCTDSAAYTHRAQKRS
SRSLQSSRASTAA
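
The sequence above contains runs of `X`, the standard one-letter code for an unknown residue, and the domain regions are given as 1-based inclusive coordinates. The sketch below shows one way to locate such runs and to cut a domain region out of a sequence string. The function names `unknown_runs` and `domain_subsequence` are hypothetical helpers written for illustration, and the example uses a short toy string rather than the full protein.

```python
import re
from typing import List, Tuple


def unknown_runs(sequence: str) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (start, end) coordinates of each run of X."""
    return [(m.start() + 1, m.end()) for m in re.finditer(r"X+", sequence)]


def domain_subsequence(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of the sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("region lies outside the sequence")
    return sequence[start - 1:end]


if __name__ == "__main__":
    toy = "MAXXXTLXL"  # toy string, not the real protein
    print(unknown_runs(toy))              # runs of unknown residues
    print(domain_subsequence(toy, 2, 4))  # residues 2 to 4
```

Applied to the full sequence with the line breaks removed, `domain_subsequence(seq, 1825, 1968)` would return the residues covered by hit 1, and `unknown_runs(seq)` would report where the unknown stretches fall relative to the assigned regions.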
Identical sequences: ENSOPRP00000006836
