SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOPRP00000012237 from Ochotona princeps 76

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSOPRP00000012237
Domain Number 1 Region: 1230-1318,1347-1569
Classification Level Classification E-value
Superfamily NHL repeat 2.62e-24
Family NHL repeat 0.0021
Further Details:      
 
Domain Number 2 Region: 1467-1708
Classification Level Classification E-value
Superfamily Calcium-dependent phosphotriesterase 0.00000000163
Family SGL-like 0.07
Further Details:      
 
Domain Number 3 Region: 925-995
Classification Level Classification E-value
Superfamily Carboxypeptidase regulatory domain-like 0.0000053
Family Pre-dockerin domain 0.061
Further Details:      
 
Domain Number 4 Region: 634-659
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000419
Family EGF-type module 0.044
Further Details:      
 
Weak hits

Sequence:  ENSOPRP00000012237
Domain Number - Region: 764-789
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000228
Family EGF-type module 0.04
Further Details:      
 
Domain Number - Region: 667-693
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000404
Family EGF-type module 0.054
Further Details:      
 
Domain Number - Region: 734-758
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000838
Family EGF-type module 0.042
Further Details:      
 
Domain Number - Region: 800-835
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00152
Family EGF-type module 0.046
Further Details:      
 
Domain Number - Region: 600-627
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00472
Family Integrin beta EGF-like domains 0.044
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOPRP00000012237   Gene: ENSOPRG00000013387   Transcript: ENSOPRT00000013408
Sequence length 2770
Comment pep:known_by_projection genescaffold:pika:GeneScaffold_2970:653185:1012788:-1 gene:ENSOPRG00000013387 transcript:ENSOPRT00000013408 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKTPQKSYSSSETLKAYDQEARLAYGNRV
KDMVPQEAEEFCRAGANFSLRELGLGEVTPPHGTLYRTDIGLPHCGYSMGASSDADMEAD
TVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTETDHPGGLQNHPRLRTPP
PPLSHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSGEPPTSGGAPPEPPAHAQD
NWLLNSNIPLETRNLGKQPFLGTLQDNLIEMDILSASRHDGAYSDGHFLFPGGTSLFCTT
SPGYPLTSSTVYSPPPRPLPRGTFARPAFNLKKPSKYCNWKCAALSAIVISATLVILLAY
FVAMHLFGLNWHLQPMEGQMYEITEDTASSWPVPTDVSLYPSGGTGLETPDRKGKGATEG
KPSSSFSEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGIY
GRKGLPPSHTQFDFVELLDGRRLLTQEARSLEGPQRQSRGVVPPSSHETGFIQYLDSGIW
HLAFYNDGKESEVVSFLTTAIESVESCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCPV
LCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACSNHGTCIMGTCICNPGYKGESCEE
VDCMDPTCSGRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCSCDPSW
TGHDCSIEICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECSP
GWNGEHCTIAHYLDRVVKEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGAGCDTSMETACG
DSKDNDGDGLVDCMDPDCCLQPLCHINPLCLGSPDPLDIIQETQAPVSQQNLHSFYDRIK
FLVGKDSTHVIPGENPFDGGHTCVIRGQVMTSDGTPLVGVNISFVNNPLFGYTISRQDGS
FDLVTHGGISIVLRFERAPFITQEHTLWLPWDRFFVMETITMRHEENEIPSCDLSSFARP
HPIVSPSPLTSFASSCAEKGPIVPEIQALQEEIAISGCKMRLSYLSSRTAGYKSVLRISL
THPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGLSEAFVSVG
YEYESCPDLILWEKRTAMLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQQ
PPVIGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCSSDGSLYVGDFNYIRRIFPSGN
VTNILELRNKDFRHSHSPAHKYYLTTDPMSGAVFLSDTNSRRVFKIKSTTVVKDLVKNAE
VVAGTGDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRIDQNGIIS
TLLGSNDLTSARPLSCDSVMDISQVRLEWPTDLAVSPMDNSLYVLDNNVVLQISENHQVR
IVAGRPMHCQVPGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKIHRIRQVTT
SGEISLVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIR
IRFIRKNKPFLNTQNMYELSSPIDQELYLFDTSGKHLYTQSLPTGDFLYNFTYTGDGDVT
LITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALKSVTTQGHELAMMTYHGNS
GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTIT
TNLSASGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRN
VTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRKF
TLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRITSRIFA
DGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIY
QPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAG
MLKTINLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVIN
ETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEIF
RSLMYWMTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDL
NGNLHLLSPGNSARLTPLRYDLRDRITRLGDVQYKMDEDGFLRQRGGDVFEYNSAGLLMK
AYNRASGWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLYY
DLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPHFQII
IGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDHELWKHLSSSNIMPFNLYMFKNNNPIS
NSQDIKCFMTDVNSWLLTFGFQLHNVIPGYPPPDTDAMEPSYELVHTQMKTQEWDNSKSI
LGVQCEVQKQLKAFVTLERFDQLYGSSITGCQQGPKTKRFASSGSVFGKGVKFALKDGRV
TTDIISVANEDGRRVAAILNNAHYLENLHFTISGVDTHYFVKTGPSEGDLAILGLSGGQR
TLENGVNVTVSQINTMLNGRTRRYTDIQLQYGALCLNTRYGTTVDEEKARVLELARQRAV
RQAWAREQQRLRDGEEGLRAWTEGEKQQVLNTGRVQGYDGFFVISVEQYPELSDSANNIH
FMRQSEMGRR
Download sequence
Identical sequences ENSOPRP00000012237 ENSOPRP00000012237

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]