SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSOPRP00000014011 from Ochotona princeps 76

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSOPRP00000014011
Domain Number 1 Region: 4964-5111
Classification Level Classification E-value
Superfamily SET domain 0.00000000000000196
Family Viral histone H3 Lysine 27 Methyltransferase 0.057
 
Domain Number 2 Region: 1058-1118
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000000000465
Family PHD domain 0.0039
 
Domain Number 3 Region: 275-326
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000000000797
Family PHD domain 0.0022
 
Domain Number 4 Region: 215-276
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000000576
Family PHD domain 0.012
 
Domain Number 5 Region: 1133-1202
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000115
Family PHD domain 0.025
 
Domain Number 6 Region: 1012-1066
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000109
Family PHD domain 0.017
 
Domain Number 7 Region: 1663-1713
Classification Level Classification E-value
Superfamily HMG-box 0.00000209
Family HMG-box 0.0037
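
The seven strong hits above are listed by E-value rather than by position. As a minimal sketch (not part of the SUPERFAMILY server or its tools), the regions can be sorted by start residue to read off the domain order along the sequence and to flag neighbouring regions that share residues. The names DOMAINS, architecture and overlaps are hypothetical helpers introduced here for illustration; the coordinates and E-values are copied from the listing.

```python
# Superfamily-level hits copied from the listing above:
# (superfamily, region start, region end, E-value)
DOMAINS = [
    ("SET domain", 4964, 5111, 0.00000000000000196),
    ("FYVE/PHD zinc finger", 1058, 1118, 0.0000000000000465),
    ("FYVE/PHD zinc finger", 275, 326, 0.000000000000797),
    ("FYVE/PHD zinc finger", 215, 276, 0.000000000576),
    ("FYVE/PHD zinc finger", 1133, 1202, 0.00000000115),
    ("FYVE/PHD zinc finger", 1012, 1066, 0.0000000109),
    ("HMG-box", 1663, 1713, 0.00000209),
]


def architecture(domains):
    """Return superfamily names ordered by start position on the sequence."""
    ordered = sorted(domains, key=lambda d: d[1])
    return [name for name, _start, _end, _evalue in ordered]


def overlaps(domains):
    """Return pairs of position-adjacent regions that share at least one residue."""
    ordered = sorted(domains, key=lambda d: d[1])
    pairs = []
    for left, right in zip(ordered, ordered[1:]):
        # Regions are inclusive, so equal end/start also counts as an overlap.
        if right[1] <= left[2]:
            pairs.append(((left[1], left[2]), (right[1], right[2])))
    return pairs


if __name__ == "__main__":
    print(" > ".join(architecture(DOMAINS)))
    for left, right in overlaps(DOMAINS):
        print("overlapping regions:", left, right)
```

Sorting by start position places the five FYVE/PHD zinc finger regions first, then the HMG-box, with the SET domain at the C-terminal end of the 5114-residue sequence.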
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSOPRP00000014011   Gene: ENSOPRG00000015259   Transcript: ENSOPRT00000015338
Sequence length 5114
Comment pep:known_by_projection genescaffold:pika:GeneScaffold_3795:844:35557:-1 gene:ENSOPRG00000015259 transcript:ENSOPRT00000015338 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDSQKPPGEDKDSEPAXXXXXXXXXXXXXXXXXXXXXXXXXXVPSPGSARLQEPRRDCSG
GPVRRCALCNCGEPSLHGQRELRRFELPFDWPRCPVVSPGGSPGPREAVLSSEDLSQIGF
PEGLTPAHLGEPGGFCWAHHWCAAWSAGVWGQEGPELCGVDKAIFSGISQRCSHCTRLGA
SIPCRSPGCPRLYHFPCAATSGSFLSMKTLQLLCPEHSEGAAHLEEARCAVCEGPGELRD
LLFCTSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHT
FCLKPPMEELPAHSWKCKACRVCRACGAGSAELSPHCEWFENYSLCHRCHEAQGGQPVSS
AAGQHPPVCSRFSPPEPGGDTPTDEPDALYVACQGQPKGGHVTSMQPKEPGPLQCEAKPL
GRAGAQLEPRLEPPEEEMPLLPLPEESPLSPPPEESPTSPPEASRLSPPPEESPPEELPT
SPPPEASRLSPPPEESPMSPPPEESPMSPPPEASRLFPPFEESPLSPPPEESPLSPPPEA
SRLSPPPEDSPMSPPPEDSPMSPPPEDSPMSPPPEVSRLCPPPEESPLSPPALSPLGELT
YPFGAKGDSDPESLAAPILETPISPPPEAHCTDPEPVPPMILPPSPGSPLGPASPILMEP
LPPPCSPLLQHSLPPPSSPPSQCSPLALPLSLPSPLSPVGKAEPLSDEPELHQMETEKVP
EPECPALEPSVTSPLPSPMEELSCPAPSPAPALENFPGLGEDMAPLDGAAAAHAQPAAGE
APGSELKGCPELLDPEELAPVTPMEVYGPECKQPGQGSPCEEQEEPRATVAPIPPTLIKS
DIVNEISNLSQGDASASFPGSEPLLGSPDPEGGGSLSMELGVSTDVSPARDEGSLRLCTD
SLPETDDSLLCDTGTAVSGGKAEGDKGRRRSSPARSRIKQGRSSSFPGRRRPRGGAHGGR
GRGRARLKSTTSSVETLVVADIDGSPSKEEEEDDDDTMQNTVVLFSNTDKFVLMQDMCVV
CGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECIVCEVCGQASDPSRL
LLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFHCEWQNSYTHCGPCA
SLVTCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQAADEGFDCVSCQPYVVK
PVAPVAPPELVPVKAKEPEPQYFRFEGVWLTETGMAVLRNLTMSPLHKRRQRRGRLGLPG
EVGLEGSEPSDALGPDDKKDGDLDAEELLKGEGGVEHMECEIKLEGPTSPDAEPGKEETE
ESKKRKRKPYRPGIGGFMVRQRKSHTRVKKGPAAQSEVLSGDGQPDEGETVMPVDLPAEG
SGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLSRKGLFAVGGRPGF
GLGPPKGKGDGGSDRKEPSALHKGDDGPDVADDESHGPEGKADTPGPEDGGVKASPVPSD
PEKPSTPGEGMLSSDLDRIPTEELPKMESKDLQQLFKDVLGSEREQHLGCGTPGLEGNRT
PLQRPIVHGGLPLGSLPSSSPLDSYPGLCQSPFLDSRERGGFFSPEPGEPDSPWTGSGGT
TPSTPTTPTTEGEGDGLSYNQRSLQRWEKDEELGQLSTISPVLYANINFPNLKQDYPDWS
SRCKQIMKLWRKVPAADKAPYLQKAKDNRAAHRINKVQKQAESQINRQPKVGDTARKTER
PALHLRIPPQQGALGSPPPAAAPTVFLGSPTPPAGMSTSADGFLKPPAGTVPGPDSPGEL
FPKLPPQVPAQVPSQDPFGLAPAYALEPRFPVAPPTYPTYPNVAGAPAQSPVPGASTRPG
PGLPGEFHVTPPGTPRHQPSTPDPFLKPRCPSLDNLAVPESPGVGGGKASEPLLSPPPLG
EARKALEVKKEELGASSPSYGPPNLGFVDPSSSGPHLGGLELKAPDVFKAPLTPRASQVE
PQSPGLGLRPQEPPPAQALAPSPPNHPDIFRPSPYPDPYAQPPLTPRPQPPAPESCCALP
PRSLPSDPFSRVPASPQSQSSSQSPLTPRPLSAEAFCPSPVTPRFQSPDPYSRPPSRPQS
RDPFAPLHKPPRPQPSEVAFKAGSLAHTPLGAGGFPAALPSGPAAELHAKVPSGQPPNFA
RSPGTGAFVSTPSPMRFTFPQAAGEPPLKPPVPQPGLPPPHGINSHFGPGPTLGKPQSTN
YAVATGNFHPSGSPLGPGSGSTVEGYGLSPLRPTSVLPPPAPDGSLPYLSHGASQRASIT
SPVEKREEPGAGMSSSLVAPELPGTQDPGMSSLSQTELEKQRQRQRLRELLIRQQIQRNT
LRQEKETAAAAVGAVGPPGSWGAEPSSPAFEQLSRGQAPFAGTQDKGSLVGLPPGKLGGS
VLGPGPFPSDDRLSRPPPPATPSSVDVNGRQLVGGSQAFYQRAPYPGSLPLQQQQLWQQQ
QQQQQQQQATSAASMRLAMSTRFPSTPGPELSRQALGSPLPGIPTRLPGPGEPVPGPAGP
AQFIELRHNVQKGLGPGGAPFPGQGPPQRPRFYPVTEDPHRLAPEGLRGLVLSGLPSQKP
SAPPAPELSNNLHAPPLTKASTLPAGLELVSRPPSSTELSRPPPLALETGKLPCEDPELD
DDFDAHKALEDDEELAHLGLGVDVAKGDDELGTLENLETNDPHLDDLLNGDEFDLLAYTD
PELDTGDKKDIFNEHLRLVESANEKAEREALLRGVEPGPSVSEERPPPVADASEPRLAEV
KPKVEEGGRHPSPCQFTINTPKVEPAPATTSLGLGLKPGQNMMGSRETRMGTGPFSSGGH
TAEKGPFGTTGGPPAHLLAPSPLSGSAGSSLLEKFELESGPLNLPGGPAASGDELDKMES
SLVASDLPLLIEDLLEHEKKELQQRQQLSAQLQPAQQQQQQQQQQLILSATGPAQAMALP
HEGSSPSLSGPQQQLALGIGGARQPGLGQPLMPTQPPAHALQQRLAPSMAMMSNQGHMLS
GQHGGQAGLVPPQNPQPVLSQKPMGTMPPSMCMKPQQLAVQQQLANSFFPDTDLDKFAAE
DIIDPIAKAKMVALKGIKKVMAQGSIGVAPGMNRQQVSLLAQRLSGGPGSDLQNHVVPGS
GQERNAGDPSQPRPNPPTFAQGVINEADQRQYEEWLFHTQQLLQMQLKVLEEQIGVHRKS
RKALCAKQRTAKKAGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKEHTNLMAEYRNK
QQQQQQQQQQHSAVLTLSPSQSPRLLTKLPGQLLPGHGLQPLQGPPGGQAGGLRMPPGAM
ALPGQPGGPFLNSSMAQQQHSGGAGSLTGPSGGFFPGSLTLRGLAPDSRLVQERQLQLQQ
QRMQLAQKLQQQQQQQQHLLGQVAIQQQQQQGSGVQANQALGPKPQGLLPPSNHQGLLVQ
QLSPQPPQGPQGMLGPAQVAVLQQQQQHPGALGPQGPHRQVLMTQSRVLTSPQLAQQGQG
LMRQRLVTAQQQQQQQQQHSQQQQQGSMPGLSHIQQGLMSHSGQPTLNGQSMSSLQPQQQ
LQQQQLQQQLQQQQQQQLQQQQFQQQQQQMGLLNQSRTLLSPQQQQPQQQQQQQVTLGPS
MPAKPLQHFSSPGALGPTLLLPGKEQNIVETALPSEVAEGASAHQGGGPLGVGTTPEPMA
AEPGEVKPSLSGDSQLLLVQPQAQPQPQPQPGSLQLQPPLRLPGQQQQQQVNLLHSAGMG
SHGQLGSGSSEASAVPQLLVQPSVSVGDQPGPVTQNLLGPQPSLLEQPLQNNTGPQLPKP
GPAPQAGQGLPGVGVMPAVGQLRAQLQGVLAKNPQLRHLSPQQQQQLQALLVQRQLQQSQ
AVRQAPPFQEPGTQPSPLQGLLGCQPQPGGFPGPQTGPPQELGAGPRPQGPPRPPVPQGA
SPAGPALGPVHPTPPPSSPQEPKRPSSQLPSPNAQLPPTHPGTPKPLGRVSPASAQHTDT
FFGKGLGPWDPPDNLAETQKPEQSNLVAGHLEQVNGQVVPEPPQLSIKQEPREESCALGA
PAVKREANGEPVGTAGTSNHLLLAGPRSEAGHLLLQKLLRAKNVQLNAGRGPEGLRTEIN
GHIDSKLAGLEQKLQSTPINKEDVAARKPLTSKPKRVQKAGDRLVSSRKKLRKEDGLRAS
EALLKQLKQELSLLPLTEPTITANFSLLAPFGSGCPVSGQNQLRGAFGSGTLTTGPDYYS
QLLTKNNLSNPPTPPSSLPPTPPPSVQQKMVNGVTASEELGENPKDATSAGDTEGTLRDA
SEVKSLDLLAALPTPPHNQTEDVRMESDEDSDSPDSIVPASSPESILGEEAPRYPQLGSG
RWEQDDRALSPVIPIIPRASIPVFPDAKPYVALDLDVSGKLPAVAWEKGQGSEVSVMLTV
SAAAAKNLNGVMVAVAELLSMKIPNSYEVLFPESPARIGMVPKKGDAEGAVGKEKGVGDK
NPDAGPEWLKQFDAVLPGYTLKSQLDILSLLKQXXXXXXXXXXXXXXXXXXXXXXXXLSA
PPEPSPPPSLAPSPASPPAEPLVELPSESAEPPVPSPLPLASSPESARPKPRARPPEEGE
DSHPPRLKKWKGVRWKRLRLLLTIQKTSGHQEDEREVAEFMEQLGTALRPDKVPRDMRRC
CFCHEEGDGATDGPARLLNLDLDLWVHLNCALWSTEVYETQGGALMNVEVALHRGLLTKC
SLCQRTGATSSCNRMRCPNVYHFACAIRAKCMFFKDKTMLCPMHKVKGPCEQELSSFAVF
RRVYIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQMADFHSATALYPVGYEA
TRIYWSLRTNNRRCCYRCSISENNGRPEFIIKVMEQGLEDLIFTDASPQAVWNRIIEPVA
AMRKEADMLLFPEYLKELFGLTVHVLRIAESLPGVRCQYLSRWPHPLMLPMIPTGCARSE
PKILSHYKRPHTLNSTSMSKAYQSTFTGETHTPYSKQFVHSKSSQYRRLRTEWKNNVYLA
RSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXEVVTFDKEDKIIIISSRRIKGEELTYDYQFDFEDDQHKI
PCHCGAWNCRKWMN
Identical sequences ENSOPRP00000014011 ENSOPRP00000014011
