SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000024047 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000024047
Domain Number 1 Region: 4945-5103
Classification Level Classification E-value
Superfamily SET domain 8.63e-49
Family Histone lysine methyltransferases 0.0065
Further Details:      
 
Domain Number 2 Region: 1150-1210
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000000000465
Family PHD domain 0.0039
Further Details:      
 
Domain Number 3 Region: 267-326
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000000000598
Family PHD domain 0.0022
Further Details:      
 
Domain Number 4 Region: 1225-1294
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000113
Family PHD domain 0.025
Further Details:      
 
Domain Number 5 Region: 1104-1158
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000109
Family PHD domain 0.017
Further Details:      
 
Domain Number 6 Region: 224-277
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000792
Family PHD domain 0.01
Further Details:      
 
Domain Number 7 Region: 1756-1806
Classification Level Classification E-value
Superfamily HMG-box 0.00000209
Family HMG-box 0.0037
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000024047   Gene: ENSGGOG00000007949   Transcript: ENSGGOT00000022054
Sequence length 5103
Comment pep:known_by_projection chromosome:gorGor3.1:12:46976681:47012275:-1 gene:ENSGGOG00000007949 transcript:ENSGGOT00000022054 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDSQKLAGEDKDSEPAADGPAASEDPSATESDLPNPHVGEVSVLSSGSPRLQETPQDCSG
GPVRRCALCNCGEPSLHGQRELRRFELPFDWPRCPVVSPGGSPGPNEAVLPSEDLSQIGF
PEGLTPAHLGEPGGSCWAHHWCAAWSAGVWGQEGPELCGVDKAIFSGISQRCSHCTRLGA
SIPCRSPGCPRLYHFPCATASGSFLSMKTLQLLCPEHSEGAAHLEEARCAVCEGPGELCD
LFFCTSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHT
FCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWFENYSLCHRCHKAQGGQPIRS
VAEQHTPVCSRFSPPEPGDTPTDEPDALYVACQGQPKGGHVTSMQPKEPGPLQCEAKPLG
RAGVQLEPQLEAPLNEEMPLLPPPEESPLSPPPEESPTSPPPEASRLSPPPEESPASPLP
EALHLSRPPEESPLSPPPEESPLSPPPESSPFSPLEESPFSPPEESPPSPALETPLSPPP
EASPLSPPFEESPLSPPPEELPTSPPPEASRLSPPPEESPMSPPPEESPMSPPPEASRLF
PPFEESPLSPPPEESPLSPPPEASRLSPPPEDSPMSPPPEESSMSPPPEVSRLSPLPVVS
RLSPPPEESPLSPPPAAPPALSPLGELEYPFGAKGDSDPESPLAAPILETPISPPPEANC
TDPEPVPPMILPPSPGSPVGPASPILMEPLPPQCSPLLQHSLVPQNSPPSQCSPPALPLS
VPSPLSPIGKVVGVSDEAELHEMETEKVSEPECPALEPSATSPLPSPMGDLSCPAPSPAP
ALDDFSGLGEDTAPLDGIDAQGSQPEPGQTPGSLASELKGSPVLLDPEELAPVTPMEVYP
ECKQTAGQGSPCEEQEEPRAPVAPTPPTLIKSDIVNEISNLSQGDASASFPGSEPLLGSP
DPEGGGSLSMELGVSTDVSPARDEGSLRLCTDSLPETDDSLLCDAGTAISGGKAEGEKGR
RRSSPARSRIKQGRSSSFPGRRRPRGGAHGGRGRGRARLKSTASSIETLVVADIDSSPSK
EEEEEDDDTMQNTVVLFSNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVN
SKITKVMLLKGWRCVECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWK
CKWCVSCMQCGAASPGFHCEWQNSYTHCGPCASLVTCPICHAPYVEEDLLIQCRHCERWM
HAGCESLFTEDDVEQAADEGFDCVSCQPYVVKPVAPVAPPELVPMKVKEPAEPQYFRFEG
VWLTETGMALLRNLTMSPLHKRRQRRGRLGLPGEAGLEGSEPSDALGPDDKKDGDLDTDE
LLKGEAYAGGVEHMECEIKLEGPVSPDVEPGKEETEESKKRKRKPYRPGIGGFMVRQRKS
HTRTKKGPAAQAEVLSGDGQPDEVIPADLPAEGAVEQSLAEGDEKKKQQRRGRKKSKLED
MFPAYLQEAFFGKELLDLSRKALFAVGVGRPSFGLGTPKAKGDGGSERKELPTSQKGDDG
PDIADEESRGLEGKADTPGPEDGGVKASPVPSDPEKPGTPGEGMLSSDLDRISTEELPKM
ESKDLQQLFKDVLGSEREQHLGCGTPGLEGSRTPLQRPFLQGGLPLGNLPSSSPMDSYPG
LCQSPFLDSRERGGFFSPEPGEPDSPWTGSGGTTPSTPTTPTTEGEGDGLSYNQRSLQRW
EKDEELGQLSTISPVLYANINFPNLKQDYPDWSSRCKQIMKLWRKVPAADKAPYLQKAKD
NRAAHRINKVQKQAESQINKQTKVGDIARKTDRPALHLRIPPQPGALGSPPPAAAPTIFI
GSPTTPAGLSTSADGFLKPPAGSVPGPDSPGELFLKLPPQVPAQVPSQDPFGLAPAYPLE
PRFPTAPPTYPPYPSPTGAPAQPPMLGASSRSGAGQPGEFHTTPPGTPRHQPSTPDPFLK
PRCPSLDNLAVPESPGVGGGKASEPLLSPPPFGESRKALEVKKEELGASSPSYGPPNLGF
VDSPSSGPHLGGLELKTPDVFKAPLTPRASQVEPQSPGLGLRPQEPPPAQALAPSPPSHP
DIFRPGSYPDPYAQPPLTPRPQPPPPESCCALPPRSLPSDPFSRVPASPQSQSSSQSPLT
PRPLSAEAFCPSPVTPRFQSPDPYSRPPSRPQSRDPFAPLHKPPRPQPPEVAFKAGSLAH
TSLGAGGFPAALPSGPAGELHAKVPSGQPPNFVRSPGTGAFVGTPSPMRFTFPQAVGEPS
LKPPVPQPGLPPPHGINSHFGPGPTLGKPQSTNYTVATGNFHPSGSPLGPSSGSTGESYG
LSPLRPPSVLPPPAPDGSLPYLSHGASQRSGITSPVEKREDPGTGMGSSLATAELPGTQD
PGMSGLSQTELEKQRQRQRLRELLIRQQIQRNTLRQEKETAAAAAGAVGPPGSWGAEPSS
PAFEQLSRGQTPFAGTQDKSSLVGLPPSKLSGPILGPGSFPSDDRLSRPPPPATPSSMDV
NSRQLVGGSQAFYQRAPYPGSLPSSMRFAMSARFPSTPGPELGRQALGSPLAGISTRLPG
PGEPVPGPAGPAQFIELRHNVQKGLGPGGTPFPGQGPPQRPRFYPVSEDPHRLAPEGLRG
LAVSGLPPQKPSAPPAPELNNSLHPTPHTKGPTLPTGLELVNRPPSSTELGRPTPLALEA
GKLPCEDPELDDDFDAHKALEDDEELAHLGLGVDVAKGDDELGTLENLETNDPHLDDLLN
GDEFDLLAYTDPELDTGDKKDIFNEHLRLVESANEKAEREALLRGVEPGPLGPEERPPAA
ADASEPRLASVLPEVKPKVEEGGRHPSPCQFTIATPKVEPAPAANSLGLGLKPGQSMMGS
RDTRMGTGPFSSSGHTAEKASFGATGGPPAHLLTPSPLSGPGGSSLLEKFELESGALTLP
GGPAASGDELDKMESSLVASELPLLIEDLLEHEKKELQKKQQLSAQLQPDSLLSAPGPAQ
AMSLPHEGSSPSLAGSQQQLSLGLAGARQPGLPQPLMPTQPPAHALQQRLAPSMAMVSNQ
GHMLSGQHGGQVGLVPQQSSQPVLSQKPMGTMPPSMCMKPQQLAMQQQLANSFFPDTDLD
RFAAEDIIDPIAKAKMVALKGIKKVMAQGSIGVAPGMNRQQVSLLAQRLSGGPGSDLQNH
VAAGSSQERSAGDPSQPRPNPPTFAQGVINEADQRQYEEWLFHTQQLLQMQLKVLEEQIG
VHRKSRKALCAKQRTAKKAGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKEHTNLMA
EYRNSAVLALSPSQSPRLLTKLPGQLLPGHGLQPPQGPPGGQAGGLRLPPGGMALPGQPG
GPFLNTALAQQQQQQHSGGAGSLAGPSGGFFPGNLALRSLGPDSRLLQENLLGQVAVQQQ
QQQGPGVQTNQALGPKPQGLLPPSSHQGLLVQQLSPQPPQGPQGMLGPAQVAVLQQQHPG
ALGPQGPHRQVLMTQSRVLSSPQLAQQGQGLMGHRLVTEGSMAGLSHLQQSLMSHSGQPK
LSAQPMGSMGLLNQSRTLLSPQQQQQQQVALGPGMPAKPLQHFSSPGALGPTLLLTGKEQ
NTVDPAVSSEATEGPSTHQGGPLAIGTTPESMATEPGEVKPSLSGDSQLLLGSLQLQPPL
RLPGQQQQQVSLLHTAGGGSHGQLGSGSSSEASSVPHLLAQPSVSLGDQPGPMTQNLLGP
QQPMLERPMQNNTGPPQPPKPGPVLQSGQGLPGVGIMPTVGQLRAQLQGVLAKNPQLRHL
SPQQQQQLQALLMQRQLQQSQAVRQTPPYQEPGTQTSPIQGPLGCQPQLGGFPGPQTGPL
QELGAGPRPQGPPRLPAPPGALSTGPVLGPVHPTPPPSSPQEPKRPSQLPSPSSQLPTEA
QLPPTHPGTPKPQGPTLELPPGRVSPAAAQLADTLFSKGLGPWDPPDNLAETQKPEQSSL
VPGHLDQVNGQVVPEASQLSIKQEPREEPCALGVQSVKREANGEPIGAPGTSNHLLLAGP
RSEAGHLLLQKLLRAKNVQLSTGRGSEGLRAEINGHIDSKLAGLEQKLQGTPSNKEDAAA
RKPLTPKPKRVQKASDRLVSSRKKLRKEDGVRASEALLKQLKQELSLLPLTEPAITANFS
LFAPFGSGCPVNGQSQLRGAFGSGALPTGPDYYSQLLTKNNLSNPPTPPSSLPPTPPPSV
QQKMVNGVTPSEELGEHPKDAASARDSERALRDTSEVKSLDLLAALPTPPHNQTEDVRME
SDEDSDSPDSIVPASSPESILGEEAPRFPHLGSGRWEQEDRALSPVIPLIPRASIPVFPD
TKPYGALDLEVPGKLPATTWEKGKGSEVSVMLTVSAAAAKNLNGVMVAVAELLSMKIPNS
YEVLFPESPARAGTEPKKGEAEGPGGKEKGLGGKSPDTGPDWLKQFDAVLPGYTLKSQLD
ILSLLKQESPAPEPPTQHSYTYNVSNLDVRQLSAPPPEEPSPPPSPLAPSPASPPTEPLV
ELPAEPLAEPPVPSPLPLASSPESARPKPRARPPEEGEDSRPPRLKKWKGVRWKRLRLLL
TIQKGSGRQEDEREVAEFMEQLGTALRPDKVPRDMRRCCFCHEEGDGATDGPARLLNLDL
DLWVHLNCALWSTEVYETQGGALMNVEVALHRGLLTKCSLCQRTGATSSCNRMRCPNVYH
FACAIRAKCMFFKDKTMLCPMHKIKGPCEQELSSFAVFRRVYIERDEVKQIASIIQRGER
LHMFRVGGLVFHAIGQLLPHQMADFHSATALYPVGYEATRIYWSLRTNNRRCCYRCSIGE
NNGRPEFVIKVIEQGLEDLVFTDASPQAVWNRIIEPVAAMRKEADMLRLFPEYLKGEELF
GLTVHAVLRIAESLPGVESCQNYLFRYGRHPLMELPLMINPTGCARSEPKILTHYKRPHT
LNSTSMSKAYQSTFTGETNTPYSKQFVHSKSSQYRRLRTEWKNNVYLARSRIQGLGLYAA
KDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNEHVIDATLTGGPARYIN
HSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHKIPCHCGAWNCRK
WMN
Download sequence
Identical sequences ENSGGOP00000024047

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]