SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSHAP00000004216 from Sarcophilus harrisii 76_7.0

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSHAP00000004216
Domain Number 1 Region: 4906-5064
Classification Level Classification E-value
Superfamily SET domain 8.5e-49
Family Histone lysine methyltransferases 0.0065
Further Details:      
 
Domain Number 2 Region: 1042-1103
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000000000687
Family PHD domain 0.0039
Further Details:      
 
Domain Number 3 Region: 266-326
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000000031
Family PHD domain 0.0028
Further Details:      
 
Domain Number 4 Region: 1117-1186
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000166
Family PHD domain 0.021
Further Details:      
 
Domain Number 5 Region: 219-274
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000048
Family PHD domain 0.017
Further Details:      
 
Domain Number 6 Region: 996-1050
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000109
Family PHD domain 0.017
Further Details:      
 
Domain Number 7 Region: 1644-1694
Classification Level Classification E-value
Superfamily HMG-box 0.00000995
Family HMG-box 0.0037
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSHAP00000004216   Gene: ENSSHAG00000003715   Transcript: ENSSHAT00000004258
Sequence length 5064
Comment pep:known_by_projection scaffold:DEVIL7.0:GL861731.1:72273:107850:-1 gene:ENSSHAG00000003715 transcript:ENSSHAT00000004258 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDDPKPSGEDKDSEPAADGPEASEESGPTEPDPPNPPLGDASVHSPGSSKSQDPPQDCSK
GPLRRCAFCNCGEWSLHGQRELRCFKLPPDWPQGSARPPEGPGLGEAVPLNDDLSQIGFP
EGSTPAHLGKPGELCWAHHWCAAWSVGGQGQDGLELHGVDKAIFSGISQRCSHCTRLGAS
IPCRSPGCPRLYHFPCAAASGSFQSMKTLQLLCLEHGEEAEHLEDAHCVVCDGLGELRDL
LFCTSCGQHYHGACLDTALTARKRAGWQCPDCKVCQTCRQPGEDSMMLVCEACDKGYHTF
CLKPAIQSLPPDSWKCKTCRVCRACGACPAELDPNCQWYENYSLCERCQRQQNPQGGRVG
SSETEQSLHVCSKYSQSELGMASVGAPGDFDTACQEQLEGEKEACRKSEEPGPLHCEAKP
LGKGVLQINFASVNSCQLHRTLGTLVRLHSLRQGVGPGCPCLMLSVAYRIWLLMSAGQAG
TQDESLPEVLSPPEVAPLDEEMPLEPLLLPEEPLLSPQLPPTGKELGKHPSSGIQSVPQR
DLLATVQACTNLKAQCEVQNTVPLKSSFFLPFPDPPPPPLSPATAVAPPSLSPLGEQEEL
PDAKGDSEPGTPPEVPILDTPISPPPEASCFEPEPIAPIDLTPSPASPEELGELNSPIPL
EPPPPQCSPLPLAPSPLGFMEKAPKVLDETESQEMETEPECPALEPSGISPPACPTGDLS
CPAPSPAPSPAPALDVFSNLVEDTICLDETEATGPVPEAGQTSGSSGSDPKGSPLLLDPE
ELAPVTPMEVYGSDGKQAGQSSPCEDPEEPVAPPVHTPPTLIKSDIVNEISNLSQGDASA
SFPGSEPLLGSPDPEGGGSLSMELGVSTDVSPARDEGSFRLCTDSLPETDDSLLCDSGAT
GAGGKAEGDKGRRRSSPARSRIKQGRSSSFPGRRRPRGGAHGGRGRGRARLKSTTSSVET
LVVADIDSSPSKEEEEEDDDTMQNTVVLFSNTDKFVLMQDMCVVCGSFGRGAEGHLLACS
QCSQCYHPYCVNSKITKVMLLKGWRCVECIVCEVCGQASDPSRLLLCDDCDISYHTYCLD
PPLLTVPKGGWKCKWCVSCMQCGAVSPGFHCEWQNSYTHCGPCASLVTCPACRAPYVEED
LLIQCRHCERWMHAGCESLFTEEEVEQAADEGFDCASCQPYVVKPVVPIAPPELVPVKVK
EPEPQYFRFEGVWLTETGMAVLRNLSMSPLHKRRQRRGRLGLLGEGGLEGPEPSDPGGPD
DKKDGDLEAEELLKSEGVGVEHMECEIKLEAPASPDGEPGKEETDEGKKRKRKPYRPGIG
GFMVRQRKSHARLKKGPAVLVEVLSGEGQPDEVVTVDPPTEGAVEQSTAEGDEKKKRRGR
KKSKLEDMFPAYLQEAFFGKALLDLSRKALLVAGDGRPGFVPGTLRGKGDPGPDRKESSA
SQKGDDGPDIADEESRGPEGNSETPGPEEGGIKTSPVPSDSEKPGTPGEGMLSSDLDKIP
TEELPKMESKDLQQLFKDVLGSEREQQIGCGTPNLDGSHTPMQQRPFLQGGLPLGNLPTS
SPMDSYPGLCQSPFLDNRERGGFFSPEPGEPDSPWTGSGGTTPSTPTTPTTEGEGDGLSY
NQRSLQRWEKDEELGQLSTISPVLYANINFPNLKQDYPDWASRCKQIMKLWRKVPATDKA
PYLQKAKDNRAAHRINKVQKQAESQISKQTKVGDIARKTDRPTLHLRIPPQPGMLGSPPP
ASAPTVFIGSPPTPAGLSTSSDGFLKPPAGTVPGPDSPGDLFLKLPPQVPAQVPLQDPFG
LAPSYTLESRFPTAPPAYPPYPGSAVPPAKPQMLGAPPRPGAGQPGEFQSTPPGTPRHQP
STPDPFLKPRCPSLDNLAIPESPGVGGAKASEQLISPPPFGEPPRKALEVKKEELGAASP
GYGPPSLGFGDSPSSGPHLGSLELKAPDVFKAPLTPRASQVEPQSPGIGLRPQEPPSAHA
VAPSPPSHADLYRPTPSGSYPETYSQPPLTPRPQPPPPESCCPLPPRSLPSDPFTRVPAS
PQSQSSSQSPLTPRPLSTEAFCQSPVTPCFQSPDPYSRPPSRPQSRDPFAPLHKPPRSQP
PEVAFKAGPLAHTPLGVGGFPVALPSGHPNEHHTKSSGGQPPSFVRSPGANVFVGNPSAM
RFTFPQAVGEPPLKPPASQPSHPPPHGINSHYGPGPTVGKPQSTNYTTATGSFHPGGSPL
GPGTGPPGEGYGLSPMRPPSVLPQPAPDGPLAYLPHGASQRGSITSPIDKREDPVSGMAT
SLVGPELQGPQDPSMANLSQTELEKQRQRQRLRELLIRQQIQRNTLRQEKETAAATSAVG
PGNWGTESSNPAFEQLNRGPTSYPGTQDKSGLTGLPPNKLSGSMLGPSPFPAEERLSRPP
PPPTPSSIDMNGRQLVGGTQAFFPRGPFPGPLPQQQQQQLWQQQQQQQATVASMRLSMPA
RFPQPPGPELGRTILGPPLSGLSNRLPGPGEPLPGPAGPAQFIELRHNVQKGPGPGSAPY
HLQGPPQRPRFFPVAEDSHRLAPESLRPLAVPALPPQKPSAPPPPELSNSLHSLPHAKGP
SLPAGLELVGRSPSSTETARPPLVLESAKLPCEDPELDDDFDAHKALEDDEELAHLGIGV
DVAKGDDELGTLGNLETNDPHLDDLLNGDEFDLLAYTDPELDTGDKKDIFNEHLRLVESA
NEKAEREALLRGTEPGLSGLEERPAPTTDGPDSHLMPGASEVKPKLEEGGHQASPCQFTV
ATPKVDPGPPGTLGLGLKPGQSLLGPRDNRLSMGPFPNNVHTVEKSPFGGTAGPPAQLLT
PNPLGGPGGASLLEKFDLESGGLTLPGGPPPSGDELDKMESSLVASELPLLIEDLLEHEK
KELQKKQQLSAQLQQPPPPPPPPPPPPQQQQQHPLLATSATAQSMPLPQEGPSPGLAVTP
QQLALGLGARQSSLVSTQAMMTTQQPSHALQQRLVPPMAMVPNQGHMLSGGQASLVPQQN
PQPVLAQKPMGAVPPSMCMKPQQLAMQQQLANSFFPDTDLDKFAAEDIIDPIAKAKMVAL
KGIKKVMAQGSIGVPPGMNRQQVSLLAQRLSGAPGNTDLQNHVAAGSGQERSNSDPSQPR
PNPPTFAHGVINEADQRQYEEWLFHTQQLLQMQLKVLEEQIGVHRKSRKALCAKQRTAKK
AGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKEHTNLMAEYRNKQQQQQQQQQQQQQ
QQQQQQHSAVLALSPSQSPRLLAKLPGQLLPGHGLQPPQGPLGGQAGSLRLPPGGVTLTG
QPSGPFLNPALGQQQQQQQHQAGAGTLAGPSAGFFPGNLALRGLGADARLLQERQLQLQQ
QRMQLSQKLQQQQQQQHLLGPKPQGLLPPGSHQGLLVQQLSPQPTPGPQGMLGPAQVAVL
QQQQQQVHPGGLGPQGPHRQVLLAQPRMLGSPQLAQQAQGLMAHRLVMAQQQQQQQQQQQ
QQQQQQQQQQQQQQALAGLSHLQQGLMPPSGQPKLGTQSMGTQQQQQLGLLNQGRTLLPK
PLQHFPSHGALGPTLLLSLPGKEQAGAETVLVPEGTEAPSVHLGGPLALGAPTETLPPEP
VEVKPSIPGDSHLLLSQPQPHTQMNSLQLQAPLRLPGQQQQQQQQQQVSLLHPAGSTSHG
QMTGGQPPEVTSMSHLLTQPLSSVGERPSGLDRPLKGSPRPPPPKPSPLPHPGQGLPGHS
GMPTVGQLRAQLQGVLAKNPQLRHLSPQQQQQLQALLVQRQLQQGQVLRQTVPYLEPGTQ
SSSLQGLLGRQPQPGGFLGAPSGPLQELGAVPRPQGPSRLSTSQGAISSGSTLGFAQPPP
LPSSPQEPKRPSPRLLPPSPQLSAEVQLTPTQIENPKPQESSLELPSGDIPPPSSAASQL
PDTFFAKGPGPWETPDNLAEAHKTEQNNMGHGEVQQVNGQEMPEPPCLMIKQEPREEPCA
MGAQLIKRETNGEPMGTPGTSNHLLLASSRSEAGHLLLQKLLRAKNVQLGTGHGPEGLRA
EINGHIDSKLAGLEQKLQGTPVSKEDVAAKKPLTPKPKRVQKAGDRIASSRKKLRKEDGV
KANEALLKQLKQELSLLPLSEPNITNNFSLFAPFGSGCLINGRNQLRGAFGSGALPTGPD
YYSQLLTKNNLSNPPTPPSSLPPTPPPSVQQKLVNGVTPSEDLGEQQKDAAPARESEGGA
RGWGEVGSLDLLAALPTPPHNQTEDVRMESDEESDSPDSIVPASSPESILGEEAPRFPQL
VSGQQEQEDRALSPVIPIIPRASIPVFPDAKPYGTLEVEPTGKLSATTWEKGKGSEVSVM
LTVSAAAAKNLNGVMVAVAELLSMKIPSSYEVLFPESPARSAGAEPKKGEVEGPGGKEKS
LGGKPTESGSDWLKQFDAVLPGYTLKSQLDILSLLKQESPVPELPTQHCYIHNVSNLDVR
QLSAPPPEEPSPPPSPSAPSPTSPPAEPSAPSPPPLVPSPPMPEAARPKPRARPPEEGED
SRPPRLKKWKGLRWKRLRLLLTIQKGSGRREGEREVAEFMEQLGTALRPDKVPRDLRRCC
FCHEEGDGATDGPARLLNLDLDLWVHLNCALWSTEVYETQGGALMNVEGALHRGLLTKCS
LCQRTGATNSCNRLRCPSVYHFACAIRAKCMFFKDKTMLCPLHKLKGPCEQELSSFAVFR
RVYIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQMADFHSATALYPVGYEAT
RIYWSLRTNNRRCCYRCSIIENNGRPEFIIKVMEQGLEDLVFTDASPQAVWNRIIEPVAA
MRKEADMLRLFPEYLKGEELFGLTVHAVLRIAESLPGVESCHNYLFRYGRHPLMELPLMI
NPTGCARSEPKILTHYKRPHTLNSTSMSKAYQSTFTGETNTPYSKQFVHSKSSQYRRLRT
EWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMF
RINNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQ
FDFEDDQHKIPCHCGAWNCRKWMN
Download sequence
Identical sequences G3VM11
ENSSHAP00000004216 ENSSHAP00000004216

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]