SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed at supfam.org.

Domain assignment for ENSSTOP00000003541 from Ictidomys tridecemlineatus 76_2

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSSTOP00000003541
Domain Number 1 Region: 5324-5482
Classification Level  Classification                     E-value
Superfamily           SET domain                         9.29e-49
Family                Histone lysine methyltransferases  0.0065
 
Domain Number 2 Region: 262-326
Classification Level  Classification        E-value
Superfamily           FYVE/PHD zinc finger  0.00000000000000421
Family                PHD domain            0.0022
 
Domain Number 3 Region: 1371-1431
Classification Level  Classification        E-value
Superfamily           FYVE/PHD zinc finger  0.0000000000000509
Family                PHD domain            0.0039
 
Domain Number 4 Region: 1446-1515
Classification Level  Classification        E-value
Superfamily           FYVE/PHD zinc finger  0.00000000062
Family                PHD domain            0.021
 
Domain Number 5 Region: 1325-1379
Classification Level  Classification        E-value
Superfamily           FYVE/PHD zinc finger  0.0000000117
Family                PHD domain            0.017
 
Domain Number 6 Region: 224-277
Classification Level  Classification        E-value
Superfamily           FYVE/PHD zinc finger  0.000000162
Family                PHD domain            0.01
 
Domain Number 7 Region: 1973-2023
Classification Level  Classification  E-value
Superfamily           HMG-box         0.000000824
Family                HMG-box         0.0038
 

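The domain numbers above follow hit strength, not position along the protein. The sketch below orders the seven regions from N-terminus to C-terminus and reports any pairs whose regions overlap. It assumes the regions are 1-based, inclusive residue ranges (consistent with domain 1 ending at 5482, the stated sequence length); the `Domain` type and the `overlap` helper are hypothetical names introduced for illustration and are not part of SUPERFAMILY.

```python
from itertools import combinations
from typing import NamedTuple


class Domain(NamedTuple):
    number: int        # domain number as listed on this page
    superfamily: str   # superfamily classification
    start: int         # first residue (assumed 1-based, inclusive)
    end: int           # last residue (assumed inclusive)


# Regions copied from the "Strong hits" listing above.
domains = [
    Domain(1, "SET domain", 5324, 5482),
    Domain(2, "FYVE/PHD zinc finger", 262, 326),
    Domain(3, "FYVE/PHD zinc finger", 1371, 1431),
    Domain(4, "FYVE/PHD zinc finger", 1446, 1515),
    Domain(5, "FYVE/PHD zinc finger", 1325, 1379),
    Domain(6, "FYVE/PHD zinc finger", 224, 277),
    Domain(7, "HMG-box", 1973, 2023),
]


def overlap(a: Domain, b: Domain) -> int:
    """Number of residues shared by two inclusive regions (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Order domains along the sequence, N-terminus first.
ordered = sorted(domains, key=lambda d: d.start)
architecture = [d.superfamily for d in ordered]

# Check every pair, not only neighbours, so nested regions are also caught.
overlaps = [
    (a.number, b.number, overlap(a, b))
    for a, b in combinations(ordered, 2)
    if overlap(a, b) > 0
]

for d in ordered:
    print(f"{d.start:>5}-{d.end:<5} domain {d.number}: {d.superfamily}")
for first, second, shared in overlaps:
    print(f"domains {first} and {second} share {shared} residues")
```

With the regions listed on this page, the order along the sequence is five FYVE/PHD zinc fingers, then the HMG-box, then the C-terminal SET domain. Domains 6 and 2 share 16 residues, and domains 5 and 3 share 9.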
Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSTOP00000003541   Gene: ENSSTOG00000003904   Transcript: ENSSTOT00000003951
Sequence length 5482
Comment pep:known_by_projection scaffold:spetri2:JH393322.1:6822057:6859405:-1 gene:ENSSTOG00000003904 transcript:ENSSTOT00000003951 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDSQKSPGEDKDSEPATDGPAASEEPGATEPDLPNPHVGEVSVPSSGSPRLQEPPQGCSG
GPVRRCALCNCGEPSLHGQRELQRFELPFDWPRSPVVLPGGNPGPSEAVLPSEDLSQIGF
PEGLTPAHLGEPGGSFWAHHWCAAWSAGVWGQEGPELCGVDKAIFSGISQRCSHCTRLGA
SIPCRSPGCPRLYHFPCATASGSFLSMKTLQLLCPEHSEGAAHLEEAHCAVCEGPGELCD
LFFCTSCGHHYHGACLDTALTPRKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHT
FCLKPPMEELPAHSWKCKACRVCRACGTGSAELNPNSEWFENYSLCHRCHKAQGGQPISS
VAEQHSPVCSRFSPPEPGDIPTDEPDALCVACQGQPKGGHVTSMQPKESGPLQCEAKPLG
RAGAQLESRLETTLNEEMPLLPPPEESPLSPPPEESPTSPPPEASRLSPPPEESPASPPP
EISPFSPPEASPPSPSLETPLSPPPEASPLSPPFEESPLSPPPEELPSSPPPEASRLSPP
PEESPMSPPPEESPMSPPPEASRLFPPFEESPLSPPPEESPLSPPPEASRLSPPPEDSPM
SPPPEDSPMSPPPEVSCLSPLPEVSHLSPLPEVSRLSPPPEESPLSPPPEESPTSPPPEA
SRLSPPPEDSPTSPPPEDLSASPPPEDSLMSLPLEESPLSPLPEELRLCPQPEEPHLSPQ
PEEPQLCSQSEKLHLSLQPEEPRLCPQPKELHMSPRSEEPHLSPRSEEPHLSPRSEEPHL
SPRSEEPHLSPRPEEPHLSPRPEEPCLTPQLEEPCRSLQPEESPEDPHLCPASEELPLFP
PPGEPPLSPLLGEPALSEPGEPPLSPLPEEMPISPSGEPSLSPQLMPSDPLPPPLSPIIP
AAAPPALSPLGELEYPFGAKGDSDSESPLAAPILETPISPPPEANCTDPEPVPPMILPPS
PGSPMGPASPILMEPLAPQCSPLLQHPLPLPDSPPSRCSPPALPLSIPSPLSPMGKAMDI
SDEAELHEMETEKGPEPECPALEPSATSPLPSPMGDLSCPAPSPAPALDDFSGLGEDTAP
LDGTDAPEAGQTPASLASEPKGSPVLLDPEELAPVTPMEVYGPECKQAGQGSPCEEQEEP
RVPVAPIPPTLIKSDIVNEISNLSQGDASASFPGSEPLLGSPDPEGGGSLSMELGVSTDV
SPVRDEGSLRLCTDSLPETDDSLLCDAGTAISGGKAEGDKGRRRSSPARSRIKQGRSSSF
PGRRRPRGGAHGGRGRGRARLKSTTSSIETLVADIDSSPSKEEEEEDDDTMQNTVVLFSN
TDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECIV
CEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFHC
EWQNSYTHCGPCASLVTCPICHAPYVEEDLLIQCRHCERWMHARCQSLLTEDDVEQAADE
GFDCVSCQPYVVKPVVPVAPPELVPMKVKEPEPQYFRFEGVWLTETGMAVLRNLTMSPLH
KRRQRRGRLGLPGEAGLEGSEPSDALGPDDKKDGDLDTDELLKGEGGVEHMECEIKLEGP
ASPDVEPGKEETEESKKRKRKPYRPGIGGFMVRQRKSHTRVKKGPAAQTEVLSGDGQPDE
VMPTDLPAEGSMEQNLTDGDEKKKQQRRGRKKSKLEDMFPVYLQEAFFGKELLDLSRKAL
FAVGVSRPTFGLGTPKARGDGGSDRKELTTSQKGDDGPDIADEESRGPEGMADIPGPEDG
GVKASPVPSDPEKPGTPGEGMLSSDLDRIPTEELPKMESKDLQQLFKDVLGSEREQHLGC
GTPGLEGSRTPLQRPFIQGGLPLVNLPSSSPMDSYPGLCQSPFLDSRERGGFFSPEPGEP
DSPWTGSGGTTPSTPTTPTTEGEGDGLSYNQRNLQRWEKDEELGQLSTISPVLYANINFP
NLKQDYPDWSNRCKQIMKLWRKVPAADKAPYLQKAKDNRAAHRINKVQKQAESQINKQTK
VGDIARKTDRPALHLRIPPQPGALGSPPPAAAPTIFIGSPTTPAGLSTSADGFLKPPVGT
VPGPDSPGELFLKLPPQVPAQVPSQDPFGLAPAYTLEPRFPVAPPTYPPYPSTTGAPAQP
PMLGASSRPGTGQPGEFRTTPPGTPRHQPSTPDPFLKPRCPSLDNLAVPESPGVVGGKAS
EPLLSPPSFGESRKALEVKKEELGASSPSYGPQNLGFVDSPSSGPHLGGLELKAPDVFKA
PLTPRASQVEPQSPGLGLRPQEPPSSQSLAPSPPSHPDIFRPGPYPDPYAQPPLTPRPQP
PPPESCCALPPRSLPSDPFSRVPASPQSQSSSQSPLTPRPLSAEAFCPSPVTPRFQSPDP
YSRPPSRPQSRDPFAPLHKPPRPQPPEVAFKAGPLAHTPLGAGGFPAALPSGPAGELHAK
VPSGQPPNFARSPGTGAFVGTPSPMRFTFPQAVGEPSLKPPVPQPGLPPPHGINSHFGPG
PTLGKPQSTNYTVATGNFHPSGSPLGSSSGSTGEGYGLSPLRPASVLPPPAPDGSLSYLS
HGASQRASITSPVEKQDPGSTMSSSLAAPDLPGTQDPGMSNLSQTELEKQRQRQRLRELL
IRQQIQRNTLRQEKETAAAAAGAAGLPGSWGAEPSSPAFEQLSRGQTPFTGTQDKSSLVG
LPPSKLGGPILGPGAFPTDERLSRPPPPATPSSLDMNSRQLVGGSQAFYQRGPYPASLPL
QQQQQQQLWQQQQQQQATTATSMRLAMSTRFPSTPGPELGRQALGSPLAGIPTRLPGPAE
PVPGPAGPAQFIELRHNVQKGLGPGGAPFPGQGPPQRPRFFPVNEDPHRLAPEGLRGLAI
SGLPPQKPSVPPAPELNNSLHPTVHTKSPALPAGLELVSRPPSSTELGRPPPLALETGKL
PCEDSELDDDFDAHKALEDDEELAHLGLGVDVAKGDDELGTLENLETNDPHLDDLLNGDE
FDLLAYTDPELDTGDKKDIFNEHLRLVESANEKAEREALLRGVEPGPLGPEERPPPAADA
SEPRIASVLPEVKPKVEEGGRHPSPCQFTITAPKGESAPATTSLGLGLKPGQSVMGTRDT
RMGTGPFSSGGHTVEKGSFGTTGGPSAHLLTPSPLSGPGGSSLLEKFELESGALTLPGGH
ATSGDELDKMESSLVASELPLLIEDLLEHEKKELQKKQQLSAQLQPAQQQQQQQHPLLST
PGPAQALPLPHEGSSSLAGPQQQLALGLAGARQPGLSQPLMPTQTPAHALQQRLAPSMAM
VSNQGHMLSGQHGGQAGLVPQQSPQPVLAQKPMGTMPPSMCMKPQQLVMQQQLANNFFPD
TDLDKFAAEDIIDPIAKAKMVALKGIKKVMAQGSIGVAPGMNRQQVSLLAQRLSGGPGSD
LQNHVAPGSGQERNASDPSQPRPNPPTFAQGVINEADQRQYEEWLFHTQQLLQMQLKLLE
EQIGVHRKSRKALCAKQRTAKKAGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKEHT
NLMAEYRNKQQQQQQQQQQQQQQQQQQQQQQHSAVLALSPSQSPRLLTKLPGQLLPGHGL
QPPQGPPGGQAGGLRLPPGGMALPGQPTGPFLNTALAQQQQQQHSGGAGSLAGASGGFFP
GNLALRSLGPDSRLLQERQLQLQQQRMQLAQKLQQQQQQQQHLLGQVSVQQQQQQGPGVQ
ANQTLGPKPQGLLPPGSHQGLLVQQLSPQPPQGSQGMLGPSQVAVLQQQHPAALGPQGPH
RQVLMTQSRVLSSPQLAQQGHGLMGHRLVTAQQQQQQQQQQQQQQHQQQGSMAGLSHLQQ
GLISHSGQPKLSTQPLGSLQQQQLQQQQQQHQQQQQQLQQQQQQLQQQQQQLQQQQQQLQ
QQQQQQLQQQQQLQQQQQQQQQQQMGLLNQGRTLLSPQQQQQQQVTLGPGMPAKPPQHFS
SSGALGPTLLLTGKEQNIVETALPSEVTEGPSTHQGGPLGIGTAPESLATEPGEVKPSLS
GDSQLLLVQPQAQPQPNSLQLQPPLRLPGQPQQQVNLLHTAGAGSHGQLGSGPSSEASSM
PHLLAQPSVSLGEQPMPMTQNLLGPQQPLGLERPMQNNTGPQPSKAGSGAQSGQGLPGVG
VMPTVGQLRAQLQGVLAKNPQLRHLSPQQQQQLQALLMQRQLQQSQAVRQTPPYQEPGTQ
SSPLQGLLGCHPQPGGFPGSQTGPLQELGAGPRPQGPPRLPTPQGALSTGPVLGPVHPTP
PPSSPQEPKRSSQLPSPSAQLTPTHPGTPKPQGPISELPPERVSPAAAQLVDTFFGKGLG
PWDPPDNLAEAQKLEQSSLVPGHLEQVNGQVVPEPPQLSIKQEPREEPCALGPQAVKREA
NGEPIGTPGTSNHLLLAGSRSEAGHLLLQKLLRAKNVQLGAGRGPEGLRAEINGHIDSKL
AGLEQKLQGTPSNKEDAAARKPLTPKPKRVQKASDRLVSSRKKLRKEDGVRANEALLKQL
KQELSLLPLTEPTITANFSLFAPFGSGCPVSGQSQLRGAFGSGALSSSPDYYSQLLTKNN
LSNPPTPPSSLPPTPPPSVQQKMVNGVTPSEELGEHPKDTASSRDTEGALRDASEVKSVD
LLAALPTPPHNQTEDVRMESDEDSDSPDSIVPASSPESILGEEAPRFPQLGSGQWEQDDR
ALSPVIPIIPRASIPVFPDIKPYGTLNLEVPGKLPATTWEKSKGSEVSVMLTVSAAAAKN
LNGVMVAVAELLSMKIPNSYEVLFPESPARASIEPKKGEAEGPGGKEKGLGVKSSDTGPD
WLKQFDAVLPGYTLKSQLDILSLLKQESPAPEPATQHSYTYNVSNLDVRQLSAPPPEEPS
PPPSPLAPSPASPPAEPLVELATEPSADPPMPSPLPLASSPESARPKPRARPPEEGEDSR
SPRLKKWKGVRWKRLRLLLTIQKGSGRQEDEREVAEFMEQLGTALRPDKVPRDMRRCCFC
HEEGDGATDGPARLLNLDLDLWVHLNCALWSTEVYETQGGALMNVEVALHRGLLTKCSLC
QRTGATSSCNRMRCPNVYHFACAIRAKCMFFKDKTMLCPMHKIKGPCEQELSSFAVFRRV
YIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQMADFHSATALYPVGYEATRI
YWSLRTNNRRCCYRCSIGENNGRPEFIIKVMEQGLEDLVFTDASPQAVWNRIIEPVAAMR
KEADMLRLFPEYLKGEELFGLTVHAVLRIAESLPGVESCQNYLFRYGRHPLMELPLMINP
TGCARSEPKILTHYKRPHTLNSTSMSKAYQSTFTGETNTPYSKQFVHSKSSQYRRLRTEW
KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRI
NNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFD
FEDDQHKIPCHCGAWNCRKWMN
Identical sequences ENSSTOP00000003541 ENSSTOP00000003541
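The Comment field above is an Ensembl-style FASTA description made of colon-separated tokens. The sketch below splits it into named fields and derives the length of the genomic span. It assumes the location token reads coordinate system, assembly, sequence region name, start, end, strand, and that coordinates are 1-based and inclusive; `parse_ensembl_comment` is a hypothetical helper written for this example, not a function of any Ensembl or SUPERFAMILY library.

```python
def parse_ensembl_comment(comment: str) -> dict:
    """Split an Ensembl-style FASTA description into a dictionary.

    Assumed layout: "<seq_type>:<status> <location> <key>:<value> ...",
    where <location> is coord_system:assembly:name:start:end:strand.
    """
    tokens = comment.split()
    record = {}

    # First token, e.g. "pep:known_by_projection".
    seq_type, _, status = tokens[0].partition(":")
    record["seq_type"] = seq_type
    record["status"] = status

    for token in tokens[1:]:
        key, _, value = token.partition(":")
        parts = value.split(":")
        if len(parts) == 5:
            # Location token: five colon-separated values after the key.
            assembly, name, start, end, strand = parts
            record["location"] = {
                "coord_system": key,
                "assembly": assembly,
                "name": name,
                "start": int(start),
                "end": int(end),
                "strand": int(strand),
            }
        else:
            # Simple key:value token such as gene:ENSSTOG00000003904.
            record[key] = value
    return record


# Comment field copied from this page.
comment = (
    "pep:known_by_projection "
    "scaffold:spetri2:JH393322.1:6822057:6859405:-1 "
    "gene:ENSSTOG00000003904 transcript:ENSSTOT00000003951 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

info = parse_ensembl_comment(comment)
location = info["location"]
# Inclusive coordinates, so the span is end - start + 1.
span = location["end"] - location["start"] + 1
print(info["gene"], location["name"], location["strand"], span)
```

Under these assumptions, the transcript lies on the reverse strand of scaffold JH393322.1 and spans 37349 bases of the spetri2 assembly.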
