SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSDARP00000099627 from Danio rerio 69_9

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSDARP00000099627
Domain Number 1 Region: 4720-4879
Classification Level Classification E-value
Superfamily SET domain 2.09e-44
Family Histone lysine methyltransferases 0.0029
Further Details:      
 
Domain Number 2 Region: 994-1054
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000000000104
Family PHD domain 0.0034
Further Details:      
 
Domain Number 3 Region: 379-432
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000000247
Family PHD domain 0.0023
Further Details:      
 
Domain Number 4 Region: 1068-1134
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000000731
Family PHD domain 0.012
Further Details:      
 
Domain Number 5 Region: 944-1003
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000115
Family PHD domain 0.016
Further Details:      
 
Domain Number 6 Region: 328-380
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000532
Family PHD domain 0.011
Further Details:      
 
Domain Number 7 Region: 1693-1740
Classification Level Classification E-value
Superfamily HMG-box 0.0000144
Family HMG-box 0.0028
Further Details:      
 
Weak hits

Sequence:  ENSDARP00000099627
Domain Number - Region: 452-509
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00117
Family PHD domain 0.016
Further Details:      
 
Domain Number - Region: 268-303
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0113
Family PHD domain 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSDARP00000099627   Gene: ENSDARG00000075560   Transcript: ENSDART00000112279
Sequence length 4879
Comment pep:known chromosome:Zv9:2:12643781:12829533:1 gene:ENSDARG00000075560 transcript:ENSDART00000112279 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSSEEKSLDPSDKGPSPPPCSSGATPTGSPTPTDKRPRGRPRKDALSTATSIPSVPRQRK
KNRGRGKALVEDEDSTDGMETAETGSTQETVAKGQQEETEDASAMAVSAEGEEEEQSSSP
LPASSKTGAAQTSSASGEAKSSEQLCAFCYCGGRSLLGQGDLKPFRVTPGYDPPPRQSST
EEVHDHSDKTGAASQSGRQRGLESVGAEEAGSRFWDELRHVGLPDDVDVQTFFESGQCWA
HQSCALWSNFVCQAEDQSLLYVDKAIHSGSTEHCAYCKRLGASIKCCEEGCDRSYHYPCV
GAAGTFQDFRRRSVLCTEHIELAISKYEEEANCILCDSPGDLLDQFFCTSCGLHYHGMCL
DISVTPLKRAGWQCPECKVCQTCKEPGEDTKMLVCDVCDKGYHTFCLKPAMDSIPTNGWR
CKNCRMCVQCGTRSSEQWHHNSLLCQNCGDQQDTTLSCLCPSNVVPDIQKDLLSCHQCKR
WFNPECERPSEGHTHPQPKEDHICSTCRTTGIESGHVCTEADGVQAEIQSDVGAEESTQL
VPSNNDSEPAKVQDIEPHQPVREMQNEEKQTTNEKQMMEIGADVGGDLTKPESSETSHHS
AEKEMTSDASPRETKGPEGEQTEQKLDFAPETSTVQQLETFFKTEEPPETSASLKENISK
NENGSKPCLEEKKINPSPTKEPAPACTSHEEEPMEVLFTEEPSAEAHGVTVEPEVNTSEN
TPLIQSQRKIVQEQIIPPEIHTKEMDDITVDGDHRCLAKLTSLVEESTSLQQPPHRQPEG
LLLVVQGTSQPQALTQAVSRESISNVPSSAFIPITPKIGMGKPAISKRKFSPGRPRVKQG
AWNRRPSSPSWSLDPTEGWDGLKSRQPHSTAAWIFRVGRGSGFPGRRRPRGSGVTGRGGR
GRARMKNGVPPIPTPGVHIMEPMLTFKEEEETAMHSTVVIFSSADTFTMKQDMCVVCGSF
GQGVEGRLIACAQCGQCYHPYCVNIKITKVVLSKGWRCLECTVCEACGQASDPGRLLLCD
DCDISYHTYCLDPPLQNVPNGSWKCKWCVSCTQCGATSAGLRCEWQNNYTQCAPCASLAS
CPLCQQEYKEEEIILQCRQCDRWMHASCQGIHSEEEVEKVADTSFDCNLCQGHIPLSPAP
GTPSKSSLDFADSVFMPQRVTKTRDHDLMRTYTQDGVCLTESGLSQLQSLANAASRRRRS
KPKLKLKIINQNSVAVLQTPPDPQTELSRDGDLEDTKTEGEMVDGEVKSDSSPEREPTAE
DDSKDTDACKKRKRKPYRPGIGGFMVRQRNRTGPGRTKPAFVRKDSTGSETLQGKDEGWG
DQAPDTPVDEKHPLPDFPDNPEVKIRKRYRKKKTKLEEAFPTYLQEAFFGKDLLDKSKQN
RQQTTRLLEDERDRKQLLANQMKPFSDLSLNVSAALLSSKAAAQNSVEPLVDLSEELKSD
TELLGMFSDNNDKQSEESGVDFCPFQVEYSPSPFGLDIAPLAEDESVSSQAHLGRTHPRL
VPEEPLDGILSPELDKMVTDESILSRLYKIPELGGKDVEDLFTAVLSPSTSQPAHMIQQI
HGAQTLHGPSGPVFHPSQANSGLSSRMPVMNGLMESKQQFSQARIGPGAGPEMAQNIPPM
QRIPFSDNLRDRKFNLMSQETAGPWSATGSASSSAPAAVSEMEGDSMSTAQKSTLKWEKE
ETLGELATVAPVLYTNVNFPNLKEEYPDWSTRVKQIAKLWRKASSQDRAPYVQKARDNRA
ALRINKVQISNEPIKRQPPQPQQPLEVFDPAIPPLDPELLFKDPLKHKESEHEQEWKFRQ
QMRQKSKQQAKIEATQKLEQVKNEQRQQQQVPGSQSDHDSSGNLKSPGSVHSNSGDMSPN
QPSSKNGVSKSQLLGTSCSPDDIFLRPHPPPQSSGSQPQSPQMFSPNSSGSRPSSPWDPY
TKMVGTPRPPPSSQGTPRRNSESGKSPKGLPEPIGSPTAICNDPYAKPPDTPRPAGTTDP
FLKPMCPPRPSQTLDGRHVIGSPNHDSFSRMSVRKEAYQRMPQGRMILSDPYARPLLTPI
PGSNESGSIQVFKTPMPPPQAQEQYVGMHARRVSADAFERPMMSSRSNEGFSHNQQNDPY
AQPPLTPRPVVSDGFANQRISRQPQSLPFSQPGPMTRQPSCNSYARAPSTPRPDYSQCDP
YVQQPGTPRPSSDPFAQSPFSNPYARMPGTPRPHDPEPYSQQSASRHPAMMNQPSQQSQQ
QTHNRIMSPISMDPYTQHPNTPRSGIVDPFPKSPSNQRTPDPFCQPPGLPRCVGPDLHIQ
SIGRSQSLRNDSCSHIPRTPHHGVVGQVLVSGPLSSQDPFSPPQVLMQESFSSPTKHGPQ
TLKHLSMTDDSGSQPLSNRPNQTPIHDPFEQTAMLGQCGENKDQQSLVQIISSHTMGQPG
SGTQTIPLAEAEERLRQRQRIRELILKQQQQKSAIRQDKVVQDHTLTMSPATPQHWNLES
TGQQSEIFNHPPPPYPGPGAVRAPQRFYATRDRQGQFPEGQLPRPQFPGDADSNTRQLGA
RMPLTPGIQGPLGAIRPHQMQDSIEAHPQMRRSMSMDLGKSIEGSPLGTPHMPPRGMHVQ
QHNIMGQPFIELRHRTPDSRLRLSFGPPGMQGNRMESPLQQHAPGFLSGQELVFPFNQIT
KAVDTPLNQPQVAFTQMQTSLSLGNLQQPNISLRAGHTQIPLTRSISQPASNETLSSPLH
TDIAAVSAVQNDEVPLPTNEVPEEKIDTVESAVKELEDVEVKDLVDADLENLNLDPDDGK
DLDLETNDLHLDDFLKSGKFDIIAYTDADLDLSEDLDLSDTMEDNTEISEKKAEKKTESF
NATSSASCSTSAVTEAVDKTALSQDDTNQEIPQDQVLVSLKKEDDNKNGFKDCLSQDSSC
SNQISDSAANNQASFKRDVESSLQVHPDTTPVLSSMLVSVPPEGKELKQEQCGKSTEQNN
LSNQETAMSLSNTMLGQENFSVQGIDTGLNIDQSLVSSHDESALEVSSSIQEQQQSHIFG
IDQKDAVLSGEQNSILTQQAVLSQQGHQNRPLLLEEQPLLLQDLLDQERQEQQQQKQMQA
MIRQRSSDSFFPNIDFDAITDPIMKAKMVALKGINKMMVQNNMGMSPMVINNAQTGQALP
DTDGNITPMQLPGQDSKLAPHIARPNPPNFGTGFAHDTQKAQYEDWLRETQQLLQMQQKF
LEEQIGAHRKSKKALSAKQRTAKKAGREFPEEDAEQLKHVTEQQGVVQKQLEQIRKQQKE
HAELIEEYRVKQQQQGALQPPIMPGMQPPAGMMQVGPPINQSMVGPMMPIRLHSNQPDVT
KMPNTAGWHPGAPVPTGPRMPGVMPAQVVQPQPLQPAAARPTQVQAGGESPHVNFDDTNP
FSEGFQERERKERLREQQERQRMQIIKEVERQHVKHCVEQQQVSNCQDGTMRGLSQMPFY
NQELPQDFMQPPRLQQQIQGPTFPQQQGTQQGYIGGPPRPLLGNGPFPQEMGSGFAPQNL
AVHGPNLIQAQTRPQRYSVPHMMAQNPPQGHPFPMEGPTPLPPNFPGPGPSLIQLYSNII
PEEKGKKKRNQKKKKDEDCESLRAPSTPHTPHSDMTAPLTPCVSDTSSTPTRNPMVFGDH
EFCETSQPGSSTPGSMSSQPHSELERQLSEGSCGGGPESAMGHEEMHDRILSNIKLEKVE
ANDCHGHKPIDMEIRIGMVKVEREIMLHHPSSQSPANSSKEEGGNELLKHLLKNKRTPPH
ALPHQRSEDSLRSEEEGSTESKAFFRQNSMDSTGTFSDSNHQDFPGPLILEDKKKQRNKR
TPKSGDRPAPRCKKRKKEEDESQAVYSSTDPMITPLKQQHLSLLPLMEPLVGVNFAHLMP
YGGGQLDGENRLSGTFGSASLDGVSDYYSQLIYKQNNLSNPPTPPASLPPTPPPVARQKL
LNGFATTEELARKGGLIAGHDVTKGLLARPLEFKAEEELLAQALAQGPKTVNVPASLPTP
PHNNQEELRGQEHCEDRNTPDSFVPSSSPESVVGMEISRYPDLSLVKEEPPSPAMSPVIP
MFPVFRDKDVKLQEVKTEPSSVFFDSSFRSVQNGSNTGLVSIAIMLKPAAAENITDVVAA
IADLIRVKIPSSYEVSSGPGGSFGAIKTSVDPQCLASALPNGPRALRQAPQHLLLQHNHN
KNIGGEQYRDGQVRPGSKQQWCQHCKVVVLGKGVRKITKDEEVKPQESRLCSDGGLVFCS
HSCLILHSSSSQSNGNADNKASVPLLSESALKQSFSKVQHQYSNNMSSLDVHCLAQLQPK
PSSPAPYLHMAFAPAKAIKTESKPRSISEGHLKVTVKLKPRLHSHLEDKQWHHGKRWKGL
RWRKWTIDIAMPKVAPQSSESELEERLKQLTTSLRPCLTIRDQRRCCFCQQIGDGMTDGP
ARLLNLDLDTWVHLNCALWSSEVYETQAGALINVGLARQRGQTVVCAFCQRLGATSGCHR
LRCLNIYHFTCALQAGCTFFKDKTMLCHQHRPRGAGAAAGLHVEHQLRCFSVFRRVYVQR
DELRQLAAAVQQPERGHTFRVGSLLFHAMGQLPPALMPTFHSSTAIFPPGYEATRLYWSM
RHGQKRCRYVCSVEEHEGRAEFSIRVIEQGYEDLVLTDTSAKGVWEKVLGPVAERRAETG
MLRLFPIYLKGEDLFGLTISAVTRIAESLPGVEACSRYRFRYGRNPLLVLPLSMNSSGSA
RSELQTYPQHERVKILSSWPRISRCIQNSTVAASASSHSKHFVHSKSSQYRRLASDWKSN
VYLAHSRIQGLGLFAARAIEKQTMVIEYMGDILRTEVAMRRELLYKAKNRPAYMFCIDSE
RVIDATNSGSPARYINHSCSPNCVAEVVTFERGYKIIISAACRIERGEELCYDYKLTPVN
DQSKIPCHCGAAKCRKWIN
Download sequence
Identical sequences F1R598
ENSDARP00000099627 ENSDARP00000099627 ENSDARP00000122949

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]