SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSDNOP00000021359 from Dasypus novemcinctus 76_2

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSDNOP00000021359
Domain Number 1 Region: 5159-5317
Classification Level Classification E-value
Superfamily SET domain 9.03e-49
Family Histone lysine methyltransferases 0.0065
Further Details:      
 
Domain Number 2 Region: 1372-1432
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000000000509
Family PHD domain 0.0039
Further Details:      
 
Domain Number 3 Region: 267-326
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000000000166
Family PHD domain 0.0022
Further Details:      
 
Domain Number 4 Region: 1447-1515
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000093
Family PHD domain 0.028
Further Details:      
 
Domain Number 5 Region: 1326-1380
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000031
Family PHD domain 0.018
Further Details:      
 
Domain Number 6 Region: 224-277
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.000000248
Family PHD domain 0.012
Further Details:      
 
Domain Number 7 Region: 1966-2016
Classification Level Classification E-value
Superfamily HMG-box 0.00000209
Family HMG-box 0.0037
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSDNOP00000021359   Gene: ENSDNOG00000041118   Transcript: ENSDNOT00000051525
Sequence length 5317
Comment pep:known_by_projection scaffold:Dasnov3.0:JH561465.1:207882:243609:-1 gene:ENSDNOG00000041118 transcript:ENSDNOT00000051525 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDSLKPPGEDKDSEPAADGPAASEESGAAEPDLPDLHVGEVSVPGSGSARLQEPPQDGSE
GPVRRCALCNCGEPSLHGQRELRRFELPFDWPQCPVVPPGGDSGPKEAVLPSEDLSQIGF
PEGLTPAHLGEPGGPCWAHHWCAAWSAGVWGQEGPELCGVDKAIFSGISQRCSHCTRLGA
SIPCRSPGCPRLYHFPCATASGSFLSMKTLQLLCPQHSEGAAHLEEAHCAVCEGPGELCD
LFFCTSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHT
FCLKPPMEELPAHSWKCKACRVCRACGAGSAELHPNSEWFENYSLCHPCHQAQRGQPVSS
VAEQHPPVCSRFSPPEPGATPTDEPGALYFACRGQPEGGDVTAMQSKEPGPLHCEAKPLG
RAEAQPEPQPEAPLSEEMPLLPPPEESPLSPPPEESPTSPPPEASRLSPPPPEESPLSPP
PESSPFSPLEESPFSPPEESPPSPSPETPLSPPPKASSLSPPLEESPLSPPPEELPTSPP
PEASRLSPPPEESPMSPPPEESPTSPPPEASCLFPPFEESPLSPPPEESPLSPPPEALRL
SPPPEDSPMSPPPEDSPMSPPPEVSRLSPPPEESPLSPPPEESPTSPPPEASRLSPPPED
SPTSPPRPEKPHLSPRPEEPCLFPAPEEPRLSPAPEEPHLSPAPAEPRLSPAAEEPRLSP
APEEPRLSPALEEPHLSPAPEEPRLSPEPEEPHLSTAPEEPHLSPAPEEPLLSAAPEEPC
LSAAPEEPRLSPAPEEPRLSPVSEEPRLSTAPEEPCLSPAPKEPRMSPQPQESFEEPGLC
PTPEELPLVPPSGESPLSPLLGEPALSEPGEPPLSPLTEELPLSPSGEPSLSPQLMPPDP
LPPPLSPIITAVAPPALSPLGQLEYPFGAKGDSDPESPLAAPILETPISPPPEANCTDPE
PVPPMILPPSPGSPMGLASPMLISLPPQSPLPSQCFPPALRLSIPPLSPMEKAVEVSDEA
ELHEMETEKVLEPECPALEPGPSSPLPSPMGELSCPAPSPAPALDDFSGLGEDTALLDGT
DIPGSQAEAGQTSGSLTSELKGSPVLLDPEELTPVTPMEVYGPECKQVGQGSPCEEQEEP
RAPVAPTPPTLIKSDIVNEISNLSQGDASASFPGSEPLLGSPDPEGGGSLSMELGVSTDV
SPARDEGSLRLCTDSLPETDDSLLCEAGTVVSGGKADGDKGRRRSSPARSRVKQGRSSSF
PGRRRPRGGAHGGRGRGRARLKSTTSSIETLVVADIDSSPSKEEEEDDDDTMQNTVVLFS
NTDKFVLMQDMCVVCGSFGRGVEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECI
VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH
CEWQNSYTHCGPCASLVTCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEEEVEQAAD
EGFDCISCQPYVIKPAVVPVAPPELVPMKVKEPEPQYFRFEGVWLTETGMAVLRNLTMSP
LHKRRQRRGRLGLPGEAGLEGSEPSDALGPDDKKDGDLDTDELLKGEGGVEHMECEIKLE
GPVSPDVEPGKEETEESKKRKRKPYRPGIGGFMVRQRKSHTRVKKGPAAQAEVLSGDGQP
DEVLPADLPAEGPVEQSLADGDEKKKQQRRGRKKSKLEDMFPAYLQEAFFGKELLDLSRK
ALFAVGVGRPSFASDDLVRPRAPSGLEAGSGSLPLWFRGADGSLLAIAGPEDGGVKASPV
PSDPEKPGTPGEGMLSSDLDRIPTEELPKMESKDLQQLFKDVLGSEREQHLGCGTPGLDG
SRTPLQRPFLQGGLPLGNLPSSSPMDSYPSLCQSPFLDSRERGGFFSPEPGEPDSPWTGS
GGTTPSTPTTPTTEGEGDGLSYNQRSLQRWEKDEELGQLSTISPVLYANINFPNLKQDYP
DWSSRCKQIMKLWRKVPAADKAPYLQKAKDNRAAHRINKVQKQAESQINKQTKVGDLARK
TDRPALHLRIPPQPGALGSPPPAAAPTIFIGSPTTPAGLSTSADGFLKPPAGTVPGPDSP
GELFLKLPPQVPAQVPSQDPSPDPFLKPRCPSLDNLAVPESPGVGGGKTSEPLLSPPPFG
EPRKALEVKKEELGAASPSYGPPNLGFVDSPSSGPHVGGLELKAPDVFKAPLTPRASQVE
PQSPGLGLRPQEPPAAQALAPSPPSHPDIFRPGPYPDPYTQPPVTPRPQPPAPEGCCALP
PRSLPSDPFSRVPASPQSQSSSQSPLTPRPLSAEAFCPSPVTPRFQSPDPYSRPPSRPQS
RDPFAPLHKPPRPQPPEVAFKAGPLAHTPLGAGGFPAALPSGPTGELHAKVPAGQPPNFA
RSPGTGAFVGSPSSMRFTFPQAVGEPSLKPPQPGLPPPHGINSHFGPGPTLAKPQSTNYT
VATGNFHPSGSPLGPSSGSTGEGYGLSPLRPPSVLPPPVPDGSLPYLSHGASQRAGITSP
VDKREDPGAGMGSSLAAPELPGTQDPGMSSLSQTELEKQRQRQRLRELLIRQQIQRNTLR
QEKETAAAAAGAVGPPGSWAGEPSGPAFEQLNRGQTPFPGSQDKSSLVGLPPNKLSGPGL
GPGPFPGDDRLSRPPPSATPSSLDVNSRQLVGGSQAFYQRPPYPGPLPLQPQPQQQLWQQ
QQQQQAAAATSMRLAMSTRFPSTPGPELGRQALGSPLAGIPTRMPGPGEPVPGPVGPAQF
IELRHNVQKGLGPGGAPFPGQGPPQRPRFYPVSEDPHRLAPEGLRSLAVSGLPPQKPSVP
LAPELNSSLHPTSHTKGPALPTKDIFNEHLRLVESANEKAEREALLRGVEPGSLGPEERP
PPAPEASEPRLAPVLPEVKPKVEESGRHPSPCQFAITTPKVEPAPATPSLGLGLKPGQSV
IGNRDPRMGSGPFSGSGHTTEKGPFGATGGPPAHLLTPNPLGGPGGSSLLEKFDLEGGAL
TLPSGHAPSGDELDKMESSLVASELPLLIEDLLEHEKKELQKKQQLSAQLQPAQQQQQHS
LLPSSGPAQTMPLPPEAASPGLAGPQQQLALGLGGARQPGLAQPPAHALQQRLAPSMAMM
SNQGHMLSGQHGGQAGLVPQQGPQPVLAQKPMGSMPPSMCMKPPQLAMQQQLANSFFPDT
DLDKFAAEDIIDPIAKAKMVALKGIKKVMAQGSIGVAPGMNRNSRQQVSLLAQRLSGGSG
NDLQNHVTAGSGQERSAGDPSQSRPNPPTFAQGVINEADQRQYEEWLFHTQQLLQMQLKV
LEEQIGVHRKSRKALCAKQRTAKKAGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKE
HTNLMAEYRNKQQQQQQQQQQQQQQQQQQHSAVLALSPSQSPRLLTKLPSQLLPGHGLQP
PQGPPGGQAGGLRLPPGSMALSGQPAGPFLNTALAQQQQQQQHSGGAGALAGPSGGFFPG
NLALRGLGPDSRLLQERQLQLQQQRMQLAQKLQQQQQQHLLGQVAIQQQQQQGSGVQANQ
ALGPKPPGLLPPSSHQGLLVQQLSPQPPQGPQGMLGPAQVAVLQQQHQQHPGALGPQGPN
RQVLLTQSRVLSSPQLAQQGQGLMGHRLVTAQQQQQQQQQQQQGSMAGLSHLQQGLLPHS
GQPKLSAQPMGTLQQQQFQQQQQQQQLQQQQQQFQQQQQQLQQQQLQQQQQLQQQQLQQQ
QQQLQQQQQQLQQQQQQQQQFQQQQQQQQMGLLNQSRTLLSPQQQQQPQATLGPGVPAKP
LQHFSSPGALGPTLLLTGKEQGIGETALPAEVTEGSSTHQGGPLAIGTTPESMAAEPGEG
KPPLSGDSQLLLVQPQAQPQAQPQPSSLQLQPPLRLPEQQQQANVLHTAGGGSHGLLGSG
SSSEASSVPHLLAPPSVSLGEHPGPMSQNLLGSQHPLALERPMQSTAGPQLPKAGPVPQS
GQGLPGAGVVPTVGQLRAQLQGVLAKNPQLRHLSPQQQQQLQALLMQRHLQQSQAVRHTP
PYQEPGTQPSPLQGLLGRQPQLGAFPAPQPGPLQELGAGPRPQGPPRLSAPQGALSTGPV
LGPVHPTPPPSSPQEPKRPSPQVPSPSSQLPSEVQLPPNQPGTPKPQGLPSELPPGRVSP
AAAQLVDTFFGKGLGPWGPPDNLAEAQKLEQSSLVAGHLEQVNGQPVPEPPHLSIKQEPR
EEPCALGAPAVKREANGEPVGAPGTSNHLLLAGPRSEAGHLLLQKLLRAKSVQLSTGRGP
EGLRTEINGHIDSKLAGLEQKLQGTPSSKEDTAARKPLTPKPKRVQKASDRLVSSRKKLR
KEDGVRAGEALLKQLKQELSLLPLMEPTITANFSLFAPFGSGCPVNGQCQLRGAFGNGAL
PTGPDYYSQLLTKNNLSNPPTPPSSLPPTPPPSVQQKMVNGVTPSEELGEHPKDAASARE
TEGALRDASEVKSLDLLAALPTPPHNQTEDVRMESDEDSDSPDSIVPASSPESILGEEAP
RFPQLGSGRWEQDDRALSPVIPIIPRTSIPVFPDTKPYGALDLEAPGKLPASTWEKGKGS
EVSVMLTVSAAAAKNLNGVMVAVAELLSMKIPNSYEVLFPESPARAGIEPKKGEAEGPGG
KEKGLGGKSPEAGPDWLKQFDAILPGYTLKSQLDILSLLKQESPAPEPPTQHSYTYNVSN
LDVRQLSAPPPEEPSPPPSPMAPSPASPPTEPLGELPAEPSAEPPVPSPLPLASSPESAR
PKPRARPPEEGEDSRPPRLKKWKGVRWKRLRLLLTIQKGSGRQEDEREVAEFMEQLGTAL
RPDKVPRDMRRCCFCHEEGDGATDGPARLLNLDLDLWVHLNCALWSTEVYETQGGALMNV
EVALHRGLLTKCSLCQRTGATSSCNRMRCPNVYHFACAIRAKCMFFKDKTMLCPMHKIKG
PCEQELSSFAVFRRVYIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQMADFH
SATALYPVGYEATRIYWSLRTNNRRCCYRCSIGENSGRPEFVIKVMEQGLEDLVFTDASP
QAVWNRIIEPVAAMRKEADMLRLFPEYLKGEELFGLTVHAVLRIAESLPGVESCQNYLFR
YGRHPLMELPLMINPTGCARSEPKILTHYKRPHTLNSTSMSKAYQSTFTGETNTPYSKQF
VHSKSSQYRRLRTEWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRRE
KIYEEQNRGIYMFRINNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSR
RIPKGEELTYDYQFDFEDDQHKIPCHCGAWNCRKWMN
Download sequence
Identical sequences ENSDNOP00000021359

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]