SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSCPOP00000002700 from Cavia porcellus 76_3

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSCPOP00000002700
Domain Number 1 Region: 5106-5264
Classification Level Classification E-value
Superfamily SET domain 8.9e-49
Family Histone lysine methyltransferases 0.0065
Further Details:      
 
Domain Number 2 Region: 1145-1205
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000000000487
Family PHD domain 0.0039
Further Details:      
 
Domain Number 3 Region: 273-326
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000000421
Family PHD domain 0.0022
Further Details:      
 
Domain Number 4 Region: 1220-1289
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.00000000117
Family PHD domain 0.025
Further Details:      
 
Domain Number 5 Region: 1099-1153
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000113
Family PHD domain 0.017
Further Details:      
 
Domain Number 6 Region: 215-276
Classification Level Classification E-value
Superfamily FYVE/PHD zinc finger 0.0000000493
Family PHD domain 0.015
Further Details:      
 
Domain Number 7 Region: 1750-1800
Classification Level Classification E-value
Superfamily HMG-box 0.00000249
Family HMG-box 0.0037
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSCPOP00000002700   Gene: ENSCPOG00000002973   Transcript: ENSCPOT00000003012
Sequence length 5264
Comment pep:known_by_projection scaffold:cavPor3:scaffold_9:50517213:50549471:1 gene:ENSCPOG00000002973 transcript:ENSCPOT00000003012 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MDGQKQPGEDKDSEPAADGPAASEDPGATEPDLPNSHIGEVSDLCHGSPRLQEPPRDGSG
GLVRRCALCNCGEPSLHGQRELRRFELPFDWPRSPVVAPGGNAGPSEAVLPSEDPSQIGF
PEGLTPAHLGEPGGSCWAHHWCAAWSAGVWGQEGPELCGVDKAVFSGISQRCSHCTRLGA
SIPCRSSGCPRLYHFPCATASGSFLSMKTLQLLCPEHSEGAAHLEEARCAVCEGPGELCN
MFFCTSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHT
FCLKPPMEELPAHSWKCKACRVCRVCGAGSSELNPNSEWFENYSLCHRCHKAQEGQPVSS
VAEQHPSVCSKFSPPESGDIPTDGPDALYVACQGQPKSGHVTSMQPKELGPLQCEVKPLG
RAGAQLEPQSEAPLNEEMPLLPLPEESPLSPPPEESPTSPPPEASRLSPPPEESPASPPP
EASCLSLLPEESPLSPPPEESPLSPLPDSSPFSPLEESPLSPPEESPISPALETPLSPPP
KTSPLSPSFEESPLSPPPEELPTSPPPEASRISPPPEESPMSPPPEESPMSPPPEASRLF
PPFEESPLSPPPEESPLSPPPEASRLSPPPEDSPMSPPPEDSPMSPPPEISCLSPLPEVS
HLSPPPEESPLSPPPLSPLGELEYPFGGKGDSDPESPLAAPILETPISPPPEANCTDPEP
VPPMILPPSPSSPMGPASPILMGPLPPQCSPLLQHSSPPPNSSPSHCSPPALPLSVPSPL
SPMGKAVEVSEEAESHEMETEKGPEPECPALEPSATSPLPSPVGDLSCPAPSPAPALDDF
SGLGEDIAPLDGTDAPGSQPEAGQTPGSSASEFKGSPVLLDPEELAPVTPMEVYGPECKQ
AEQGSPCEEQEEPSAPVVPIPPTLIKSDIVNEISNLSQGDASASFPGSEPLLGSPDPEGG
GSLSMELGVSTDVSPVRDEGSLRLCTDSLPETDDSLLCDAGTAISGGKAEGDKGRRRSSP
ARSRIKQGRSSSFPGRRRPRGGAHGGRGRGRARLKSTTSSIETLVVADIDSSPSKEEEEE
DDDTMQNTVVLFSNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITK
VMLLKGWRCVECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCV
SCMQCGAASPGFHCEWQNSYTHCGPCASLVTCPICHAPYVEEDLLIQCRHCERWMHAGCE
SLFTEDDVEQAADEGFDCVSCQPYVVKPVAPVAPPELVPMKVKEPEPQYFRFEGVWLTET
GMAVLRNLTMSPLHKRRQRRGRLGLPGEVALEGSEPSDALGLDEKKDGELDTDELLKGEG
GVEHMECEIKLEGPASPDVEPGKEETEESKKRKRKPYRPGIGGFMVRQRKSHTRVKKGPA
AQAEVLSGDGQPDEGETVMPTDLPAEGSGEQGLIDGDEKKKQQRRGRKKSKLEDMFPAYL
QEAFFGKELLDLSRKALFAVGVGRPSFGLGTPKARGDGGSERKELPVLQKGDDGPDVADE
ESRGLEGMADTPGPEDGGVKASPVPSDSEKPGTPGEGMLSSDLDRIPTEELPKMESKDLQ
QLFKDVLGSEREQHLGCGTPGLDGSRTPLQRPFVQGGLPLGSLPSNSPMDSYPGLCQSPF
LDSRERGGFFSPEPGEPDSPWTGSGGTTPSTPTTPTTEGEGDGLSYNQRSLQRWEKDEEL
GQLSTISPVLYANINFPNLKQDYPDWSSRCKQIMKLWRKVPATDKAPYLQKAKDNRAAHR
INKVQKQAESQINKQTKVGDVARKTDRPALHLRIPPQPGALGSPLPASAPTIFIGSPTTP
AGLSTSADGFLKPPAGTVPGPDSPGELFLKLPPQVPAQVPSQDPFGLAPTYTLEPRFPAA
PPTYPPYPSPTGAPAQPPMLGASSRPGTGQPGEFHSTPPGTPRHQPSTPDPFLKPRCPSL
DNLAVPESPGVAGSKASEPLLSPSAFGETRKALEVKKEELGASSPSYGPSNLGFVDSPSS
GPHLGGLELKAPDVFKAPLTPRASQVEPQSPGLGLRNQEPSPAQGLAASPPNHPDIFRPG
PYPDPYAQPPLTPRPQPPPPLPESCCALPPRSLPSDPFSRVPASPQSQSSSQSPLTPRPL
STEAFCPSPVTPRFQSPDPYNRPPSRPQSRDPFAPLHKPPRPQPPEVAFKAGPLAHTPLG
AGGFPAALPSGPVGELHAKVPSGQPPNFARSPGTGTFVGTSSPMRFTFPQGVGEPSLKPP
VPQPGLPPPHGINSHFGPAPTLGKPQSTNYAVTTGNFHPAGSPLGPSSGSTGEGYGLSPL
RPTSVLPPPAPDGSLPFLSHGASQRTGITSPVEKREDPGATMSSSLGAPELPGTQDPGMS
SLSQTELEKQWQRQRLRELLIRQQIQRNTLRQEKETAAAAAGAVGHPGSWGTEPSSSTFE
QLSRGQTPFPAAQDKSSLVGLPPSKLGGPVLGPGPGLFPTDDRLSRPPPPATPSSMDVNS
RQLVGGSQAFYQRVPYPGSLPLQQQQQQQQQQQQAASATSMRLTMSTRFPSTAGSELGRQ
ALGSPLAGIPTRLPCPAEPVPGPSGPAQFIELRHNVQKGLGPGGPSFPGQGPPQRPRFFP
VNEDTHRLAPEGLRGLAAPGLPPQKPPAPPAPELNNSLHPTSHTKASALTAGLDLVTRPP
STTELARPPPLALEAGKLHCEDPELDDDFDAHKALEDDEELAHLGLGVDVAKGDDELGTL
ENLETNDPHLDDLLNGDEFDLLAYTDPELDTGDKKDIFNEHLRLVESANEKAEREALLRG
VEPGPLGPEERPPPPTDVSEPRLASVLPEVKPKVEEGGRHPSPCQFTITTPKVEPGPASA
SLGLGLKPGQNVLGSRDTRMGTGPFSGSGHTAEKGSFGATGGPPAHLLTPSPLSGTGGSS
LLEKFELESGALTLPSGHGASGDELDKMESSLVASELPLLIEDLLEHEKKELQKKQQLSA
QLQPVQQQQQQQPQPQQQHSLLSTPSSGQAMPLPHEGSSPNLAGPQQQLALGLGGTRQPG
LGQPLMPTQPPAHALQQRLAPTMAMVSNQGHMLSGQHGGQAGLVPQQSPQPVLAQKPMST
MPPSMCMKPQQLAMQQQLANSFFPDTDLDKFAAEDIIDPIAKAKMVALKGIKKVMAQGSI
GVAPGMNRQQVSLLAQRLSGGSGGDLQNHVAPGSSQERNASDPSQPRPNPPTFAQGVINE
ADQRQYEEWLFHTQQLLQMQLKVLEEQIGVHRKSRKALCAKQRTAKKAGREFPEADAEKL
KLVTEQQSKIQKQLDQVRKQQKEHTNLMAEYRNKQQQQQQQQQQQQHSAVLALSPSQSPR
LLTKLPGQLLPGHGLQPPQGPPGGQTGSLRLPPGGMALPGQPGGPFLNTALAQQQQQQHS
GGAGSLAGPSGGFFPGNLALRSLGPDSRLLQERQLQLQQQRMQLAQKLQQQQQQQQQQQH
HLGQVAVQQQQQQGPSLQANQALGPKPQGLVPPNSHQGLLVQQLSPQPPQGPQGMLGPAQ
VAVLQQHSGALGPQGPHRQVLMTQSRVLSSPQLAQQGHGLMGHRLVTGQQQQQSQQHQQQ
GPMAGLSHLQQGLLSHSGQAKLNAQPLGSLQQQQLQQQLQQQQQQQQQLQQQQQQQQQLQ
QQQQLQQQQQQQLHQQQLQQLQQQQQQQLQQQLLQQQQQQQQQQQQQMCLLNQSRTLLSP
QQQQQVTLGPGMPAKPLQHFSSPGTLGPTLLLTGKEQNTVETALPSEVNEGPSTHQGGPL
IIGPASESVATESGEVKPSLSGDSQLLLVQPQAQPQPNSLQLQPPVRLPGQPQQQVNLLH
TAGVGSHGQLGSGSSSEGSSMPHLLTQPSVSLGEQPGPVTQNILGPQQPLGLERPVQNNA
GPQPPKSGPVPQSGQGLSGVGITPTVGQLRVQLQGVLAKNPQLRHLSPQQQQQLQALILQ
RQLQQSQAVRQTPPYQEPGTQPSPLQGLLGCQPQPGGFPGTQTGPLQELGAGPRPQGPPR
LSVPQGALSTGPVVGPVHPTPPPSSPQEPKRPSSQLPSPSTQLTPTHPGTPKPQGPTLEL
PPGRVSPAAAQLADTFFGKGLGPWDPSDNLVEAQKPEQCSLVPGHLEQVNGQVAPEPPQL
SIKQEPREEPCALGAQAVKREANGEPVGASGTSNHLLLAGPRSEAGHLLLQKLLRAKNVQ
LSAGRGPEGLRAEINGHIDSKLTGLEQKLQGTPANKEDAAARKPLTPKPKRVQKASDRLV
SSRKKLRKEDGVRANEALLKQLKQELSLLPLTEPTITANYSLFAPFGSSCPISGQSQLRG
AFGSGALPSGPDYYSQLLTKNNLSNPPTPPSSLPPTPPPSVQQKMVNGVTPSEELGEHPK
DPASAGDTEGTLRDASEVKSLDLLAALPTPPHNQTEDVRMESDEDSDSPDSIVPASSPES
ILGEEAPRFPQLGSGRGEQDDRALSPVIPIIPRASIPVFPDTKPYGVLDLEVPGKLPATA
WEKGKGSEVSVMLTVSAAAAKNLNGVMVAVAELLRMKIPNSYEVLFPESPARVGIEPKKG
EAEGPGGKEKNISSKSSDSSPDWLKQFDAVLPGYTLKSQLDILSLLKQESPAPELPTQHS
YTYNVSNLDVRQLSAPPPEEPSPPPSPLAPSPASPPADPLVELPVEPLAEPPVPSPLPLA
SSPESTRPKPRARPPEEGEDSRPPHLKKWKGVRWKRLRLLLTIQKGSGRQEDEREVAEFM
EQLGTALRPDKVPRDMRRCCFCHEEGDGATDGPARLLNLDLDLWVHLNCALWSTEVYETQ
GGALMNVEVALHRGLLTKCSLCQRTGATSSCNRMRCPSVYHFACAIRAKCMFFKDKTMLC
PMHKIKGPCEQELSSFAVFRRVYIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLP
HQMADFHSATALYPVGYEATRIYWSLRTNNRRCCYRCSIGENNGRPEFVIKVMEQGLEDL
VFTDASPQAVWNRIIEPVAAMRKEADMLRLFPEYLKGEELFGLTVHAVLRIAESLPGVES
CQNYLFRYGRHPLMELPLMINPTGCARSEPKILTHYKRPHTLNSTSMSKAYQSTFTGETN
TPYSKQFVHSKSSQYRRLRTEWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRN
EVANRREKIYEEQNRGIYMFRINNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDK
IIIISSRRIPKGEELTYDYQFDFEDDQHKIPCHCGAWNCRKWMN
Download sequence
Identical sequences ENSCPOP00000002700 ENSCPOP00000002700 10141.ENSCPOP00000002700

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]