SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSGACP00000011950 from Gasterosteus aculeatus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGACP00000011950
Domain Number 1 Region: 4620-4778
Classification Level  Classification                     E-value
Superfamily           SET domain                         2.09e-48
Family                Histone lysine methyltransferases  0.0069
 
Domain Number 2 Region: 255-322
Classification Level  Classification                     E-value
Superfamily           FYVE/PHD zinc finger               6.64e-16
Family                PHD domain                         0.0021
 
Domain Number 3 Region: 740-799
Classification Level  Classification                     E-value
Superfamily           FYVE/PHD zinc finger               5.32e-12
Family                PHD domain                         0.0043
 
Domain Number 4 Region: 207-266
Classification Level  Classification                     E-value
Superfamily           FYVE/PHD zinc finger               1.26e-09
Family                PHD domain                         0.01
 
Domain Number 5 Region: 820-881
Classification Level  Classification                     E-value
Superfamily           FYVE/PHD zinc finger               1.71e-09
Family                PHD domain                         0.018
 
Domain Number 6 Region: 695-748
Classification Level  Classification                     E-value
Superfamily           FYVE/PHD zinc finger               3.32e-08
Family                PHD domain                         0.018
 
Domain Number 7 Region: 345-396
Classification Level  Classification                     E-value
Superfamily           FYVE/PHD zinc finger               3.99e-05
Family                PHD domain                         0.028
 
Domain Number 8 Region: 1334-1381
Classification Level  Classification                     E-value
Superfamily           HMG-box                            9.08e-05
Family                HMG-box                            0.0033
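
The strong hits above are listed by significance rather than by position, and some regions share residues. The following is a minimal sketch, not part of the SUPERFAMILY output, that reorders the hits from N- to C-terminus and reports overlapping regions. It assumes the region coordinates are 1-based and inclusive (consistent with domain 1 ending at residue 4778, the stated sequence length); the names `strong_hits`, `region_length`, and `overlap` are illustrative only.

```python
# Strong hits copied from the listing above: (domain number, start, end, superfamily).
strong_hits = [
    (1, 4620, 4778, "SET domain"),
    (2, 255, 322, "FYVE/PHD zinc finger"),
    (3, 740, 799, "FYVE/PHD zinc finger"),
    (4, 207, 266, "FYVE/PHD zinc finger"),
    (5, 820, 881, "FYVE/PHD zinc finger"),
    (6, 695, 748, "FYVE/PHD zinc finger"),
    (7, 345, 396, "FYVE/PHD zinc finger"),
    (8, 1334, 1381, "HMG-box"),
]


def region_length(start, end):
    """Number of residues in a region, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlap(a, b):
    """Number of residues shared by two hits (0 if the regions are disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


# Order the hits along the sequence, N-terminus first.
ordered = sorted(strong_hits, key=lambda hit: hit[1])

# Collect every pair of hits whose regions share at least one residue.
overlaps = [
    (a[0], b[0], overlap(a, b))
    for i, a in enumerate(ordered)
    for b in ordered[i + 1:]
    if overlap(a, b) > 0
]

for number, start, end, name in ordered:
    print(f"domain {number}: {start}-{end} ({region_length(start, end)} aa) {name}")
for first, second, shared in overlaps:
    print(f"domains {first} and {second} share {shared} residues")
```

Under these assumptions, the sketch reports two overlapping pairs among the PHD-type hits (domains 4 and 2, and domains 6 and 3); how SUPERFAMILY itself resolves such overlaps is not stated on this page.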
 
Weak hits

Sequence:  ENSGACP00000011950
Domain Number - Region: 158-189
Classification Level  Classification                     E-value
Superfamily           FYVE/PHD zinc finger               0.0952
Family                PHD domain                         0.081
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): Protein: ENSGACP00000011950   Gene: ENSGACG00000009038   Transcript: ENSGACT00000011974
Sequence length: 4778
Comment: pep:known_by_projection group:BROADS1:groupXII:11221976:11246371:-1 gene:ENSGACG00000009038 transcript:ENSGACT00000011974 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence:
MDEQKSNCEENDSEPTADDNASSKQLVSSEEESEALLAAEKVSSANAASSSTPVESARAC
ALCNCVERSLHGQRELRHLGPFSEQPTIKPSASPLPQPGNDDLSSIGFSNSPSVASLFDD
SEIGGCAVHHWCAVWSEGVKRMENEELENVDKAVISGTQRLCDYCKRLGATIRCHAEACT
RFYHFPCSAASGSFQSMKQLVLLCPEHIDKAEELAGEESWCAVCDSAGELTDLLFCTGCG
QHYHAACLEIGATPIQRAGWQCPECKVCQTCRQPGEDSKMLVCDACDKGYHTFCLQPAMD
SLPSDPWKCRRCRVCMECGVRGLALPGLAQWFDNYAVCEGCQRQRSSVCGVCSKAADPSV
TLLSCSVCHRWVHSECALPEELSEAKCICQLCKEGEQQPATQPHATEIQTREASEETEGH
VTLVEMTIQTDADMVTEEHTELSGVTSRQKDSSEMVQAEAITDKETPMELGKSKGSSPCQ
DPQSPLSLSMDAEVLSSMERGGRLLPHSEDDEDEEYDDMRAEDDIKQELQEQEVKRELLL
DDMSYMSHGDESSSGFLGSPGEQDPQLSIEFGLVPAGRSHADNLLTETDDSLPFEPFRSD
REKVKRRGSPGRSRVKQGRSNSFPGKRRPRGGGGGGGGRGRGGRSRLKAMASCIDAFLLS
MTTETGLSKEEDEEEDDTMQNTVVLFSNTDKFVLLQDMCVVCGSFGKGAEGQLLACAQCA
QCYHPYCVNSKITKTMLRKGWRCLECIVCEMCGKASDPSRLLLCDDCDVSYHTYCLDPPL
HNVPKGGWKCKWCVCCVQCGSNSPGFHCEWQNNYTHCGPCASLVTCPVCRENFMEEELLL
QCQYCDRWVHAVCESLYTEDEVEQASDEGFACTSCSAYVPKPVGESHNNAFYNRRNPIEP
QFYRLEGVWLTESGMSLLRSISMSPLHKRRQRRSRLGTLCCDGGPDGMDLREAEEGEDGK
GCGEPMECDSKMENPGSPDRDGGAEGGPEGMADCEGLKGGSEEMDDSKKRKRKPYRPVGI
GGFMVRQRKCHTRMKKEFFAQLAGETTLDGQPIERTIDEDNIMDPKPVEGEEQAKKRRGR
KKSKLEDMFPAYLQEAFFGKTLIDMGKKAAMIPPGQRPGTCLVRPALPAPQGVKSTAPES
NKPGRIIVIIITIFVSFDPFVLLSLVDDGVVKDNFPLKQEWGEVSQAQGEGAGLPQGMES
QDSEQFFRKVLGVSDGSSLEGMKPILEGSKGELNRSALPQRALLSGSLPSAGMMDAFPGL
NQSPFFDMRDRGGLFSPDGGDESPWATPSTPATPSSPATPTEAEGDGLSYNQRSLQRWEK
DEELGELSTISPVLYANTNFPTLKQDYPDWASRCKQIMKVWRKVSAADKVPYLQKAKDNR
AAQRINKAQKQAESQVCRPVKTEGVRVKTERPSLHLQIPPHSGSTSISSQPSSAESPFPF
PPDSGSSSVFFSDGPVKTPGSAEIRTDPFAKLPPQSPHSHSHPTTPFSHAGASPLQASSS
GYSASGPQGPPQGRPASLGPFDMQPGTPGTPRRAQQVDPYFRSQLQQQHGHLSQSQQGSL
ESLGPPESPHSRGAGLGESPIFSPTHSTHYGDPFRTQQGMGRLEYGSSPSPSAAASSPAS
TGQYKADMSAPSPRSAGVGRSDLSTDSPAGMLESGDGLFKAPMTPRMHQGEGGTLHPGAS
PSHPSEGYRQSPSHPFSDSHTQPSLTPRPQSGDNCSIGPQRHPVAQQELCSRVPSSPQSQ
GSSQSPHTPGGHSNDPYSVHSPAAPRFQSPDLCSQSPSRPQSRDPFATIQKPPRPPSAAP
EGTVIYKTSPHPNQQQSPHSTNNSTGDPLSGKPSAPPNFSRSPSTGGFQITQQQSQMLQG
QLQQQQPQTQRTMSADGYGSRAPPLSGSQELPAGRPPDAPHQPALPGTQEMPDISAVQDP
ALVGLSPSELDKHRQRQRLRDFLIRQQMQKRQGKGNTSPGWSGGEMGAFQQDKAHRAPPP
YPQDRVAAATPGPQTSMAGNMPMAAGGMEDKLIRPPPTGTPAIIDPNALRAQGPSRPQGM
FGRPPFPPQWQGQPPGPRRFPQPGMEVMGMRHNLNPAANIQGMEGMGNPHTMISGHGGEN
MQPMGQGPPPQFIELRHNSQRLSLRPQFMPRGPQQRPRLFVPQQDMSAPYVQQHPIAQAG
GVQTEAGSTSQLGLQQGGLSVLLPQQSTGSLTQQPHLQPLPVTSAPNIPSSEQHQLRQPG
HVTQPQPATAEHGEELPEPDLEGLGDAPGDGGVEDEDDLALDLDPDKGDDDLGNLDNLET
NDPHLDDLLNSDEFDLLAYTDPELDQGDPKDVFSDQLRLVEAESEAPSSVSALVKVEDKA
MVEPERRSLTTAAANERGSASQMSTCADMVDTSKVKVEDRGLTSQQHPGHTVVKDEIGDA
VSMLLGGNAPAKPPQPPNQSASLSSVRLGGLPYSLPGQANALTFPAAAGHANIDDPLGLP
DVGEQHTLAVDLAKVESSLDGELPLLIQDLLEHEKKELQKQQQLSSLHQGGMAHHFPGLS
SQQSNPQAPGQIMLPHHHRPTPQGMMAQPGMVPRAPHMLQQQQQQRLMGPGMAPHMNMGQ
QQAMVRMGQPGIHSGAGHLPQTLIKPPLANNFFPDKDLDKFATDDMDPIAKAKMVALKGI
KRVLAQDPMVVPSGINRQQVSLLAQRLASAPGTDPGQLTPGPPKEGETSDPTQSRPNPPQ
FVQGIINDAEQHQYEEWLLHTQQLLQMQLKFLEEQIGVHRKSRKALCAKQRTAKKAGREF
AEADAEKLKLVTEEQSKIQKQLDQVRKQQKEHTNLIAEYRSKQQQHQQSSGILPPGHSAQ
GAPPHMFPKMPGQMMMGQQGTQVMSQHPAVMPQSGMPVRMPQGQPFIGGAQPQLPAALVA
RAPRAPGPPGAPPGFFPQGPGIQGADPRLLQERQLQHRMQMAKVMMGQQPMPHPNQPPSS
MMGNPLMAQQANPQQGILANQPNQQNMVQVPQRIIGGQSVAPLSQNLAADQPIAHGQAMM
AVQPGIMGNPPVALAQQQRPQLMMGPQGMVGSPGHPGLRGPQAQLTPQQQNILAQRMLVS
QQQHQQNLAHQQQQRQQTQLTSQSNQEQGVLPQPSTPQMGSSPSTGSITPQPQGATDNQN
SGPKEGGMLSPDSRTPPQHSGPSTPNQMPQPGATNDHQTQQQQSNTQSVPEAQTGLVGNQ
QTGALQQHQQQLPLILQRQGSLSVDKPVLMTVKEERKPNDFLAQQQQQQQQQTVQNAMQQ
SQDPNIQQQIITQNHPGQPQSVVMGHSSQQQALMAQQQKQQAMIGMMRAQQQGMMAQRPV
VPPGQIRTPINIQAILAQNPQLRNLPPNQQIQHIQAMIAQRQLQQGQMLRMSMGQGQQGQ
LRPQMPPGQMPQGVQQMQSGMLGQQPGVAPQMQQGMTVPGQQQPVQQQQPVGQMMQQHIM
RGQVPVPPSPMDQGRMVRPTSPRHPMVNSPGHSFNQTMGMRPPTPNQNQQALMPAAAGRM
QGSPSHAYSPRGPFGMSPAHPASPHSSHVSSPSVADSRPGRGSPYSHVKASPLRSPGARS
PLDFPGMKVEAQPSGSEAPHSASNIPNGPKKCFNIQQQITENHGPHAQHGPRVGELCKMT
LQNIKQEPREVQCDGVPEAHPGAIKREATVEAVSSGNNTGFINAGNMSGDPGTQAPRSET
GQQLLQKLLRTKNLQLGAQRPSEGIHNEINGHINSKLAMLEQKLQGTPRNMELLYLCHDL
QSITKRAPVQKPKRTNKASGDRGPNARKKNKKEESGKSAEALIKQLKQGLSLLPLMEPSI
TASLDLFAPFGSSPVNGKAQLKGSFGNAVLDNIPDYYSQLLTKSNLSNPPTPPSSLPPTP
PPSVQHKLLNGVTAGEELSEGQKDTEATQEQMDSVKDEVKSVDILAALPTPPHNQNEDIR
MESDDEDAPESIIPASSPESNVGDDTPRFPQLREPKEEETERAISPIIPLIPRTAIPAFP
ENKPFEAAEGKIVSTSNHWDKAKNNEVSVTFTLSSAAAKKLNHVMMAMAQLLHIRMPGSY
EVTFPPQNPDMAAVDGPGKGSEDSGDLGTKDSASPSQDDWLRQFDVSLPGCTLKKQVDIL
ALIKQEFQEKEDTPAQHCYTTKVNDLDVRHLPAIPVEESPPPSPSPPSPPCPPLPVPVPT
SEAEPSKKPASPSPPPSGLAIVQIKTEAEQHDELAVDAAQLPDVTESEVAVPEPVTDPAA
TSAVPDPPPPTDPTTVLPKVKKWKGIRWKRLQIVITIRKGGSKKESSREVSELMDRLRIA
LRPDRLPRDKRKCCFCHEEGDGATDGPARLLNIDVDLWVHLNCALWSTEVYETQGGALIN
VEVALRRGLRTLCAYCQKTGATNSCNRIRCPNVYHFACAMRARCMFFKDKTMLCTQHKLK
GPSEDELGMFAVLRRVYIERDEVKQIASILQRGDRVHLFRVGGLIFHAVGQLLPSQMANF
HSPTAIFPVGYEATRIYWSTRVPNKRCRYRCRVNEDDGRPLFEVRVLEHGMEDLQYRDTT
PEGIWERVVQQVAKLRDDSSMLKLFTEHVKGEEMYGLTVHAVMRITESLPGVEMCQNYLF
RYGRHPLMELPLMINPSGSARSEPKVPTQCKRPHTLNSTSVSKAYQSTFTGELNTPYSKQ
FVHSKSSQYRRLKTEWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRR
EKIYEFQNRGIYMFRINNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISS
RRIPKGEELTYDYQFDFEDDQHKIPCHCGAWNCRKWMN
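
The Comment line given with this sequence packs several annotations into space-separated `key:value` tokens. As a rough sketch, it can be split into fields as shown below. The interpretation of the `group` token as `assembly:region:start:end:strand` is an assumption based on the usual Ensembl FASTA header layout, not something this page states, and `parse_comment` is a hypothetical helper name.

```python
# Comment string copied verbatim from the record above.
comment = (
    "pep:known_by_projection group:BROADS1:groupXII:11221976:11246371:-1 "
    "gene:ENSGACG00000009038 transcript:ENSGACT00000011974 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(text):
    """Split space-separated tokens into a dict, cutting each at its first colon."""
    fields = {}
    for token in text.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


fields = parse_comment(comment)

# Assumed layout of the location value: assembly:region:start:end:strand.
assembly, region, start, end, strand = fields["group"].split(":")

# Length of the genomic interval, assuming 1-based inclusive coordinates.
span = int(end) - int(start) + 1

print(fields["gene"], fields["transcript"], fields["pep"])
print(f"{assembly} {region} {start}-{end} strand {strand} ({span} bp)")
```

If the assumed layout holds, the gene lies on the reverse strand of groupXII in the BROADS1 assembly.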
Identical sequences: G3P2X7, 69293.ENSGACP00000011950, ENSGACP00000011950
