SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000027327 from Gorilla gorilla 69_3.1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000027327
Domain Number 1 Region: 74-260
Classification Level    Classification                    E-value
Superfamily             p53-like transcription factors    3.35e-70
Family                  T-box                             0.00000151
 
Domain Number 2 Region: 2301-2381
Classification Level    Classification                              E-value
Superfamily             HLH, helix-loop-helix DNA-binding domain    0.00000000000393
Family                  HLH, helix-loop-helix DNA-binding domain    0.0013
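
The two strong hits above can be represented as simple records for downstream work. The following is a minimal sketch, not part of the SUPERFAMILY server: the class name DomainHit and its methods are hypothetical, and it assumes the reported regions are 1-based and inclusive, so they must be shifted when used as Python slices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One superfamily-level hit (hypothetical record type for illustration)."""
    superfamily: str
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    evalue: float   # superfamily-level E-value as listed on the page

    def length(self) -> int:
        # Inclusive coordinates: both end points count.
        return self.end - self.start + 1

    def extract(self, sequence: str) -> str:
        # Convert 1-based inclusive coordinates to a 0-based half-open slice.
        return sequence[self.start - 1:self.end]


# Values copied from the "Strong hits" listing above.
hits = [
    DomainHit("p53-like transcription factors", 74, 260, 3.35e-70),
    DomainHit("HLH, helix-loop-helix DNA-binding domain", 2301, 2381, 0.00000000000393),
]

for hit in hits:
    print(f"{hit.superfamily}: residues {hit.start}-{hit.end}, "
          f"{hit.length()} aa, E-value {hit.evalue:.2e}")
```

Calling hit.extract(sequence) with the protein sequence listed further down the page would return the residues covered by each domain, under the same coordinate assumption.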
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000027327   Gene: ENSGGOG00000001640   Transcript: ENSGGOT00000027840
Sequence length 2942
Comment pep:novel chromosome:gorGor3.1:15:20127085:20221889:1 gene:ENSGGOG00000001640 transcript:ENSGGOT00000027840 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MEEKQQIILANQDGGTVAGAAPTFFVILKQPGNGKTDQGILVTNQDACALASSVSSPVKS
KGKICLPADCTVGGITVTLDNNSMWNEFYHRSTEMILTKQGRRMFPYCRYWITGLDSNLK
YILVMDISPVDNHRYKWNGRWWEPSGKAEPHVLGRVFIHPESPSTGHYWMHQPVSFYKLK
LTNNTLDQEGHIILHSMHRYLPRLHLVPAEKAVEVIQLNGPGVHTFTFPQTEFFAVTAYQ
NIQITQLKIDYNPFAKGFRDDGLNSKPQRDGKQKNSSDQEGNNISSSSGHRVRLTEGQGS
EIQPGDLDPLSRGHETSGKGLEKTSLNIKRDFLGFMDTDSALSEVPQLKQEISECLIASS
FEDDSRVASPLDQNGSFNVVIKEEPLDDYDYELGECPEGVTVKQEETDEETDIYSNSDDD
PILEKQLKRHNKVDNPEADHLSSKWLPSSPSGVAKAKMFKLDTGKMPVVYLEPCAVTRST
VKISELPDNMLSTSRKDKSSMLAELEYLPTYIENSNETAAFCLGKESENGLRKHSPDLRV
VQKYPLLKEPQWKYPDISDSISTERILDDSKGSVGDSLSGKEDLGRKRTMLKIAAAAKVV
NANQNASPNLPGKRGRPRKLKLCKAGRPPKNTGKSLISAKNTPVSPGSTFPDVKPDLEDV
DGVLFVSFESKEALDIHAVDGTTEESSSLQASTTNDSGYRARISQLEKELIEDLKSLRHK
QVIHPGLQEVGLKLNSVDPTMSIDLKYLGVQLPLAPATSFPFWNLTGTNPASPDAGFPFV
SRTGKTNDFTKIKGWRGKFHSASASRNEGGNSESSLKNRSAFCSDKLDEYLENEGKLMET
SMGFSSNAPTSPVVYQLPTKSTSYVRTLDSVLKKQSTISPSTSYSLKPHSVPPVSRKAKS
QNRQATFSGRTKSSYKSILPYPVSPKQKYSHMILGDKVTKNSSGIISENQANNFVVPTLD
ENIFPKQISLRQAQQQQQQQQQGSRPPGLSKSQVKLMDLEDCALWEGKPRTYITEERADV
SLTTLLTAQASLKTKPIHTIIRKRAPPCNNDFCRLGCVCSSLALEKRQPAHCRRPDCMFG
CTCLKRKVVLVKGGSKTKHFQRKAAHRDPVFYDTLGEEAREEEEGIREEEEQLKEKKKRK
KLEYTICETEPEQPVRHYPLWVKVEGEVDPEPVYIPTPSVIEPMKPLLLPQPEVLSPTVK
GKLLTGIKSARSYTPKPNPVIREEDKDPVYLYFESMMTCARVRVYERKKEDQRQPSSSSS
PSPSFQQQSSCHSSPENHNNAKEPDSEQQPLKQLTCDLEDDSDKLQVNGKSYPQAKLLLG
QMGALHPANRLAAYITGRLRPSVLDLSTLSTVISKVASNAKVAASRKPRTLLPSTSNSKM
ASSSGTATNRPGKNLKAFVPAKRPIAARPSPGGVFTQFVMSKVGALQQKIPGVSTPQTLA
GTQKFSIRPSPVMVVTPVVSSEPVQVCSPVTAAVTTTTPQVFLENITAVTPMTAISDVET
KETTYSSGATTTGVVEVSETNTSTPVTSTQSTATVNLTKTTGITTPVASVAFPKSLVASP
STITLPVASTASTSLVVVTAAASSSMVTTPTSSLGSVPIILSGINGSPPVSQRPENAAQI
PVATPQVSPNTVKRAGPRLLLIPVQQGSPTLRPVSNTQLQGHRMVLQPVRSPSGMNLFRH
PNGQIVQLLPLHQLRGSNTQPNLQPVMFRNPGSVMGIRLPAPSKPSETPPSSTSSSAFSV
MNPVIQAVGSSSAVNVITQAPSLLSSGASFVSQAGTLTLRISPPEPQSFASKTGSETKIT
YSSGGQPVGTASLIPLQSGSFALLQLPGQKPVPSSILQHVASLQMKRESQNPDQKDETNS
IKREQETKKVLQSEGEAVDPEANIIKQNSGAATSEETLNDSLEDRGDHLDEECLPEEGCA
TVKPSEHSCIAGSHADQDYKDVNEEYGARNRKSSKEKVAVLEVRTISEKASNKTVQNLSK
VQHQKLGDVKVEQQKGFDNPEENSSEFPVTFKEESKFELSGSKVMEQQSNLQPEAKEKEC
GDSLEKDKERWRKHLKGPLTRKCVGASQECKKEADEQLIKETKTCQENSDVFQQEQGISD
LLGKSGITEDARVLKTECDSWSRISNPSAFSIVPRRAAKSSRGNGHFQGHLLLPGEQIQP
KQEKKAGRSSADFTVLDLEEDDEDDNEKTDDSIDEIVDVVSDYQSEEVDDVEKSNYVEYI
EDDEEHVDIETVEELSEEINVAHLKTTAAHTQSFKQPSCTHISADEKAAERSRKAPPIPL
KLKPDYWSDKLQKEAEAFAYYRRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLI
LTRAFSEIQGLTDQADKLIGQKNLLTRKRNILIRKVSSLSGKTEEVVLKKLEYIYAKQQA
LEAQKRKKKMGSDEFDMSPRISKQQEGSSTSSVDLGQMFINNRRGKPLILSRKKDQATEN
TSSSNTPHTSTNLVMTPQGQLLTLKGPLFSGPVVAVSPDLLESDLKPQVAGSAVALPEND
DLFMMPRIVNVTSLATEGGLVDMGGSKYPHEVPDGKPSDHLKDTVRNEDNSLEDKGRISS
RGNRDGRVTLGPTQVFLANKDSGYPQIVDVSSMQKAQEFLPKKISGDMRGIQYKWKESES
RGERVKSKDSSFHKLKMKDLKDSSIEMELRKVTSAIEEAALDSSELLTNMEDEDDTDETL
TSLLNEIAFLNQQLNDDSVSLAELPSSMDTEFPGDARRAFISKVPPGSRATFQVEHLGTG
LKELPDVQGESDSISPLLLHLEDDDFSENEKQLAEPASEPDVLKIVIDSEIKDSLLSNKK
AIDGGKNTSGLPAEPESVSSPPTLHMKTGLENSNSTDTLWRPMPKLAPLGLKVANPSSDA
DGQSLKVMPCLAPIAAKVGSVGHKMNLTGNDQEGRESKVMPTLAPVVAKLGNSGASPSSA
GK
Identical sequences ENSGGOP00000001619 ENSGGOP00000027327
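
The Comment field above packs several key:value pairs into one line. The sketch below splits it into named fields. The function names are hypothetical, and the reading of the chromosome value as assembly:name:start:end:strand is an assumption based on the usual Ensembl FASTA header layout rather than something this page states.

```python
def parse_comment(comment: str) -> dict:
    """Split a whitespace-separated 'key:value' comment into a dict."""
    fields = {}
    for token in comment.split():
        # Split on the first colon only; the value may contain more colons.
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value: str) -> dict:
    """Interpret a location value as assembly:name:start:end:strand (assumed layout)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "chromosome": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


# Comment line copied from the record above.
comment = (
    "pep:novel chromosome:gorGor3.1:15:20127085:20221889:1 "
    "gene:ENSGGOG00000001640 transcript:ENSGGOT00000027840 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

fields = parse_comment(comment)
location = parse_location(fields["chromosome"])

# Genomic span of the locus, treating start and end as inclusive.
span = location["end"] - location["start"] + 1
print(fields["gene"], location["chromosome"], span)
```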
