SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000015110 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000015110
Domain Number 1 Region: 215-392
Classification Level Classification E-value
Superfamily MIR domain 8.37e-47
Family MIR domain 0.0025
Further Details:      
 
Domain Number 2 Region: 1051-1178
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.54e-18
Family SPRY domain 0.032
Further Details:      
 
Domain Number 3 Region: 659-794
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000138
Family SPRY domain 0.064
Further Details:      
 
Domain Number 4 Region: 101-185
Classification Level Classification E-value
Superfamily MIR domain 0.00000719
Family MIR domain 0.028
Further Details:      
 
Domain Number 5 Region: 2158-2228
Classification Level Classification E-value
Superfamily IP3 receptor type 1 binding core, domain 2 0.0000119
Family IP3 receptor type 1 binding core, domain 2 0.018
Further Details:      
 
Domain Number 6 Region: 3973-4051
Classification Level Classification E-value
Superfamily EF-hand 0.0000143
Family Calmodulin-like 0.051
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000015110
Domain Number - Region: 1404-1528
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.000214
Family SPRY domain 0.033
Further Details:      
 
Domain Number - Region: 482-643,1821-1853
Classification Level Classification E-value
Superfamily ARM repeat 0.00038
Family Armadillo repeat 0.094
Further Details:      
 
Domain Number - Region: 2578-2679,2876-2975,3080-3122,3602-3612,3787-3999
Classification Level Classification E-value
Superfamily ARM repeat 0.00219
Family Pumilio repeat 0.069
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000015110   Gene: ENSGGOG00000015461   Transcript: ENSGGOT00000015542
Sequence length 4803
Comment pep:known_by_projection chromosome:gorGor3.1:19:35808724:35967517:1 gene:ENSGGOG00000015461 transcript:ENSGGOT00000015542 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGDAEGEDEVQFLRTDDEVVLQCSATVLKEQLKLCLAAEGFGNRLCFLEPTSNAQNVPPD
LAICCFVLEQSLSVRALQEMLANTVEAGVESSQGGGHRTLLYGHAILLRHAHSRMYLSCL
TTSRSMTDKLAFDVGLQEDATGEACWWTMHPASKQRSEGEKVRVGDDIILVSVSSERYLH
LSTASGELQVDASFMQTLWNMNPICSRCEEGFVTGGHVLRLFHGHMDECLTISPADSDDQ
RRLVYYEGGAVCTHARSLWRLEPLRISWSGSHLRWGQPLRVRHVTTGRYLALTEDQGLVV
VDASKAHTKATSFCFRISKEKLDVAPKRDVEGMGPPEIKYGESLCFVQHVASGLWLTYAA
PDPKALRLGVLKKKAMLHQEGHMDDALSLTRCQQEESQAARMIHSTNGLYNQFIKSLDSF
SGKPRGSGPPAGTALPIEGVILSLQDLIIYFEPPSEDLQHEEKQSKLRSLRNRQSLFQEE
GMLSMVLNCIDRLNVYTTAAHFAEFAGEEAAESWKEIVNLLYELLASLIRGNRSNCALFS
TNLDWLVSKLDRLEASSGILEVLYCVLIESPEVLNIIQENHIKSIISLLDKHGRNHKVLD
VLCSLCVCNGVAVRSNQDLITENLLPGRELLLQTNLINYVTSIRPNIFVGRAEGTTQYSK
WYFEVMVDEVTPFLTAQATHLRVGWALTEGYSPYPGGGEGWGGNGVGDDLYSYGFDGLHL
WTGHVARPVTSPGQHLLAPEDVISCCLDLSVPSISFRINGCPVQGVFESFNLDGLFFPVV
SFSAGVKVRFLLGGRHGEFKFLPPPGYAPCHEAVLPRERLHLEPIKEYRREGPRGPHLVG
PSRCLSHTDFVPCPVDTVQIVLPPHLERIREKLAENIHELWALTRIEQGWTYGPVRDDNK
RLHPCLVDFHSLPEPERNYNLQMSGETLKYMMSNGYKPAPLDLSHVRLTPAQTTLVDRLA
ENGHNVWARDRVGQGWSYSAVQDIPARRNPRLVPYRLLDEATKRSNRDSLCQAVRTLLGY
GYNIEPPDQEPSQVENQSRCDRVRIFRAEKSYTVQSGRWYFEFEAVTTGEMRVGWARPEL
RPDVELGADELAYVFNGHRGQRWHLGSEPFGRPWQPGDVVGCMIDLTENTIIFTLNGEVL
MSDSGSETAFREIEIGDGFLPVCSLGPGQVGHLNLGQDVSSLRFFAICGLQEGFEPFAIN
MQRPVTTWFSKGLPQFEPVPLEHPHYEVSRVDGTVDTPPCLRLTHRTWGSQNSLVEMLFL
RLSLPVQFHQHFRCTAGATPLAPPGLQPPAEDEARAAEPDPDYENLRRSAGGWGEAENGK
EGTAKEGAPGGTPQAGGEAQPARAENEKDATTEKNKKRGFLFKAKKVAMMTQPPATPTLP
RLPHDVVPADNRDDPEIILNTTTYYYSVRVFAGQEPSCVWAGWVTPDYHQHDMSFDLSKV
RVVTVTMGDEQGNVHSSLKCSNCYMVWGGDFVSPGQQGRISHTDLVIGCLVDLATGLMTF
TANGKESNTFFQVEPNTKLFPAVFVLPTHQNVIQFELGKQKNIMPLSAAMFQSERKNPAP
QCPPRLEMQMLMPVSWSRMPNHFLQVETRRAGERLGWAVQCQEPLTMCMDILELSERLDL
QRFHSHTLCLYRAVCALGNNRVAHALCSHVDQAQLLHALEDAHLPGPLRAGYYDLLISIH
LESACRSRRSMLSEYIVPLTPETRAITLFPPGRSTENGHPRHGLPGVGVTTSLRPPHHFS
PPCFVAALPAAGAAEAPARLSPAIPLEALRDKALRMLGEAVRDGGQHARDPVGGSVEFQF
VPVLKLVSTLLVMGIFGDEDVKQILKMIEPEVFTEEEEEEEEEEEDEEEDEEEKEEDEEE
TAQEKEDEEKEEEEAAEGEKEEGLEEGLLQMKLPESVKLQMCHLLEYFCDQELQHRVESL
AAFAERYVDKLQANQRSRYGLLIKAFSMTAAETARRTREFRSPPQEQINMLLQFKDGTDE
EDCPLPEEIRQDLLDFHQDLLAHCGIQLDGEEEEPEEETTLGSRLMSLLEKVRLVKKKEE
KPEEERSAEESKPRSLQELVSHMVVRWAQEDFVQSPELVRAMFSLLHRQYDGLGELLRAL
PRAYTISPSSVEDTMSLLECLGQIRSLLIVQMGPQEENLMIQSIGNIMNNKVFYQHPNLM
RALGMHETVMEVMVNVLGGGESKEIRFPKMVTSCCRFLCYFCRISRQNQRSMFDHLSYLL
ENSGIGLGMQGSTPLDVAAASVIDNNELALALQEQDLEKVVSYLAGCGLQSCPMLVAKGY
PDIGWNPCGGERYLDFLRFAVFVNGESVEENANVVVRLLIRKPECFGPALRGEGGSGLLA
AIEEAIRISEDPARDGPGIRRDRRREHFGEEPPEENRVHLGHAIMSFYAALIDLLGRCAP
EMHLIQAGKGEALRIRAILRSLVPLEDLVGIISLPLQIPTLGKDGALVQPKMSASFVPDH
KASMVLFLDRVYGIENQDFLLHVLDVGFLPDMRAAASLDTATFSTTEMALALNRYLCLAV
LPLITKCAPLFAGTEHRAIMVDSMLHTVYRLSRGRSLTKAQRDVIEDCLMSLCRYIRPSM
LQHLLRRLVFDVPILNEFAKMPLKLLTNHYERCWKYYCLPTGWANFGVTSEEELHLTRKL
FWGIFDSLAHKKYDPELYRMAMPCLCAIAGALPPDYVDASYSSKAEKKATVDAEGNFDPR
PVETLNVIIPEKLDSFINKFAEYTHEKWAFDKIQNNWSYGENIDEELKTHPMLRPYKTFS
EKDKEIYRWPIKESLKAMIAWEWTIEKAREGEEEKTEKKKTRKISQSAQTYDPREGYNPQ
PPDLSAVTLSRELQAMAEQLAENYHNTWGRKKKQELEAKGLKDMELDSSSIEKRFAFGFL
QQLLRWMDISQEFIAHLEAVVSSGRVEKSPHEQEIKFFAKILLPLINQYFTNHCLYFLST
PAKVLGSGGHASNKEKEMITSLFCKLAALVRHRVSLFGTDASAVVNCLHILARSLDARTV
MKSGPEIVKAGLRSFFESASEDIEKMVENLRLGKVSQARTQVKGVGQNLTYTTVALLPVL
TTLFQHIAQHQFGDDVILDDVQVSCYRTLCSIYSLGTTKNTYVEKLRPALGECLARLAAA
MPVAFLEPQLNEYNACSVYTTKSPRERAILGLPNSVEEMCPDIPVLERLMADIGGLAESG
ARYTEMPHVIEITLPMLCSYLPRWWERGPEAPPPALPAGAPPPCTAVTSDHLNSLLGNIL
RIIVNNLGIDEASWMKRLAVFAQPIVSRARPELLQSHFIPTIGRLRKRAGKVVSEEEQLR
LEAKAEAQEGELLVRDEFSVLCRDLYALYPLLIRYVDNNRAQWLTEPNPSAEELFRMVGE
IFIYWSKSHNFKREEQNFVVQNEINNMSFLTADNKSKMAKAGDIQSGGSDQERTKKKRRG
DRYSVQTSLIVATLKKMLPIGLNMCAPTDQDLITLAKTRYALKDTDEEVREFLHNNLHLQ
GKVEGSPSLRWQMALYRGVPGREEDADDPEKIVRRVQEVSAVLYYLDQTEHPYKSKKAVW
HKLLSKQRRRAVVACFRMTPLYNLPTHRACNMFLESYKAAWIVTEDHSFEDRMIDDLSKA
GEQEEEEEEVEEKKPDPLHQLVLHFSRTALTEKSKLDEDYLYMAYADIMAKSCHLEEGGE
NGEAEEEVEVSFEEKEMEKQRLLYQQARLHTRGAAEMVLQMISACKGETGAMVSSTLKLG
ISILNGGNAEVQQKMLDYLKDKKEVGFFQSIQALMQTCSVLDLNAFERQNKAEGLGMVNE
DGTVINRQNGEKVMADDEFTQDLFRFLQLLCEGHNNDFQNYLRTQTGNTTTINIIICTVD
YLLRLQESISDFYWYYSGKDVIEEQGKRNFSKAMSVAKQVFNSLTEYIQGPCTGNQQSLA
HSRLWDAVVGFLHVFAHMMMKLAQDSSQIELLKELLDLQKDMVVMLLSLLEGNVVNGMIA
RQMVDMLVESSSNVEMILKFFDMFLKLKDIVGSEAFQDYVTDPRGLISKKDFQKAMDSQK
QFSGPEIQFLLSCSEADENEMINCEEFANRFQEPARDIGFNVAVLLTNLSEHVPHDPRLH
NFLELAESILEYFRPYLGRIEIMGASRRIERIYFEISETNRAQWEMPQVKESKRQFIFDV
VNEGGEAEKMELFVSFCEDTIFEMQIAAQISEPEGEPETDEDQGSGAAKTGPKSPEEIPP
LGFQATAAPAGAPDPTSDEVHGEQPAGPGGDADGEGASEGAGEAAEGAGDEEEAVHEAGP
GGADGAVAVTDGGPFRPEGAGGLGDMGDTTPAEPPTPEGSPILKRKLGVDGVEEELPPEP
EPEPEPELEPEKADAENGEKEEVPEPPPEPPKKQAPPSPPPKKEEAGGEFWGELEVQRVK
FLNYLSRNFYTLRFLALFLAFAINFILLFYKVSDSPPGEDDMEGSAAGDVSGAGSGGGSG
WGLGAGEEAEGDEDENMVYYFLEESTGYMEPALRCLSLLHTLVAFLCIIGYNCLKVPLVI
FKREKELARKLEFDGLYITEQPEDDDVKGQWDRLVLNTPSFPSNYWDKFVKRKSFLYLGW
YMVMSLLGHYNNFFFAAHLLDIAMGVKTLRTILSSVTHNGKQLVMTVGLLAVVVYLYTVV
AFNFFRKFYNKSEDEDEPDMKCDDMMTCYLFHMYVGVRAGGGIGDEIEDPAGDEYELYRV
VFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVKEDMETKCFICGIGSDYFDTTPHG
FETHTLEEHNLANYMFFLMYLINKDETEHTGQESYVWKMYQERCWDFFPAGDCFRKQYED
QLS
Download sequence
Identical sequences ENSGGOP00000015110

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]