SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000016998 from Gorilla gorilla 76_3.1

Domain assignment details

Strong hits

Sequence:  ENSGGOP00000016998
Domain Number 1 Region: 215-392
Classification Level  Classification  E-value
Superfamily           MIR domain      8.63e-47
Family                MIR domain      0.0025
 
Domain Number 2 Region: 1079-1206
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  3.64e-17
Family                SPRY domain                             0.032
 
Domain Number 3 Region: 659-794
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  0.0000000142
Family                SPRY domain                             0.064
 
Domain Number 4 Region: 101-185
Classification Level  Classification  E-value
Superfamily           MIR domain      0.00000732
Family                MIR domain      0.028
 
Domain Number 5 Region: 437-550
Classification Level  Classification                              E-value
Superfamily           IP3 receptor type 1 binding core, domain 2  0.00000942
Family                IP3 receptor type 1 binding core, domain 2  0.011
 
Domain Number 6 Region: 2160-2230
Classification Level  Classification                              E-value
Superfamily           IP3 receptor type 1 binding core, domain 2  0.0000122
Family                IP3 receptor type 1 binding core, domain 2  0.018
 
Domain Number 7 Region: 4018-4096
Classification Level  Classification   E-value
Superfamily           EF-hand          0.0000146
Family                Calmodulin-like  0.051
 
Weak hits

Sequence:  ENSGGOP00000016998
Domain Number - Region: 1433-1557
Classification Level  Classification                          E-value
Superfamily           Concanavalin A-like lectins/glucanases  0.000218
Family                SPRY domain                             0.033
 
Domain Number - Region: 1895-1920,2041-2225,2506-2680
Classification Level  Classification    E-value
Superfamily           ARM repeat        0.0374
Family                Armadillo repeat  0.097
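
The Region fields above can be read programmatically. The following is a minimal sketch, not part of the SUPERFAMILY server, that assumes each region is a comma-separated list of 1-based, inclusive residue ranges (as the discontinuous ARM repeat hit suggests). The function names parse_region and region_length are hypothetical helpers introduced for illustration.

```python
def parse_region(region):
    """Split a region string such as '1895-1920,2041-2225' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def region_length(region):
    """Count residues covered by a region, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Regions copied from the strong and weak hits listed on this page.
strong_hits = {
    1: "215-392",
    2: "1079-1206",
    3: "659-794",
    4: "101-185",
    5: "437-550",
    6: "2160-2230",
    7: "4018-4096",
}
weak_hits = ["1433-1557", "1895-1920,2041-2225,2506-2680"]

if __name__ == "__main__":
    # List the strong hits in sequence order rather than by E-value rank.
    for number, region in sorted(strong_hits.items(),
                                 key=lambda item: parse_region(item[1])[0][0]):
        print(f"Domain {number}: {region} ({region_length(region)} residues)")
    for region in weak_hits:
        print(f"Weak hit: {region} ({region_length(region)} residues)")
```

Sorting by start coordinate is a design choice for readability only; the page itself lists strong hits by increasing superfamily E-value.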
 

Gene Ontology term assignment details

The 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.



Protein sequence

External link(s) Protein: ENSGGOP00000016998   Gene: ENSGGOG00000015461   Transcript: ENSGGOT00000029090
Sequence length 4905
Comment pep:known_by_projection chromosome:gorGor3.1:19:35808724:35967517:1 gene:ENSGGOG00000015461 transcript:ENSGGOT00000029090 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGDAEGEDEVQFLRTDDEVVLQCSATVLKEQLKLCLAAEGFGNRLCFLEPTSNAQNVPPD
LAICCFVLEQSLSVRALQEMLANTVEAGVESSQGGGHRTLLYGHAILLRHAHSRMYLSCL
TTSRSMTDKLAFDVGLQEDATGEACWWTMHPASKQRSEGEKVRVGDDIILVSVSSERYLH
LSTASGELQVDASFMQTLWNMNPICSRCEEGFVTGGHVLRLFHGHMDECLTISPADSDDQ
RRLVYYEGGAVCTHARSLWRLEPLRISWSGSHLRWGQPLRVRHVTTGRYLALTEDQGLVV
VDASKAHTKATSFCFRISKEKLDVAPKRDVEGMGPPEIKYGESLCFVQHVASGLWLTYAA
PDPKALRLGVLKKKAMLHQEGHMDDALSLTRCQQEESQAARMIHSTNGLYNQFIKSLDSF
SGKPRGSGPPAGTALPIEGVILSLQDLIIYFEPPSEDLQHEEKQSKLRSLRNRQSLFQEE
GMLSMVLNCIDRLNVYTTAAHFAEFAGEEAAESWKEIVNLLYELLASLIRGNRSNCALFS
TNLDWLVSKLDRLEASSGILEVLYCVLIESPEVLNIIQENHIKSIISLLDKHGRNHKVLD
VLCSLCVCNGVAVRSNQDLITENLLPGRELLLQTNLINYVTSIRPNIFVGRAEGTTQYSK
WYFEVMVDEVTPFLTAQATHLRVGWALTEGYSPYPGGGEGWGGNGVGDDLYSYGFDGLHL
WTGHVARPVTSPGQHLLAPEDVISCCLDLSVPSISFRINGCPVQGVFESFNLDGLFFPVV
SFSAGVKVRFLLGGRHGEFKFLPPPGYAPCHEAVLPRERLHLEPIKEYRREGPRGPHLVG
PSRCLSHTDFVPCPVDTVQIVLPPHLERIREKLAENIHELWALTRIEQGWTYGPVRDDNK
RLHPCLVDFHSLPEPERNYNLQMSGETLKTGIVRQKQMGISGERSERVLGTATLYQRYMM
SNGYKPAPLDLSHVRLTPAQTTLVDRLAENGHNVWARDRVGQGWSYSAVQDIPARRNPRL
VPYRLLDEATKRSNRDSLCQAVRTLLGYGYNIEPPDQEPSEVENQSRCDRVRIFRAEKSY
TVQSGRWYFEFEAVTTGEMRVGWARPELRPDVELGADELAYVFNGHRGQRWHLGSEPFGR
PWQPGDVVGCMIDLTENTIIFTLNGEVLMSDSGSETAFREIEIGDAGFLPVCSLGPGQVG
HLNLGQDVSSLRFFAICGLQEGFEPFAINMQRPVTTWFSKGLPQFEPVPLEHPHYEVSRV
DGTVDTPPCLRLTHRTWGSQNSLVEMLFLRLSLPVQFHQHFRCTAGATPLAPPGLQPPAE
DEARAAEPDPDYENLRRSAGGWGEAENGKEGTAKEGAPGGTPQAGGEAQPARAENEKDAT
TEKNKKRGFLFKAKKVAMMTQPPATPTLPRLPHDVVPADNRDDPEIILNTTTYYYSVRVF
AGQEPSCVWAGWVTPDYHQHDMSFDLSKVRVVTVTMGDEQGNVHSSLKCSNCYMVWGGDF
VSPGQQGRISHTDLVIGCLVDLATGLMTFTANGKESNTFFQVEPNTKLFPAVFVLPTHQN
VIQFELGKQKNIMPLSAAMFQSERKNPAPQCPPRLEMQMLMPVSWSRMPNHFLQVETRRA
GERLGWAVQCQEPLTMLIRLSLSVFHLCCRCMDILELSERLDLQRFHSHTLCLYRAVCAL
GNNRVAHALCSHVDQAQLLHALEDAHLPGPLRAGYYDLLISIHLESACRSRRSMLSEYIV
PLTPETRAITLFPPGRSTENGHPRHGLPGVGVTTSLRPPHHFSPPCFVAALPAAGAAEAP
ARLSPAIPLEALRDKALRMLGEAVRDGGQHARDPVGGSVEFQFVPVLKLVSTLLVMGIFG
DEDVKQILKMIEPEVFTAEGEKEEGLEEGLLQMKLPESVKLQMCHLLEYFCDQELQHRVE
SLAAFAERYVDKLQANQRSRYGLLIKAFSMTAAETARRTREFRSPPQEQINMLLQFKDGT
DEEDCPLPEEIRQDLLDFHQDLLAHCGIQLDGEEEEPEEETTLGSRLMSLLEKVRLVKKK
EEKPEEERSAEESKPRSLQELVSHMVVRWAQEDFVQSPELVRAMFSLLHRQYDGLGELLR
ALPRAYTISPSSVEDTMSLLECLGQIRSLLIVQMGPQEENLMIQSIGNIMNNKVFYQHPN
LMRALGMHETVMEVMVNVLGGGESKEIRFPKMVTSCCRFLCYFCRISRQNQRSMFDHLSY
LLENSGIGLGMQGSTPLDVAAASVIDNNELALALQEQDLEKVVSYLAGCGLQSCPMLVAK
GYPDIGWNPCGGERYLDFLRFAVFVNGESVEENANVVVRLLIRKPECFGPALRGEGGSGL
LAAIEEAIRISEDPARDGPGIRRDRRREHFGEEPPEENRVHLGHAIMSFYAALIDLLGRC
APEMHLIQAGKGEALRIRAILRSLVPLEDLVGIISLPLQIPTLGKDGALVQPKMSASFVP
DHKASMVLFLDRVYGIENQDFLLHVLDVGFLPDMRAAASLDTATFSTTEMALALNRYLCL
AVLPLITKCAPLFAGTEHRAIMVDSMLHTVYRLSRGRSLTKAQRDVIEDCLMSLCRYIRP
SMLQHLLRRLVFDVPILNEFAKMPLKLLTNHYERCWKYYCLPTGWANFGVTSEEELHLTR
KLFWGIFDSLAHKKYDPELYRMAMPCLCAIAGALPPDYVDASYSSKAEKKATVDAEGNFD
PRPVETLNVIIPEKLDSFINKFAEYTHEKWAFDKIQNNWSYGENIDEELKTHPMLRPYKT
FSEKDKEIYRWPIKESLKAMIAWEWTIEKAREGEEEKTEKKKTRKISQSAQTYDPREGYN
PQPPDLSAVTLSRELQAMAEQLAENYHNTWGRKKKQELEAKGEGAHAGGTKEERRVEKGR
ARGTEETVGPASGLILLSTSPCRGLKDMELDSSSIEKRFAFGFLQQLLRWMDISQEFIAH
LEAVVSSGRVEKSPHEQEIKFFAKILLPLINQYFTNHCLYFLSTPAKVLGSGGHASNKEK
EMITSLFCKLAALVRHRVSLFGTDASAVVNCLHILARSLDARTVMKSGPEIVKAGLRSFF
ESASEDIEKMVENLRLGKVSQARTQVKGVGQNLTYTTVALLPVLTTLFQHIAQHQFGDDV
ILDDVQVSCYRTLCSIYSLGTTKNTYVEKLRPALGECLARLAAAMPVAFLEPQLNEYNAC
SVYTTKSPRERAILGLPNSVEEMCPDIPVLERLMADIGGLAESGARYTEMPHVIEITLPM
LCSYLPRWWERGPEAPPPALPAGAPPPCTAVTSDHLNSLLGNILRIIVNNLGIDEASWMK
RLAVFAQPIVSRARPELLQSHFIPTIGRLRKRAGKVVSEEEQLRLEAKAEAQEGELLVRD
EFSVLCRDLYALYPLLIRYVDNNRAQWLTEPNPSAEELFRMVGEIFIYWSKSHNFKREEQ
NFVVQNEINNMSFLTADNKSKMAKAGDIQSGGSDQERTKKKRRGDRYSVQTSLIVATLKK
MLPIGLNMCAPTDQDLITLAKTRYALKDTDEEVREFLHNNLHLQGKVEGSPSLRWQMALY
RGVPGREEDADDPEKIVRRVQEVSAVLYYLDQTEHPYKSKKAVWHKLLSKQRRRAVVACF
RMTPLYNLPTHRACNMFLESYKAAWIVTEDHSFEDRMIDDLSTSRGRPSTSRGIEMPAPD
DPLHQLVLHFSRTALTEKSKLDEDYLYMAYADIMAKSCHLEEGGENGEAEEEVEVSFEEK
EMEKQRLLYQQARLHTRGAAEMVLQMISACKGETGAMVSSTLKLGISILNGGNAEVQQKM
LDYLKDKKEVGFFQSIQALMQTCSVLDLNAFERQNKAEGLGMVNEDGTVINRQNGEKVMA
DDEFTQDLFRFLQLLCEGHNNDFQNYLRTQTGNTTTINIIICTVDYLLRLQESISDFYWY
YSGKDVIEEQGKRNFSKAMSVAKQVFNSLTEYIQGPCTGNQQSLAHSRLWDAVVGFLHVF
AHMMMKLAQDSSQIELLKELLDLQKDMVVMLLSLLEGNVVNGMIARQMVDMLVESSSNVE
MILKFFDMFLKLKDIVGSEAFQDYVTDPRGLISKKDFQKAMDSQKQFSGPEIQFLLSCSE
ADENEMINCEEFANRFQEPARDIGFNVAVLLTNLSEHVPHDPRLHNFLELAESILEYFRP
YLGRIEIMGASRRIERIYFEISETNRAQWEMPQVKESKRQFIFDVVNEGGEAEKMELFVS
FCEDTIFEMQIAAQISEPEGEPETDEDQGSGAAKTRPKKPRRNPPSGLPSHCGPGRPDPT
SDEVHGEQPAGPGGDADGEGASEGAGEAAEGAGDEEEAVHVAVTDGGPFRPEGAGGLGDM
GDTTPAEPPTPEGSPILKRKLGVDGVEEELPPEPEPEPEPELEPEKADAENGEKEEVPEP
PPEPPKKQAPPSPPPKKEEAGGEFWGELEVQRVKFLNYLSRNFYTLRFLALFLAFAINFI
LLFYKVSDSPPGEDDMEGSAAGDVSGAGSGGGSGWGLGAGEEAEGDEDENMVYYFLEEST
GYMEPALRCLSLLHTLVAFLCIIGYNCLKVPLVIFKREKELARKLEFDGLYITEQPEDDD
VKGQWDRLVLNTPSFPSNYWDKFVKRKKKLAKRGGSHLPAGCLLTRNALILHVLPKQSWH
PTPRAQLTLSSGPYPPECSSCVPAFPLTPGPVCPQSFLYLGWYMVMSLLGHYNNFFFAAH
LLDIAMGVKTLRTILSSVTHNGKQLVMTVGLLAVVVYLYTVVAFNFFRKFYNKSEDEDEP
DMKCDDMMTCYLFHMYVGVRAGGGIGDEIEDPAGDEYELYRVVFDITFFFFVIVILLAII
QGLIIDAFGELRDQQEQVKEDMETKCFICGIGSDYFDTTPHGFETHTLEEHNLANYMFFL
MYLINKDETEHTGQESYVWKMYQERCWDFFPAGDCFRKQYEDQLS
Identical sequences ENSGGOP00000016998 ENSGGOP00000015110
