SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000023745 from Gorilla gorilla 76_3.1


Domain assignment details

Strong hits

Sequence:  ENSGGOP00000023745
Domain Number 1 Region: 129-308
Classification Level Classification E-value
Superfamily MIR domain 1.13e-49
Family MIR domain 0.0017
 
Domain Number 2 Region: 992-1081
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.31e-17
Family SPRY domain 0.031
 
Domain Number 3 Region: 3913-3991
Classification Level Classification E-value
Superfamily EF-hand 0.000000000748
Family Calmodulin-like 0.051
 
Domain Number 4 Region: 570-718
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000021
Family SPRY domain 0.052
 
Domain Number 5 Region: 13-94
Classification Level Classification E-value
Superfamily MIR domain 0.00000366
Family MIR domain 0.021
 
Domain Number 6 Region: 335-462
Classification Level Classification E-value
Superfamily IP3 receptor type 1 binding core, domain 2 0.0000209
Family IP3 receptor type 1 binding core, domain 2 0.02
 
Domain Number 7 Region: 2065-2130
Classification Level Classification E-value
Superfamily IP3 receptor type 1 binding core, domain 2 0.0000968
Family IP3 receptor type 1 binding core, domain 2 0.023
 
Weak hits

Sequence:  ENSGGOP00000023745
Domain Number - Region: 1328-1450
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.000475
Family SPRY domain 0.034
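
The hits above are listed in order of superfamily E-value rather than position along the protein. As a minimal sketch, the following Python snippet re-orders them by start residue and counts how many residues they cover. The region coordinates and E-values are copied from the tables above; the tuple layout and the function names `by_position` and `covered_residues` are illustrative assumptions, not part of SUPERFAMILY, and the regions are assumed to be 1-based and inclusive.

```python
# Hits copied from the assignment tables above:
# (label, start, end, superfamily, superfamily E-value)
HITS = [
    ("1", 129, 308, "MIR domain", 1.13e-49),
    ("2", 992, 1081, "Concanavalin A-like lectins/glucanases", 1.31e-17),
    ("3", 3913, 3991, "EF-hand", 7.48e-10),
    ("4", 570, 718, "Concanavalin A-like lectins/glucanases", 2.1e-9),
    ("5", 13, 94, "MIR domain", 3.66e-6),
    ("6", 335, 462, "IP3 receptor type 1 binding core, domain 2", 2.09e-5),
    ("7", 2065, 2130, "IP3 receptor type 1 binding core, domain 2", 9.68e-5),
    ("weak", 1328, 1450, "Concanavalin A-like lectins/glucanases", 4.75e-4),
]


def by_position(hits):
    """Return the hits sorted by start residue (N- to C-terminus)."""
    return sorted(hits, key=lambda hit: hit[1])


def covered_residues(hits):
    """Count residues inside the regions, assuming inclusive coordinates
    and non-overlapping regions (true for the hits listed here)."""
    return sum(end - start + 1 for _, start, end, _, _ in hits)


if __name__ == "__main__":
    for label, start, end, superfamily, evalue in by_position(HITS):
        print(f"{start:>5}-{end:<5} {superfamily} (hit {label}, E={evalue:.3g})")
    print("Residues covered by listed hits:", covered_residues(HITS))
```

Sorting by position makes the N-terminal arrangement easier to read: two MIR domain regions, followed by an IP3 receptor binding core region and then the SPRY-family regions.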
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor



Protein sequence

External link(s) Protein: ENSGGOP00000023745   Gene: ENSGGOG00000012582   Transcript: ENSGGOT00000032329
Sequence length 4871
Comment pep:known_by_projection chromosome:gorGor3.1:1:217678920:218153888:1 gene:ENSGGOG00000012582 transcript:ENSGGOT00000032329 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LKTAQGGGHRTLLYGHAILLRHSYSGMYLCCLSTSRSSTDKLAFDVGLQEDTTGEACWWT
IHPASKQRSEGEKVRVGDDLILVSVSSERYLHLSYGNGSLHVDAAFQQTLWSVAPISSGS
EAAQGYLIGGDVLRLLHGHMDECLTVPSGEHGEEQRRNRTVHYEGGAVSVHARSLWRLET
LRVAWSGSHIRWGQPFRLRHVTTGKYLSLMEDKNLLLMDKEKADVKSTAFTFRSSKEKLD
VGVRKEVDGMGTSEIKYGDSVCYIQHVDTGLWLTYQSVDVKSVRMGSIQRKAIMHHEGHM
DDGISLSRSQHEESRTARVIRSTVFLFNRFIRGLDALSKKAKASTVDLPIESVSLSLQDL
IGYFHPPDEHLEHEDKQNRLRALKNRQNLFQEEGMINLVLECIDRLHVYSSAAHFADVAG
REAGESWKSILNSLYELLAALIRGNRKNCAQFSGSLDWLISRLERLEASSGILEVLHCVL
VESPEALNIIKEGHIKSIISLLDKHGRNHKLLIMINNFCICCIFSVHSEPHAPCDTKLSL
VKLAFSVLYLSGMRPNIFLGVSEGSAQYKKWYYELMVDHTEPFVTAEATHLRVGWASTEG
YSPYPGGGEEWGGNGVGDDLFSYGFDGLHLWSGCIARTVSSPNQHLLRTDDVISCCLDLS
APSISFRINGQPVQGMFENFNIDGLFFPVVSFSAGIKVRFLLGGRHGEFKFLPPPGYAPC
YEAVLPKEKLKVEHSREYKQERTYTRDLLGPTVSLTQAAFTPIPVDTSQIVLPPHLERIR
EKLAENIHELWVMNKIELGWQYGPVRDDNKRQHPCLVEFSKLPEQERNYNLQMSLETLKV
LYPQGCRLSCTEIHMEGKIKKKILYKFYQLTSGYKPAPMDLSFIKLTPSQEAMVDKLAEN
AHNVWARDRIRQGWTYGIQQDVKNRRNPRLVPYTLLDDRTKKSNKDSLREAVRTLLGYGY
NLEAPDQDHAARAEVCSGTGERFRIFRAEKTYAVKAGRWYFEFETVTAGDMRVGWSRPGC
QPDQELGSDERAFAFDGFKAQRWHQGNEHYGRSWQAGDVVGCMVDMNEHTMMFTLNGEIL
LDDSGSELAFKDFDVGDEFLSLLTIGLITIQNQDWGSPGKEVAVFALVCFLDEAEKQSGI
NGEKLFTMCTTEKNVSGKEVPKDFEELTISLIKMSAVDTYLVLSISSFSEEGRNQDCEAN
FYSRGIQVHQRQVFFCLFVFYLPSQTFFKKKHFLTLYDVDSGIQCLLKRFLSHWMVVRVH
QDETSGSASLKMRKKYASFYLWRYLHKYIPLKGTKTVISVCHTLNILDESISNMQKSYSF
DYMLLHTNWYYYSVRIFPGQEPANVWVGWITSDFHQYDTGFDLDRVRTVTVTLGDEKGKV
HESIKRSNCYMVCAGESMSPGQGRNNNGLEIGCVVDAASGLLTFIANGKELSTYYQVEPS
TKLFPAVFAQATSPNVFQFELGRIKNVMPLSAGLFKSEHKNPVPQCPPRLHVQFLSHVLW
SRMPNQFLKVDVSRISERQGWLVQCLDPLQFMSLHIPEENRSVDILELTEQEELLKFHYH
TLRLYSAVCALGNHRVAHALCSHVDEPQLLYAIENKYMPGLLRAGYYDLLIDIHLSSYAT
ARLMMNNEYIVPMTEETKSITLFPDENKKHGLPGIGLSTSLRPRMQFSSPSFVSISNECY
QYSPEFPLDILKSKTIQMLTEAVKEGSLHARDPVGGTTEFLFVPLIKLFYTLLIMGIFHN
EDLKHILQLIEPSVFKEAATPEEESDTLEKELSVDDAKLQGAGEEEAKGGKRPKEGLLQM
KLPEPVKLQMCLLLQYLCDCQVRHRIEAIVAFSDDFVAKLQDNQRFRYNEVMQALNMSAA
LTARKTKEFRSPPQEQINMLLNFKDDKSECPCPEEIRDQLLDFHEDLMTHCVGIELDEDG
SLDGNSDLTIRGRLLSLVEKVTYLKKKQAEKPVESDSKKSSTLQQLISETMVRWAQESVI
EDPELVRAMFVLLHRQYDGIGGLVRALPKTYTINGVSVEDTINLLASLGQIRSLLSVRMG
KEEEKLMIRGLGDIMNNKVFYQHPNLMRALGMHETVMEVMVNVLGGGESKEITFPKMVAN
CCRFLCYFCRISRQNQKAMFDHLSYLLENSSVGLASPAMRGSTPLDVAAASVMDNNELAL
ALREPDLEKVVRYLAGCGLQSCQMLVSKGYPDIGWNPVEGERYLDFLRFAVFCNGESVEE
NANVVVRLLIRRPECFGPALRGEGGNGLLAAMEEAIKIAEDPSRDGPSPNSGSSKTLDTE
EEEDDTIHMGNAIMTFYSALIDLLGRCAPEMHLIHAGKGEAIRIRSILRSLIPLGDLVGV
ISIAFQMPTIAKDGNVVEPDMSAGFCPDHKAAMVLFLDRVYGIEVQDFLLHLLEVGFLPD
LRAAASLDTAALSATDMALALNRYLCTAVLPLLTRCAPLFAGTEHHASLIDSLLHTVYRL
SKGCSLTKAQRDSIEVCLLSICGQLRPSMMQHLLRRLVFDVPLLNEHAKMPLKLLTNHYE
RCWKYYCLPGGWGNFGAASEEELHLSRKLFWGIFDALSQKKYEQELFKLALPCLSAVAGA
LPPDYMESNYVSMMEKQSSMDSEGNFNPQPVDTSNITIPEKLEYFINKYAEHSHDKWSMD
KLANGWIYGEIYSDSSKVQPLMKPYKLLSEKEKEIYRWPIKESLKTMLAWGWRIERTREG
DSMALYNRTRRISQTSQVSVDAAHGYSPRAIDMSNVTLSRDLHAMAEMMAENYHNIWAKK
KKMELESKGGGNHPLLVPYDTLTAKEKAKDREKAQDILKFLQINGYAVSSCSRGFKDLEL
DTPSIEKRFAYSFLQQLIRYVDEAHQYILEFDGGSRGKGEHFPYEQEIKFFAKVVLPLID
QYFKNHRLYFLSAASRPLCSGGHASNKEKEMVTSLFCKLGVLVRHRISLFGNDATSIVNC
LHILGQTLDARTVMKTGLESVKSALRAFLDNAAEDLEKTMENLKQGQFTHTRNQPKGVTQ
IINYTTVALLPMLSSLFEHIGQHQFGEDLILEDVQVSCYRILTSLYALGTSKSIYVERQR
SALGECLAAFAGAFPVAFLETHLDKHNIYSIYNTKSSRERADMKLKLDIYHIVQYISNLE
QLLRNMALQILAYSRKLKVQQCFMPWKYSYYGEKSHFKNGPMKFRTIINNCGNFISCVKM
ISKMSESLFFFHSKLAIYSKNKEDRYTVFSQPIINKVKPQLLKTHFLPLMEKLKKKAATV
VSEEDHLKAEARGDMSEAELLILDEFTTLARDLYAFYPLLIRFVDYNRAKWLKEPNPEAE
ELFRMVAEVFIYWSKSHNFKREEQNFVVQNEINNMSFLITDTKSKMSKAAVSDQERKKMK
RKGDRYSMQTSLIVAALKRLLPIGLNICAPGDQELIALAKNRFSLKDTEDEVRDIIRSNI
HLQGKRLSNFGKNIYHQLKYRLDAWSDQQKTCDAVLKVKSLIFHQETASRPVGRRHYCLV
EHPQRSKKAVWHKLLSKQRKRAVVACFRMAPLYNLPRHRAVNLFLQGYEKSWIETEEHYF
EDKLIEDLAKPGAEPPEEDEGTKRVDPLHQLILLFSRTALTEKCKLEEDFLYMAYADIMA
KSCHDEEDDDGEEEVKSFEEKEMEKQKLLYQQARLHDRGAAEMVLQTISASKGETGPMVA
ATLKLGIAILNGGNSTVQQKMLDYLKEKKDVGFFQSLAGLMQSCSVLDLNAFERQNKAEG
LGMVTEEGSGEKVLQDDEFTCDLFRFLQLLCEGHNSDFQNYLRTQTGNNTTVNIIISTVD
YLLRVQESISDFYWYYSGKDVIDEQGQRNFSKAIQVAKQVFNTLTEYIQGPCTGNQQSLA
HSRLWDAVVGFLHVFAHMQMKLSQDSSQIELLKELMDLQKDMVVMLLSMLEGNVVNGTIG
KQMVDMLVESSNNVEMILKFFDMFLKLKDLTSSDTFKEYDPDGKGVISKRDFHKAMESHK
HYTQSETEFLLSCAETDENETLDYEEFVKRFHEPAKDIGFNVAVLLTNLSEHMPNDTRLQ
TFLELAESVLNYFQPFLGRIEIMGSAKRIERVYFEISESSRTQWEKPQVKESKRQFIFDV
VNEGGEKEKMELFVNFCEDTIFEMQLAAQISESDLNERSANKEESEKERPEEQGPRMAFF
SILTVKSALFALRYNILTLMRMLSLKSLKKQMKKVKKMTVKDMVTAFFSSYWSIFMTLLH
FVASVFRGFFRIICSLLLGGSLVEGAKKIKVAELLANMPDPTQDEVRGDGEEGERKPLEA
ALPSEDLTDLKELTEESDLLSDIFGLDLKREGGQYKLIPHNPNAGLSDLMSNPVPMPEVQ
EKFQVLKIKKDISEMRKETKSEPEKAEGEDGEKEEKAKEDKGKQKLRQLHTHRYGEPEVP
ESAFWKKIIAYQQKLLNYFARNFYNMRMLALFVAFAINFILLFYKVSTSSVVEGKELPTR
SSSENAKVTSLDSSSHRIIAVHYVLEESSGYMEPTLRILAILHTVISFFCIIGYYCLKVP
LVIFKREKEVARKLEFDGLYITEQPSEDDIKGQWDRLVINTQSFPNNYWDKFVKRKVMDK
YGEFYGRDRISELLGMDKAALDFSDAREKKKPKKDSSLSAVLNSIDVKYQMWKLGVVFTD
NSFLYLAWYMTMSVLGHYNNFFFAAHLLDIAMGFKTLRTILSSVTHNGKQLVLTVGLLAV
VVYLYTVVAFNFFRKFYNKSEDGDTPDMKCDDMLTCYMFHMYVGVRAGGGIGDEIEDPAG
DEYEIYRIIFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVKEDMETKCFICGIGND
YFDTVPHGFETHTLQEHNLANYLFFLMYLINKDETEHTGQESYVWKMYQERCWEFFPAGD
CFRKQYEDQLN
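
The Comment line above packs several fields into space-separated `key:value` tokens. As a rough sketch, they can be split apart as shown below. The helper names `parse_comment` and `parse_location` are hypothetical, and reading the chromosome token as `assembly:name:start:end:strand` is an assumption based on the usual Ensembl FASTA header layout.

```python
# Comment line copied verbatim from the record above.
COMMENT = (
    "pep:known_by_projection "
    "chromosome:gorGor3.1:1:217678920:218153888:1 "
    "gene:ENSGGOG00000012582 "
    "transcript:ENSGGOT00000032329 "
    "gene_biotype:protein_coding "
    "transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split 'key:value' tokens on the first colon only, so that values
    which themselves contain colons (the location) stay intact."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Interpret a location value as assembly:name:start:end:strand
    (assumed layout, see the note above)."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    location = parse_location(fields["chromosome"])
    print(fields["gene"], fields["transcript"], fields["pep"])
    print(location)
    # Genomic span of the locus, counting both end coordinates.
    print(location["end"] - location["start"] + 1)
```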
Identical sequences ENSGGOP00000023745
