SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGACP00000020213 from Gasterosteus aculeatus 69_1


Domain assignment details

Strong hits

Sequence:  ENSGACP00000020213
Domain Number 1 Region: 204-381
Classification Level Classification E-value
Superfamily MIR domain 4.45e-48
Family MIR domain 0.0017
 
Domain Number 2 Region: 1063-1154
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.19e-16
Family SPRY domain 0.061
 
Domain Number 3 Region: 1385-1505
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000554
Family SPRY domain 0.027
 
Domain Number 4 Region: 638-794
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.00000119
Family SPRY domain 0.065
 
Domain Number 5 Region: 3971-4055
Classification Level Classification E-value
Superfamily EF-hand 0.00000121
Family Calmodulin-like 0.05
 
Domain Number 6 Region: 88-169
Classification Level Classification E-value
Superfamily MIR domain 0.00000445
Family MIR domain 0.022
 
Weak hits

Sequence:  ENSGACP00000020213
Domain Number - Region: 414-535
Classification Level Classification E-value
Superfamily IP3 receptor type 1 binding core, domain 2 0.00017
Family IP3 receptor type 1 binding core, domain 2 0.021
 
Domain Number - Region: 2032-2181,2474-2551,2594-2661,2960-3008,3083-3248
Classification Level Classification E-value
Superfamily ARM repeat 0.00121
Family Armadillo repeat 0.045
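
The hits above are listed by E-value rather than by position along the protein. As a rough sketch of how the assignments could be re-ordered and summarised, the Python below transcribes the single-segment hits by hand. The `Hit` class and the helper names are hypothetical and not part of SUPERFAMILY; the multi-segment ARM repeat weak hit is left out to keep the example short.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One domain assignment (hypothetical container, not a SUPERFAMILY type)."""
    superfamily: str
    start: int      # first residue of the region, 1-based inclusive
    end: int        # last residue of the region, 1-based inclusive
    evalue: float   # superfamily-level E-value
    strong: bool    # True for "Strong hits", False for "Weak hits"


# Values copied from the record above (single-segment hits only).
HITS = [
    Hit("MIR domain", 204, 381, 4.45e-48, True),
    Hit("Concanavalin A-like lectins/glucanases", 1063, 1154, 1.19e-16, True),
    Hit("Concanavalin A-like lectins/glucanases", 1385, 1505, 5.54e-8, True),
    Hit("Concanavalin A-like lectins/glucanases", 638, 794, 1.19e-6, True),
    Hit("EF-hand", 3971, 4055, 1.21e-6, True),
    Hit("MIR domain", 88, 169, 4.45e-6, True),
    Hit("IP3 receptor type 1 binding core, domain 2", 414, 535, 1.7e-4, False),
]

SEQUENCE_LENGTH = 4945  # "Sequence length" reported for this protein


def by_position(hits):
    """Return the hits ordered from N-terminus to C-terminus."""
    return sorted(hits, key=lambda h: h.start)


def covered_residues(hits):
    """Count residues inside the given regions (assumes they do not overlap)."""
    return sum(h.end - h.start + 1 for h in hits)


if __name__ == "__main__":
    for hit in by_position(HITS):
        label = "strong" if hit.strong else "weak"
        print(f"{hit.start:>5}-{hit.end:<5} {label:<6} {hit.superfamily}")
    strong = [h for h in HITS if h.strong]
    print(covered_residues(strong), "of", SEQUENCE_LENGTH, "residues in strong hits")
```

Read in positional order, the N-terminal part of the protein carries two MIR domains followed by the weak IP3 receptor binding core hit, then three SPRY-family regions, with the EF-hand region much further along the sequence.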
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Protein sequence

External link(s) Protein: ENSGACP00000020213   Gene: ENSGACG00000015320   Transcript: ENSGACT00000020252
Sequence length 4945
Comment pep:novel scaffold:BROADS1:scaffold_48:1076325:1158405:-1 gene:ENSGACG00000015320 transcript:ENSGACT00000020252 gene_biotype:protein_coding transcript_biotype:protein_coding
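
The comment line looks like an Ensembl-style FASTA description: a sequence type and status, a colon-separated genomic location, then `key:value` pairs. A minimal sketch for splitting it into fields is shown below. It assumes that layout (type:status, then coordinate system, assembly, region name, start, end, strand); the function name `parse_comment` is made up for this example.

```python
def parse_comment(comment: str) -> dict:
    """Split an Ensembl-style FASTA description into named fields.

    Assumed layout: '<type>:<status> <coord_system>:<assembly>:<name>:
    <start>:<end>:<strand>' followed by whitespace-separated key:value pairs.
    """
    tokens = comment.split()
    seq_type, status = tokens[0].split(":", 1)
    coord_system, assembly, name, start, end, strand = tokens[1].split(":")
    fields = {
        "seq_type": seq_type,
        "status": status,
        "coord_system": coord_system,
        "assembly": assembly,
        "region": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }
    # Remaining tokens are simple key:value annotations (gene, transcript, ...).
    for token in tokens[2:]:
        key, value = token.split(":", 1)
        fields[key] = value
    return fields


COMMENT = (
    "pep:novel scaffold:BROADS1:scaffold_48:1076325:1158405:-1 "
    "gene:ENSGACG00000015320 transcript:ENSGACT00000020252 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)

if __name__ == "__main__":
    info = parse_comment(COMMENT)
    span = info["end"] - info["start"] + 1  # genomic span in base pairs
    print(info["region"], info["strand"], span)
```

Under these assumptions the record places the gene on the reverse strand of scaffold_48 in the BROADS1 assembly, spanning positions 1076325 to 1158405.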
Sequence
DDEVVLQCSATIHKEQQKLCLAAEGFGNRLCFLESISNSKNVPPDLSICTFVLEQSLSVR
ALQEMLANTEEKAEGVRTAQGGGHRTLLYGHAVLLRHSYSGMYLCCLSTSRSSTDKLAFD
VGLQEDTTGEACWWTIHPASKQRSEGEKVRVGDDLILVSVSSERYLHLSYGNSSLHVDAA
FQQTLWSVAPICSGSEVAQGFLIGGDVLRLLHGHMDECLTVPSGEHGDEQRRTVHYEGGA
VSSHARSLWRLETLRVVWSGSHIRWGQQFRLRHVTTGKYLSLVEDKSLLLLDKENADVKS
TAFCFRSSKEKLDPGGRKEVDGMGVPDIKYGDSVCYVQHVDTCLWLTYQTIDAKCARMGG
VQRKAIMHHEGHMDDGLTLSRSQHEESRTARVIRSTVFLFNVFIRGLDTLRRKGKSSTLV
LPIDSVSLSLQDLIGYFHPPGDHLEHEDKQNRLRALKNRQNLFQEEGMISLVQECIDRLH
VYSSAAHFAEAVGREGGETWSSILNSLYQLLAALIRGNRKNCAQFSGSLDWLISRLERLE
ASSGILEVLHCVLVESPEALNIIKEGHIKSIISLLDKHGRNYKVLDVLCSLCVCHGVAVR
SNQHLICDNLLPGRELLLQTRLINHVSSMRPNIFLGVSNGSAQYRKWYYELIVDQALPFV
TAEASHLRVGWANTSGYAPYPSGGEGWGGNGVGDDLYSYGFDGLHLWSGCIARTVSSPNQ
HLLRAEDVVSCCLDLSVPSISFRINGQPVQGMFENFNSVGLFFPVASFSAGVKVRFLLGG
RHGEFKFLPPPGYAPCYEAVLPREKLRLEAGQDQTAAKDLLGPTITLSQAAFTPTPVDTS
QIVLPPHLERIREKLAENIHELWVMNKIELGWTYGAVRDDNKRQHPCLVEFSKLPEQERS
YNLQMSLETLKTLLALGCHVGLADEHALEKVKSMKLSPTYELSGGYKPAPLDLGHIKLTS
TQEAMVDKLAENAHNVWARDRVRQGWTYGIQQDVKNRRNPRLVPYILLDERTKKSNKDSL
REAVRTLLGHGYNLEAPDQDHAAQSEPDSVSLERFRIFRAEKTYCVNAGKWYFELKVLSA
GEMRVGWARPGCLPDLELGSDDQAFVFDGFKVQRWHQGNEHFGRAWQTGDVVGCMVDLNE
HTMMFTLNGEVLLDDSGSELAFKDFEVGEGFIPVSSLGVCQVGRMNFGKDVSTLKYFTIC
GLQEGYEPFAVNMNRDVTMWLSKRLPQFVPVPPNHPHMEVTRIDGTVESCPCLKVTQRSF
GSQNSYTDITFYRLSMPIECAESFSRSTANNSIFSPKRELEDFETVSDFEVLMKSSHGPN
GSNGPDRDEFNNHKDYNQEKPSKIKQRFTLKKNKTDISCPTSVRLSEEVMADDRDEYEFL
TQASTHYYSVRIFPGQEPSNVWVGWVTSDFHQYDPGFELHNVRTVTVTLGDERGKVHESI
KHSNCYMVWAGESSSPGQGRSNGLEIGCLVDTTNGLLTFTANGKELSTYYQVEPSTKLFP
AVFAQATSPNVFQFELGRIKNVMPLSAGLFKSERRNPVPQCPPRLHVQFLTPVLWSRVPN
HFLKVSASRVNDRHGWLVQCNKPLQFMSLHIPEENKSVDILELSEQSDLLKFHYHTLRLY
SAICALGNNRVAHALCSHVDEAQLLQAIENKYMPGLLRAGYYDLLIDIHLSSYTTARLMM
NSEYIVPMTDETKSITLFPDERKKHGLPGIGLSTSLRPRMHFSSPCFVCVNGVSHRNTGS
GDCFQYSPEFPLDVLKMKTIEMLTEAVREGSMHVRDPIGGSTEFLFVPLIKLFYTLLIMG
VFQSGDLKNILRLIEPSVFSERREGQEDLPTEPEALKEGGRSVPKEGLLQMKLPEPVKLQ
MCHVLQYLCDCQVRHRIEAVVAFSDDFVACLQDNQRFRYNEVMLALNMSAALTAKKTKEF
RSPPQEQINMLLNFKDEKQECPCPEYIREQLLDFHEDLMRHCGIELDEEGGIEGDSDFTI
RGRLMSLVEKVAYLKKKMGNPTKEKEDKKPSTLQQLISDTMVRWAQESVIEDPELVRAMF
VLLHRQYDGIGGQVRALPKTYTINSVSIEDTITLLAALGQIRSLLSVRMGREEEKLMIRG
LGDIMNNKVFYQHPNLMRALGMHETVMEVMVNVLSGGDSKEITFPKMVANCCRFLCYFCR
ISRQNQKAMFDHLSYLLENSSVGLASPSMRGSTPLDVAAASVMDNNELALALREPDLEKV
VQYLAGCGLQSCVMLVRNGYPDIGWNPVEGERYLDFLRFAVFCNGESVEENANVVVRLLI
RRPECFGPALRGEGGDGLLAAMEEAIRISQDLSRDGPSPTSESSKTLDMLEEEEDDTIHM
GDAIMTFYAALIDLLGRCAPEMHLIHAGKGEAIRIRAILRSLIPIDDLVGVISIPFSMPN
LAKDGLVVEPDMSAGFCPDHKAAMVLFLDRVYGIEDQNFLLHLLEVGFLPDLRAAASLDT
VALSATDMALALNRYLCTAALPLLTKCAPLFAGTEPFASLIDSLLHTVYRLSKGCCLTKA
QRDAIEECLLAVCGKLRPSMMQHLLRRLVFDVPLLNEHTKMPLKLLTNHYERCWKYYCLA
GGWGSFGAASDEELHLSRKLFWGIFDALSRKARRYDQELFKMALPCLSAVAGALPPDYME
SNYMAMMEKQSSMDSEGNFTPQPADTTNVIVPEKLDCFISRYAEHSHEKWCIGKFSNGWS
FGEQICEISKSHPLLKPYKGISEKEKEAYCWPVRASLKTMLAWGWSIDRIREGDAASLHN
KSRRISQASQQSLEGAPAFSPRPIDMSNVTLSRDMQAMAELLAENYHNIWARSKKMELEA
KGGGNHPLLVPYDTLTAKEKSKDRDKAQDILKFLQINGYTVSRGVKSQELDTPAIEKRFA
YTFLQQLITYVDQAHQHMMEFDLGTGPKGEKIPHEQQIKFFGKVVLPLVDQYFKNHQIYF
LSTAIHPISSGGHASNKEKEMVTSLFCKLGVLVRHRISLFGNNATSIVNCLQILGQSLDA
RTVMKTGLESVKAALRLFFVSAAEDLEKTQENLKLGQFTHSREQPRGVTQIINYTTFALL
PVLSSLFEHIGLNLFGEDLILEDVQVSCYRILNSLYFLGTNKSIYVERQRPAVGKCLAAF
SAAFPVAFLEPHIDKFNSFSIYNGKGTKDRAALGLPGHVGEVCPLIPNLEKCLEEIVELA
ESGMRYTQMPHVMEVVLPMLCSYMSHWWEHGPESNPDRADSCCTSVTSEHMNTLLGNILK
IIYNNLGIDEGAWMKRLAVFSQPIISKAKTQLLKTHFLPLMEKLKKKAAVVLMDEEHSKA
EGRGEMSETELLILDQFTVLVRDLYAFYPLLIRFVDYNRARWLRESNPGAEKLFRMVAEV
FIFWAKSHNFKREEQNFVVQNEINNMSFLITDTKCKMSKGIMSDQERKKMKRKGDRYTMQ
TSLIVATLKRLLPVGLNICAPGEHELIALAKNRFTQKDTEGEVREIIKNNLHLQGKLEDP
AIRWQMALYRDLPNHYEDTSDPVKTVERVLEIAHVLFHLDQVSKSTGSGPVHLISSAVEH
PQRSKKAVWHKLLSKQRKRAVVACFRMAPLYNLPRHRAVNLFLQGYEKSWIEAEEHYFED
KLIEDLAKPGDLEPLEEEEGLKHIDPLHQLIQLFSRTALTEKCKLDEDILYMAYADIMSK
SCHDEEEEDGEEVKSFEEKEMEKQKLLYQQARLHDRGAAEMVLQTISASKGEMGPMVAST
LKLGIAILNGGNSTVQQKMLDYLKDKKDVGFFQGLAGLMQSCSVLDLNAFERQNKAEGLG
MVTEEGSGEKVMQDDEFTCDLFRFLQLLCEGHNSDFQNYLRTQTGNNTTVNIIISTVDYL
LRVQESISDFYWYYSGKDVIDEQGQRNFSKAINVAKQVFNTLTEYIQGPCTGNQQSLAHS
RLWDAVVGFLHVFAHMQMKLSQDSSQIELLKELMDLQKDMVVMLLSMLEGNVVNGTIGKQ
MVDMLVESSNNVEMILKFFDMFLKLKDLTSSDAFKEYDPDGKGKAPNHSVISKRDFHKAM
ESHKHYTQSETDFLLSCAETDENELLDYEEFVERFHEPAKDIGFNVAVLLTNLSEHMPHD
TRLQTFLELADSVLNYFQPYLGRIEIMGSAKRIERVYFEISESSRTQWEKPQVKESKRQF
IFDVVNEGGEKEKMELFVNFCEDTIFEMQLAARMSDAGERSAVKEESEREKPDEENPEMG
FFSVTTVRMALLALQYNVVLLLKVLSMKTLKKQMKKIKNMTVKDMVTTLVSFYCSVLLGL
LHVAFSVARGFCRIFHSSFMGDNLVEGAKSIKVSELLASMPDPTQDEVRGEGEDRDKRPS
AKEDLADLAVNASETELLSDIFGLDLRREGGQYKITPHNPNASLTQLLNSPVPSSTPPTP
PTDTPPELRRRHPSQSSSSEEKAATAETECEPEKSPEAGRVEKPEKQQKNEKTKPKVRRH
HTSKSDEPDLQESAFLKKIIAYQRKLLNYFARNFYNMRMLALFVAFAINFILLFYKVSTS
SSVVEEMEVTYTSSQPDNRVHGEPLKPVSVRFVLEESTGYMEPMLRILAVLHTVISFFCI
IGYYCLKAGLATVPLVIFKREKEVARKLEFDGLYITEQPSEDDIKGQWDRLVINTQSFPN
NYWDKFVKRKVMDKYGEFYGHDRISELLGMDKAALDFSDSHKKRKPRRDSSLAAVLNSID
VKYQIWKLGVVFTDNSFLYLAWYMTMSILGHYNNFFFAAHLLDIAMGFKTLRTILSSVTH
NGKQLVLTVGLLAVVVYLYTVVAFNFFRKFYNKGEDGELPDMKCDDMLTCYMFHMYVGVR
AGGGIGDQIEDPAGDEYEIYRIIFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVKE
DMETKCFICGIGNDYFDTVPHGFETHTLQEHNLANYLFFVMYLINKDETEHTGQESYVWK
MYQERCWEFFPAGDCFRKQYEDQLN
Identical sequences G3PRH8 ENSGACP00000020213 69293.ENSGACP00000020213
