SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSARP00000000317 from Sorex araneus 69_1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSSARP00000000317
Domain Number 1 Region: 139-316
Classification Level Classification E-value
Superfamily MIR domain 2.01e-50
Family MIR domain 0.0017
Further Details:      
 
Domain Number 2 Region: 1001-1088
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.000000000000686
Family SPRY domain 0.06
Further Details:      
 
Domain Number 3 Region: 3930-3985
Classification Level Classification E-value
Superfamily EF-hand 0.000000000931
Family Calmodulin-like 0.046
Further Details:      
 
Domain Number 4 Region: 579-727
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.0000000285
Family SPRY domain 0.058
Further Details:      
 
Domain Number 5 Region: 22-104
Classification Level Classification E-value
Superfamily MIR domain 0.0000034
Family MIR domain 0.021
Further Details:      
 
Domain Number 6 Region: 2064-2129
Classification Level Classification E-value
Superfamily IP3 receptor type 1 binding core, domain 2 0.0000955
Family IP3 receptor type 1 binding core, domain 2 0.023
Further Details:      
 
Weak hits

Sequence:  ENSSARP00000000317
Domain Number - Region: 1330-1450
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 0.000686
Family SPRY domain 0.026
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSARP00000000317   Gene: ENSSARG00000000339   Transcript: ENSSART00000000351
Sequence length 4864
Comment pep:novel genescaffold:COMMON_SHREW1:GeneScaffold_6936:7979:601703:1 gene:ENSSARG00000000339 transcript:ENSSART00000000351 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
QVDVEKWKFMMKTAQGSGHRTLLYGHAILLRHSYSGMYLCCLSTSRSSTDKLAFDVGLQE
DTTGEACWWTIHPASKQRSEGEKVRVGDDLILVSVSSERYLHLSYGNGSLHVDAAFQQTL
WSVAPISSGSEAAQGYLIGGDVLRLLHGHMDECLTVPSGEHGEEQRRTVHYEGGAVSVHA
RSLWRLETLRVAWSGSHIRWGQPFRLRHVTTGKYLSLLEDKNLLLMDKEKADVKSTAFAF
RSSKEKLDVGMRKEVDGMGTSEIKYGDSVCYIQHVNTGLWLTYQSVDVKSVRMGSIQRKA
IMHHEGHMDDGLNLSRSQHEESRTARVIRSTVFLFNRFIRGLDALSKKAKASTVDLPIES
VSLSLQDLIGYFHPPDEQLEHEDKQNRLRALKNRQNLFQEEXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXI
LEVLHCVLVESPANIIKEGHIKSIISLLDKHGRNHKVLDVLCSLCVCHGVAVRSNQHLIC
DNLLPGRDLLLQTRLVNHVSSMRPNIFLGVSEGSAQYKKWYYELMVDHTQPFVTAEATHL
RVGWASTEGYSPYPGGGEEWGGNGVGDDLFSYGFDGLHLWSXXXXXXXXXXXQHLLRTDD
VISCCLDLSAPSISFRINGQPVQGMFENFNIDGLFFPVVSFSAGIKVRFLLGGRHGEFKF
LPPPGYAPCYEAVLPKEKLKVEHSREYKQERTYTRDLLGPTVPLTQAAFTPVPVDTSQIV
LPPHLERIRERLAENIHELWVMNKIELGWQYGPVRDDNKRQHPCLVEFSKLPEQERNYNL
QMSLETLKTLLALGCHVGISDEHAEERVKKMKLPKNYQLTSGYKPAPMDLSFVKLTPSQE
AMVDKLAENAHNVWARDRIRQGWTYGIQQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXARAEVCSSTGERFRIFRAEKTYAVKTGRWYFEFEAVTAGDM
RVGWSRPGCQPDQELGSDDRAFAFDGFKAQQWDKVHEHYGRSWQAGDVVGCMVDMTEHTM
MLTLNREIFLDDSGSELALKDFDVGDGFIPVCSLGVAQVGRMNFGKDVSTLKYFTICGLQ
EGYEPFAVNTNRDITMWLSKRLPQFLQVPSNHEHIEVTRIDGTIDSSPCLKITQKSFGSQ
NSSTDIMFYRLSMPIECAEVFSKTGLPGAGLFGPKNDLEDYDADSDFEVLMKTAHGHLVP
DRVDKDKEATKPEFNNHKDYAQEKPSRLKQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXYYYSVRIFPGQEPANVWVGWITSDFHQYDTGFDLDRVRTVTVTLGDEKGKV
HESIKRSNCYMVCAGESMSPGQGRNNNGLEIGCVVDAASGLLTFTANGKELSTYYQVEPS
TKLFPAVFAQATSPNVFQFELGRIKNVMPLSAGLFKSEHKNPVPQCPPRLHVQFLSHVLW
SRMPNQFLKVDVSRISERQGWLVQCLDPLQFMSLHIPEENRSVDILELTEQEELLKFHYH
TLRLYSAVCALGNHRVAHALCSHVDEPQLLYAIENKYMPGLLRTGYYDLLIDIHLSSYAT
ARLMMNNEFIVPMTEETKSITLFPDENKKHGLPGIGLSTSLRPRMQFSSPSFVSINNECY
QYSPEFPLDILKAKTIQMLTEAVKEGSLHARDPVGGTTEFLFVPLIKLFYTLLIMGVFHN
EDLKHILQLIEPSVFKEAASPEEESESLEKDLGTEDSKQEGIMEEEARAGNGPKEGLLQM
KLPEPVKLQMCLLLQYLCDCQVRHRIEAIVAFSDDFVAKLQDNQRFRYNEVMQALNMSAA
LTARKTKEFRSPPQEQINMLLNFKDDKNECPCPEEIRDQLLEFHEDLMAHCXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTLQQLISDTMVRWAQESVIE
DPELVRAMFMLLHRQYDGIGGLVRALPKTYTINGVSVEDTINLLASLGQIRSLLSVRMGK
EEEKLMIRGLGDIMNNKVFYQHPNLMRALGMHETVMEVMVNVLGGGESKEITFPKMVANC
CRFLCYFCRISRQNQKAMFDHLSYLLENSSVGLASPAMRGSTPLDVAAASVMDNNELALA
LREPDLEKVVRYLAGCGLQSCQMLVSKGYPDIGWNPVEGERYLDFLRFAVFCNGESVEEN
ANVVVRLLIRRPECFGPALRGEGGNGLLAAMEEAIKIAEDPSRDGPSPTSGSSKTLDTEE
EEDDTIHMGNAIMTFYAALIDLLGRCAPEMHLIHAGKGEAIRIRSILRSLIPLADLVGVI
SIAFQMPTIAKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXAALSATDMALALNRYLCTAVLPLLTRCAPLFAGTEHHASLIDSLLHTVYRLS
KGCSLTKAQRDSIEVCLLSICGQLRPSMMHLLRRLVDVPLLNEHAKMPLKLLTNHYERCW
KYYCLPGGWGNFGAASEEELHLSRKLFWGIFDALSQKKYEQELFKLALPCLSAVAGALPP
DYMESNYVSMMEKQSSMDSEGNFNPQPVDTSNITIPEKLEYFINKYAEHSHDKWSMDKLA
TGWIYGEVFSDSSKVQPLMKPYKLLSEKEKEIYRWPIKESLKTMLAWGWRIERTREGDSM
ALYNRTRRISQTSQVSVDAAHGYSPRAIDMSNVTLSRDLHAMAEMMAENYHNIWAKKKKM
ELESKGGGNHPLLVPYDTLTAKEKAKDREKAQDILKFLQINGYAVSRGFKDLELDTPSIE
KRFAYSFLQQLIRYVDEAHQYILEFDGGSRTKGEHFPYEQEIKFFAKVVLPLIDQYFKNH
RLYFLSAASRPLCTGGHASNKEKEMVTSLFCKLGVLVRHRISLFGNDATSIVNCLHILGQ
TLDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXEDVQVSCYRILTSLYALGTSKSIYVEXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLNLPANVEDVCPNIPSLEKLMEEI
VDLAESGIRYTQMPHVMEVILPMLCSYMSRWWEHGPENNLERAEMCCTALSSEHMNTLLG
NILKIIYNNLGIDEGAWMKRLAVFSQPIINKVKPQLLKTHFLPLMEKLKKKAAMVVSEED
HLKAEARGDMSEAELLILDEFTTLARDLYAFYPLLIRFVDYNRAKWLKEPNPEAEDLFRM
VAEVFIYWSKSHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAAVSDQERKKMKRKGDR
YSMQTSLIVAALKRLLPIGLNICAPGDQELIALAKMRFSMKDTEDEVRDIIRSNIHLQGK
LEDPAIRWQMALYKDLPNRTEDTSDPEKTIERVLDIANVLFHLEQKSTYIYRRYGLXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRAVNLFLQGYEKSWIETEEHYFEDK
LIEDLAKPGVEPSEEDEGNKRVDPLHQLILLFSRTALTEKCKLEEDFLYMAYADIMAKSC
HDEEDDDGEEEVKSFEEKEMEKQKLLYQQARLHDRGAAEMVLQTISASKGETGPMVAATL
KLGIAILNGGNSTVQQKMLDYLKEKKDVGFFQSLAGLMQSCSVLDLNAFERQNKAEGLGM
VTEEGSGEKVLQDDEFTCDLFRFLQLLCEGHNSDFQNYLRTQTGNNTTVNIIISTVDYLL
RVQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGPCTGNQQSLAHSR
LWDAVVGFLHVFAHMQMKLSQDSSQIELLKELMDLQKDMVVMLLSMLEGNVVNGTIGKQM
VDMLVESNNVEMLKFFDMFLKLKDLTSSDTFKEYDPDGKGVISKRDFHKAMESHKHYTQS
ETEFLLSCAETDENETLDYEEFVKRFHEPAKDISFNVAVLLTNLSEHMPNDTRLQTFLEL
AESVLNYFQPFLGRIEIMGSAKRIERVYFEISESSRTQWEKPQVKESKRQFIFDVVNEGG
EKEKMELFVNFCEDTIFEMQLAAQISESDLNERSTNKEESEKEKPEEQGPRMGFFSILTV
KSALFALRYNILTLMRMLSLKSLKKQMKKVKKMTMKDMVTAFFSSYWSIFLGLLHFVASV
FRGFFRIICSLLLGGSLVEGAKKIKVAELLANMPDPTQDEVRGDGEEGERKPLEAPLPSE
DLTDLKELTEESDLLSDIFGLDLRREGGQYKLIPHNPNAGLSDLMSNPVPLPEVQEKFQX
XXXXXXXXXXXXXXXXXXXXXXGEDGQGEKVKDDKXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVPLVIFKR
EKEVARKLEFDGLYITEQPSEDDIKGQWDRLVINTQSFPNNYWDKFVKRKVMDKYGEFYG
RDRISELLGMDKAALDFSDAREKKKPKKDSSLSAVLNSIDVKYQMWKLGVVFTDNSFLYL
AWYMTMSVLGHYNNFFFAAHLLDIAMGFKTLRTILSSVTHNGKQLVLTVGLLAVVVYLYT
VVAFNFFRKFYNKSEDGDTPDMKCDDMLTCYMFHMYVGVRAGGGIGDEIEDPAGDEYEIY
RIIFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVKEDMETKCFICGIGNDYFDTVP
HGFETHTLQEHNLANYLXXXMYLINKDEEHTGQESYVWKMYQERCWEFFPAGDCFRKQYE
DQLN
Download sequence
Identical sequences ENSSARP00000000317 ENSSARP00000000317

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]