SUPERFAMILY 1.75 HMM library and genome assignments server

Domain assignment for ENSSARP00000009344 from Sorex araneus 76_1


Domain assignment details

Strong hits

Sequence:  ENSSARP00000009344
Domain Number 1 Region: 3805-4006
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.2e-39
Family Laminin G-like module 0.0035
 
Domain Number 2 Region: 1126-1255
Classification Level Classification E-value
Superfamily Cadherin-like 1.44e-31
Family Cadherin 0.0016
 
Domain Number 3 Region: 3327-3438
Classification Level Classification E-value
Superfamily Cadherin-like 2.49e-31
Family Cadherin 0.00094
 
Domain Number 4 Region: 3221-3325
Classification Level Classification E-value
Superfamily Cadherin-like 3.43e-31
Family Cadherin 0.0012
 
Domain Number 5 Region: 3119-3221
Classification Level Classification E-value
Superfamily Cadherin-like 2e-30
Family Cadherin 0.00058
 
Domain Number 6 Region: 2271-2385
Classification Level Classification E-value
Superfamily Cadherin-like 3.66e-29
Family Cadherin 0.0012
 
Domain Number 7 Region: 2803-2911
Classification Level Classification E-value
Superfamily Cadherin-like 2.57e-27
Family Cadherin 0.0015
 
Domain Number 8 Region: 920-1023
Classification Level Classification E-value
Superfamily Cadherin-like 2.86e-27
Family Cadherin 0.0014
 
Domain Number 9 Region: 1549-1673
Classification Level Classification E-value
Superfamily Cadherin-like 6.42e-27
Family Cadherin 0.0015
 
Domain Number 10 Region: 1019-1136
Classification Level Classification E-value
Superfamily Cadherin-like 7.85e-27
Family Cadherin 0.0007
 
Domain Number 11 Region: 1760-1880
Classification Level Classification E-value
Superfamily Cadherin-like 1.44e-25
Family Cadherin 0.0018
 
Domain Number 12 Region: 815-926
Classification Level Classification E-value
Superfamily Cadherin-like 1.71e-25
Family Cadherin 0.00096
 
Domain Number 13 Region: 2069-2176
Classification Level Classification E-value
Superfamily Cadherin-like 3.14e-23
Family Cadherin 0.0022
 
Domain Number 14 Region: 3025-3124
Classification Level Classification E-value
Superfamily Cadherin-like 5.63e-23
Family Cadherin 0.001
 
Domain Number 15 Region: 708-813
Classification Level Classification E-value
Superfamily Cadherin-like 6.14e-23
Family Cadherin 0.0015
 
Domain Number 16 Region: 137-230
Classification Level Classification E-value
Superfamily Cadherin-like 3.85e-22
Family Cadherin 0.0018
 
Domain Number 17 Region: 3431-3534
Classification Level Classification E-value
Superfamily Cadherin-like 7.42e-22
Family Cadherin 0.0013
 
Domain Number 18 Region: 456-557
Classification Level Classification E-value
Superfamily Cadherin-like 8.85e-22
Family Cadherin 0.001
 
Domain Number 19 Region: 2176-2283
Classification Level Classification E-value
Superfamily Cadherin-like 1.3e-21
Family Cadherin 0.0015
 
Domain Number 20 Region: 1449-1553
Classification Level Classification E-value
Superfamily Cadherin-like 5.85e-21
Family Cadherin 0.0011
 
Domain Number 21 Region: 2383-2483
Classification Level Classification E-value
Superfamily Cadherin-like 3.28e-20
Family Cadherin 0.0012
 
Domain Number 22 Region: 1659-1782
Classification Level Classification E-value
Superfamily Cadherin-like 7.42e-20
Family Cadherin 0.0023
 
Domain Number 23 Region: 2911-3010
Classification Level Classification E-value
Superfamily Cadherin-like 2.71e-19
Family Cadherin 0.00091
 
Domain Number 24 Region: 2584-2694
Classification Level Classification E-value
Superfamily Cadherin-like 3.14e-19
Family Cadherin 0.0013
 
Domain Number 25 Region: 2486-2596
Classification Level Classification E-value
Superfamily Cadherin-like 5.14e-19
Family Cadherin 0.0021
 
Domain Number 26 Region: 1356-1455
Classification Level Classification E-value
Superfamily Cadherin-like 4.19e-17
Family Cadherin 0.01
 
Domain Number 27 Region: 1232-1352
Classification Level Classification E-value
Superfamily Cadherin-like 6.28e-17
Family Cadherin 0.0041
 
Domain Number 28 Region: 1873-1971
Classification Level Classification E-value
Superfamily Cadherin-like 9.95e-17
Family Cadherin 0.0033
 
Domain Number 29 Region: 561-662
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000000000275
Family Cadherin 0.0034
 
Domain Number 30 Region: 2700-2801
Classification Level Classification E-value
Superfamily Cadherin-like 0.000000000000157
Family Cadherin 0.0025
 
Domain Number 31 Region: 1979-2081
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000000471
Family Cadherin 0.0022
 
Domain Number 32 Region: 366-451
Classification Level Classification E-value
Superfamily Cadherin-like 0.00000000118
Family Cadherin 0.0032
 
Domain Number 33 Region: 4121-4159
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000183
Family EGF-type module 0.0055
 
Domain Number 34 Region: 33-149
Classification Level Classification E-value
Superfamily Cadherin-like 0.0000000121
Family Cadherin 0.0086
 
Domain Number 35 Region: 4034-4161
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.0000000204
Family Growth factor receptor domain 0.013
 
Domain Number 36 Region: 4009-4046
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000125
Family EGF-type module 0.011
 
Weak hits

Sequence:  ENSSARP00000009344
Domain Number - Region: 3562-3625
Classification Level Classification E-value
Superfamily Cadherin-like 0.000514
Family Cadherin 0.01
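
The hit blocks above follow a fixed four-line layout (domain number and region, a header row, a superfamily row, a family row), so they can be read into records and checked for overlapping regions. The following is a minimal sketch, assuming that layout holds for every block; the names `DomainHit`, `parse_hits`, and `overlapping_pairs` are hypothetical helpers for illustration and are not part of any SUPERFAMILY tool. The sample text copies four of the strong hits listed above.

```python
import re
from dataclasses import dataclass
from itertools import combinations


@dataclass
class DomainHit:
    number: str              # "1".."36" for strong hits, "-" for weak hits
    start: int               # first residue of the region (1-based, inclusive)
    end: int                 # last residue of the region (inclusive)
    superfamily: str
    superfamily_evalue: float
    family: str
    family_evalue: float


# One match per four-line block. The greedy name group backtracks to the last
# space on the line, so multi-word names such as
# "Concanavalin A-like lectins/glucanases" stay intact.
BLOCK_RE = re.compile(
    r"Domain Number (?P<num>\S+) Region: (?P<start>\d+)-(?P<end>\d+)[ \t]*\n"
    r"Classification Level Classification E-value[ \t]*\n"
    r"Superfamily (?P<sf>.+) (?P<sf_e>\S+)[ \t]*\n"
    r"Family (?P<fam>.+) (?P<fam_e>\S+)"
)


def parse_hits(text: str) -> list[DomainHit]:
    """Turn the page's hit blocks into DomainHit records, in page order."""
    return [
        DomainHit(
            number=m["num"],
            start=int(m["start"]),
            end=int(m["end"]),
            superfamily=m["sf"],
            # float() accepts both notations used on the page
            # (e.g. "6.2e-39" and "0.00000000183").
            superfamily_evalue=float(m["sf_e"]),
            family=m["fam"],
            family_evalue=float(m["fam_e"]),
        )
        for m in BLOCK_RE.finditer(text)
    ]


def overlapping_pairs(hits: list[DomainHit]) -> list[tuple[str, str, int]]:
    """Return (number_a, number_b, shared_residues) for regions that overlap."""
    pairs = []
    for a, b in combinations(hits, 2):
        shared = min(a.end, b.end) - max(a.start, b.start) + 1
        if shared > 0:
            pairs.append((a.number, b.number, shared))
    return pairs


SAMPLE = """\
Domain Number 1 Region: 3805-4006
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.2e-39
Family Laminin G-like module 0.0035

Domain Number 33 Region: 4121-4159
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000000183
Family EGF-type module 0.0055

Domain Number 35 Region: 4034-4161
Classification Level Classification E-value
Superfamily Growth factor receptor domain 0.0000000204
Family Growth factor receptor domain 0.013

Domain Number 36 Region: 4009-4046
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000125
Family EGF-type module 0.011
"""

hits = parse_hits(SAMPLE)
overlaps = overlapping_pairs(hits)
```

On this sample, `overlaps` reports that domain 33 lies entirely inside domain 35 (39 shared residues) and that domains 35 and 36 share 13 residues, while domain 1 overlaps none of them.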

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSARP00000009344   Gene: ENSSARG00000010248   Transcript: ENSSART00000010332
Sequence length 4582
Comment pep:known_by_projection genescaffold:COMMON_SHREW1:GeneScaffold_795:10595:120418:-1 gene:ENSSARG00000010248 transcript:ENSSART00000010332 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGKSLALLLLLLLLLQHFGDSAGSQTLEQTPLQFTHFQYNVTVYENSAAKTYVGHPVKMG
IYLTNPSWELRYKIISGDNENLFKAEEYILGDFCFLRIRTKGGNTAILNREVKDHYTLIV
KAVEKNTNAEARTKVRVRVLDTNDLRPLFSPTSYSVSLPENTALRTSIARVSATDADIGT
NGEFYYSFKDRTDMFAIHPTSGAIILTGRLDYMETKLYEMEILAVDRGMKLYGSSGISSM
ALTVHVEQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLTLQAKDKGSPPQFSSVKVIHMTSPQFK
AGPVKFEKSVYRAEISEFAPPNTPVVMVKATPSYPHLKYAFKSTPGKAKFSLNQNTGLIS
ILEPIRRQHASHFELEVTTSDRKVSTKVLVKVLSANSNPPEFTQTAYKASFDENVPIGTT
VLSVSALDPDEGENGYVTYSIANLNHVPFVIDHFSGAVSTSETLDYELMPRVYTLRVRAS
DWGLPYRREVEVLATVTLNNLNDNTPLFEKINCEGTIPRDLGVGEQITTVSAIDADELQL
VRYQIEAGNELELFSLNPSSGVLSLKQSLMDGVGAKVTFHSLRITATDGENFATPLYINI
TVAAPRKQINLHCEETGVAKMLAEKLLQANKLHSQGEVEDIFFDSHSVNAHAPQFRSALP
TSIEVKENHPVGSNIILMNATDLDTGFNGKLVYAISGGNEDSCFIIDMDTGMLKILSPLD
RETTDKYILNITVYDLGIPQKAAWRLLDVRVLDANDNPPEFLQESYFVEVSEDKEISSEI
IQIEATDKDLGANGHVTYSILTDTDKFSIDSVTGVIRIVSPLDRETQHVHYLKIEARDQA
KEEAQLFSTVLLKVSLEDVNDNPPKFIPPNYRVKVREDLPEGTIIMWLEAYDPDLGQSSQ
VRYSLLDHGEGHFDVDKLSGAVRIVQQLDFEKKQVYNLTVRAKDKGKPVSLSSTCFVEVE
VIDVNENLHPPVFSSFVEKGVVKEDVPIGSSVMTVSAHDEDTGRDGEIRYSIRDGSGVGV
FRIDEETGVIETSDRLDRESASHYWLTVYAADQGVVPLSSFIEIYIEVEDVNDNAPQTSE
PVYYPEVMENSPKDVAVVQIEAFDPDSSSNDKLTYKITSGNPQGFFSIHPKTGLITTTAR
KLDREQQDEHILEVTVTDNGIPPKSTIARVIVKILDENDNKPQFLQKFYKIRLPEREKPE
RERNAKREPIYRVIALDKDEGPNAEISYSIEEGNEHGKFFIEPKTGVVSSKKSSAAGEYD
ILSIKAVDNGRPQKSSTTRLHIEWIAKPKPSPEPIAFEESFFSFTVMESDPVAHMIGVIS
VERPGIPLWFDIIGGNYDSHFDVDKGTGTMIVAKPLDAEQKSSYNLTVEATDGTTSIFTQ
VFIKVIDTNDHRPQFSTSKYEVVIPEDTVPETEILQISALDKDEKNKLIYTLQSSIDPLS
LKKFRLDPATGSLYTSEKLDHEAIHQHVLTVMVRDQDVPVKRNFARIVVDVSDTNDHAPW
FTSSSYEGRVYESAAVGSVVLQVTALDKDKGKNAEVLYSIESGIFNIGNSFTIDPILGSI
KTAKELDRSNQVEYDLMIKATDKGNPPMSEITSVHIFVTVADNASPKFTSKQYSVEISET
IGIGSFVGMVTAQSQSSVVYEIKDGNIADAFDINPHSGSIITQKALDFETLPIYTLIVQG
TNMAGLSTNTTVLVHLQDENDNWPVFMQVEYTGLISESASVNSVVLTDKNVPLVVRATDA
DKESNALLVYHIVEPSIHKYFAIDSSTGAIHTVLSLDYEETRTFHFTVQVHDMGTPRLFA
EYAANVTIHVIDINDCPPVFSKSLYEASLLLPTYKGVKVITVNATDADSRAFSQLIYSIT
EGNIGEKFSMDYKTGTITVQNTTQLRSRYELTIRASDGRFASFTSVKINVKESKESHLKF
TQDFYSAVVKENSTEARTLAVITAIGNPINDPLFYQILNPDRRFKISRTSGVLSTTGIPF
DREQQEADVVVEVTQEHKPSAVAHVVVKVTIEDQNDNAPVFVNLPYYAVVKVDAAVGHVI
RSVTAVDKDSGRNGEVHYYLKEHHEHFQIGSSGEISLKKPFEPDTLNKEYLITVVAKDGG
DPAFSAEVIVPVTIMNRAMPVFEKPFYSAEIPENIQMHSPVVHVQANSPEGLKVFYSITD
GDPFSQFTINFNTGVINVIAPLDFEYHPAYKLNIRATDSLTGAHAEVFVDIIVEDINDNP
PVFVQQAYAATLSEASIIGTPIIQVRATDADSEPNRGISYHLFGNHSKSHDHFHIDSSTG
LISLARTLDYEQFQQHQIFIRAVDSGMPPLSSDTVVTVQITDLNDNPPLFDQQIYEARIS
EHASPGHFVTCVKAYDSDSSDIDKLEYSILSGNDHKNFVIDSETGIITLSNLRRHTLKPF
YSLNISVSDGVFRSAAQVHVTVIGGNLHSPVFLQNEYEVELAENAPLHTLVIEVKATDGD
SGIYGHITYHIVNDFAKDRFYTNDRGQILTLEKLDRETPAEKVIAIRLMAKDAGGKVAFC
TINVILTDDNDNAPQFRATKYEVNIGSSAPKGTSVIKVLSSDADEGSNADVTYAIEADSE
SVKENLEINKMSGIITTKESLIGLENEFFTFFVRAVDNGSPPKESVVPVYVKILPPEMQL
PKFTEPFYTYTISEDMPIGTEIDLIRAEHSGTVLYSLVKGNTPESNRDEFFVIDRQSGRL
KLEKSLDHETTKWYQFSILARCTHEDSEVMASVDVSIQVKDANDNSPVLESNPYEAYIVE
NLPGGSRVIQVRASDLDSGTNGHVMYSLDQSQSVDIIESFAINMETGWITTLKELDHEER
NNYQIQVIASDHGEKVQLSSTAIVDVTVTDVNDSPPRFTAEIYKGTVSEDDPPGGVIAIL
STTDADSEEINRQVTYCITGGDPLRQFGIEVQNEWKVYVKKPLDREQGDNYLLTITATDG
TFSSKAVVEVEXXXXXXXXXXXXXTLYSETIREDAFPGKLVMQVSATDADIRSNAEITYT
LFGPGAEKFKLNPDTGELKTSAPLDREEQATYSLFIKATDGGGRFCQAHVMLTLEDVNDN
APEFSADPYTITVFENTEPGTLLTRVQASDADEGLNRQISYSMVNSADGQFSINKVSGII
QLERPLDRELQAVYTLTVRAADQGSRSLTATSTVVVSVLDINDNPPVFEYREYGATVSED
ILIGTEVLQVYAASRDIEANAEITYSIISGNEHGKFSIDSKTGAIFVIENLDYESSHEYY
LTVEATDGGTPSLSDVATVNINVTDINDNSPVFSQDTYTAVISEDAVLEQSVITVMADDA
DGPSNSHIHYSIIDGNQGSPFTIDPARGEVKVTSLLDRETISGYTLTVQASDNGSPPRVN
TTTVNIDVSDVNDNAPVFSQGNYSVIIQENKPVGFSVLQLAVTDKDSSHNGPPFFFTIVS
GNDDGAFEVNQQGILLTSASIKRKVKDHYLLHIKVADNGKPQLSSLTHIDIRVVESIYPP
AILPLEIISASREEYSGGVGEIHDTDQDVYDTLSYSQDPHMDNLFSVFSTWGKLIAHKKL
DTGHYVLNVSVTDGKFTTTADISVHLRQFTQEMLNHTLAIRFANLTPEEFVGDYWRNFQR
ALRNILGVRKNDIQIVSLQPSEPHQHLDVLLYIEKSGSAPSTKQLLHKINSSVTDIEEII
GVKILDVFQKLCAGLDCPWKFCDEKVSVDEEVMSTHSTARLSFVTPRHRRTAVCLCKEGN
CPIVHHGCEDNPCPEGSECVTDPQEERYTCVCPGGTFGQCPGSSSLTFTGNSYVKYRLME
NENKLEMKLTMRLRTYSSHAVVMYARGTDYSILEIHNGRLQYKFDCGSGPGIVSVQSIQV
SDGLWHEVTLEVNGNYARLVLDQVHTASGTAPGTLKTLNLDNHVYFGGHTHQQGTRHGRS
SQVSNGFRGCMDSIYLNGQELPLNNKPRSYAHIEESVDVSPGCLLTATEDCSSNPCQNGG
VCNPSPTGGYYCKCSALHIGTYCEVSVNPCASNPCLYGGTCMVDNGDFVCQCRGSYTGQR
CQLSPYCKDEPCKNGGTCFDSLDGAVCQCDSGFRGERCQSDIDECAGNPCRNGALCENTH
GSYHCNCSHEYKGRHCEDAAPNQYVSTPWNIGLAEGIGIVVFVMGIFLLVVTFVLCRKMI
SRKKKPQAEPEDKHLGPSTAFLQRPYFDAKLNKNIYSDVPPQVPVRPISYTPSIPSDSRN
NLDRNSFEGSAIPEHPEFSTFNPEAVHGHRKAVAVCSVAPNLPPPPPSNSPSDSDSIQKP
NWDFDYDTKVVDLDPCLSKKPLEEKPSHPYSARESLSEVQSLSSFQSESCDDNGYHWDTS
DWMPTVPLPDIQEFPNYEVIDEQTPLYSADPNAIDTDYYPGGYDIESDFPPPPEDFPAPD
ELPPLPPEFSDQFESIHPPRDMPAAGSLGSSARSRQRFHLNQYLPNFYPVDLSEPQKAGT
GEVSACREPYAPYPPGYPRNFEAPPVENIPLSMYTATASCSDVSACCEVESEVMMSDYES
GDDGHFEEVTIPPLDSHQHTQV
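
The sequence above is wrapped across many lines and contains runs of `X` (unknown residues), so a quick consistency check against the stated sequence length can be useful. The following is a rough sketch, assuming the wrapped lines have been copied into a string; `summarize_sequence` is a hypothetical helper name, and the short example input is the final line of the sequence plus an invented toy line, not the full protein.

```python
def summarize_sequence(wrapped: str) -> dict:
    """Join wrapped sequence lines and count known and unknown residues."""
    # str.split() with no arguments drops newlines and any stray spaces.
    seq = "".join(wrapped.split()).upper()
    unknown = seq.count("X")  # X marks residues of unknown identity
    return {
        "length": len(seq),
        "unknown": unknown,
        "known": len(seq) - unknown,
    }


def matches_stated_length(wrapped: str, stated_length: int) -> bool:
    """Check the joined sequence against the page's 'Sequence length' field."""
    return summarize_sequence(wrapped)["length"] == stated_length


# Last line of the sequence as shown on the page.
last_line = summarize_sequence("GDDGHFEEVTIPPLDSHQHTQV\n")

# Invented toy input (not from the page) showing how X runs are counted.
toy = summarize_sequence("MGKS\nXXAB\n")
```

Passing the full wrapped sequence together with the stated length of 4582 to `matches_stated_length` would confirm that no lines were lost when copying.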
Identical sequences ENSSARP00000009344