SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSSARP00000001073 from Sorex araneus 69_1

Domain architecture


Domain assignment details

Strong hits

Sequence:  ENSSARP00000001073
Domain Number 1 Region: 2759-2924
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.09e-38
Family Laminin G-like module 0.0000000703
 
Domain Number 2 Region: 2928-3109
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 6.04e-34
Family Laminin G-like module 0.0000000429
 
Domain Number 3 Region: 2517-2702
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 3.27e-27
Family Laminin G-like module 0.013
 
Domain Number 4 Region: 2333-2514
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 5.23e-25
Family Laminin G-like module 0.0023
 
Domain Number 5 Region: 2149-2314
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.77e-20
Family Laminin G-like module 0.0016
 
Domain Number 6 Region: 815-867
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000335
Family Laminin-type module 0.011
 
Domain Number 7 Region: 1420-1471
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000698
Family Laminin-type module 0.011
 
Domain Number 8 Region: 1477-1529
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00000001
Family Laminin-type module 0.015
 
Domain Number 9 Region: 865-916
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000109
Family Laminin-type module 0.0029
 
Domain Number 10 Region: 1060-1108
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000156
Family Laminin-type module 0.011
 
Domain Number 11 Region: 284-343
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000156
Family Laminin-type module 0.033
 
Domain Number 12 Region: 757-809
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000159
Family Laminin-type module 0.0066
 
Domain Number 13 Region: 1014-1057
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000176
Family Laminin-type module 0.014
 
Domain Number 14 Region: 1527-1564
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000206
Family Laminin-type module 0.014
 
Weak hits

Sequence:  ENSSARP00000001073
Domain Number - Region: 1838-2081
Classification Level Classification E-value
Superfamily Methyl-accepting chemotaxis protein (MCP) signaling domain 0.000136
Family Methyl-accepting chemotaxis protein (MCP) signaling domain 0.0089
 
Domain Number - Region: 341-397
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000335
Family Laminin-type module 0.022
 
Domain Number - Region: 1119-1163
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00103
Family Laminin-type module 0.034
 
Domain Number - Region: 1378-1404
Classification Level Classification E-value
Superfamily EGF/Laminin 0.00698
Family EGF-type module 0.074
 
Domain Number - Region: 86-164
Classification Level Classification E-value
Superfamily Galactose-binding domain-like 0.0247
Family APC10-like 0.053
 
Domain Number - Region: 721-759
Classification Level Classification E-value
Superfamily EGF/Laminin 0.053
Family EGF-type module 0.033
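The hits above appear to be listed by ascending superfamily E-value rather than by position along the sequence. The following is a minimal Python sketch, assuming the plain-text record layout shown on this page, that reads the "Region" and "Superfamily" lines and reorders the hits by start coordinate. The names DomainHit, parse_hits and sample are hypothetical and used for illustration only; this is not part of any SUPERFAMILY tool or API.

```python
import re
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One domain assignment (hypothetical container, for illustration)."""
    start: int
    end: int
    superfamily: str
    evalue: float


# Patterns assume the plain-text layout shown on this page.
REGION = re.compile(r"Region:\s*(\d+)-(\d+)")
SUPERFAMILY = re.compile(r"^Superfamily\s+(.+?)\s+(\S+)$")


def parse_hits(text: str) -> List[DomainHit]:
    """Collect (region, superfamily, E-value) records, sorted by start position."""
    hits = []
    region = None
    for raw in text.splitlines():
        line = raw.strip()
        match = REGION.search(line)
        if match:
            # Remember the region until its Superfamily line is seen.
            region = (int(match.group(1)), int(match.group(2)))
            continue
        match = SUPERFAMILY.match(line)
        if match and region is not None:
            hits.append(
                DomainHit(region[0], region[1], match.group(1), float(match.group(2)))
            )
            region = None
    return sorted(hits, key=lambda hit: hit.start)


# Two records copied from the strong hits listed on this page.
sample = """\
Domain Number 1 Region: 2759-2924
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 1.09e-38
Family Laminin G-like module 0.0000000703
Domain Number 6 Region: 815-867
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000335
Family Laminin-type module 0.011
"""

for hit in parse_hits(sample):
    print(hit.start, hit.end, hit.superfamily, hit.evalue)
```

Weak hits use "Domain Number -" in place of a number, so the same patterns would match them as well. Family-level lines are ignored in this sketch.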
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSARP00000001073   Gene: ENSSARG00000001113   Transcript: ENSSART00000001169
Sequence length 3112
Comment pep:novel genescaffold:COMMON_SHREW1:GeneScaffold_2503:51855:687173:1 gene:ENSSARG00000001113 transcript:ENSSART00000001169 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MPGAAGVLLALLLSGGLGGGEAQHRRQSQAHQQRGLFPAVLNLASNALITTNATCGEKEP
EMYCKLVEHVPGKPVRDPQCRICNQSSSDPKQRHPITNAIDGKNTWWQSPSIKNGIEYHY
VTITLDLQQMLQRAYVIVKAANSPRPGNWILERSLDDVEYKPWQYHAVTDSECLVLYNIQ
PRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARYIRL
RFQRIRTLNADLMMLAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDPATN
KSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKSECEACNCHGKTKECYYDENVARRN
LSLNIHGKYIGGGVCINCTENTSGINCETCIDGFFRPKGXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIQDMSGW
YLTDLTGHIKVTPEQDDAIPLQEISISSSEAQKALPESYFWSAPAPYLGNKLTAAGGHLT
FTTSYDFEDEEEDTEHRLQFIIILEGNGLRISTVQDEVYLQPSEELTNVFPFKEELFTIH
GTNSPVSRKEFMTVLANLKRLLIQITYSLGMDAIFRLGSVGLESAVTHPVQFPNGRTAVA
VEVCQCPAGYTGTSCESCWPRHRRVNGTIFGGICEACQCFGHAESCDDVTAECLNCKDHT
GGPYCNQCLSGFYGDPTKGTSEDCQPCACPLNIPSNNFSPTCHLDRSLGLICDECPAGYS
GPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPGSCDSLSGSCLICKPGTTGRYCEL
CADGYFGDAVDAKNCQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCDCAHLG
NNCDPKTGRCICPPNTIGEKCSKCAPNTWGHSIITGCKACNCSTEGSLDFQCNVNTGQCN
CHPKFSGAKCSECSRGHWNFPHCTPCDCFLPGTDDSTCDSETKRCFCSDKVGQCTCKVNV
EGVHCDKCQPGKFGLDAKNPLGCSSCYCFGVTTQCSEAKGLIRTWVTLTPEQTILPLVDE
ALQHTTTKGIAFQHPEIVAQMNLVKQDLHLEPFYWKLPEQFEGKKLMAYGGKLKYTIYFE
ARDETGFSTYNPQVIIRGGTRTHARVLVRHMAAPLNGQLTRHEIEMTEKEWKYYGDDPRM
SRTVTREHFLDTLYDIHYILIKATYGNVMRQSRISEISMEVAEPGQISAVTPAAHMIEKC
DCPTGYSGFSCESCMQGFYRSRSDLGGHTAGPTLGTCVPCQCNGHSNMCDPETSICQNCQ
HNTAGDFCDRCAPGYYGIVKGFPNDCRQCACPLISSSNNFSPSCVLEGLDDYRCTECPRG
YEGQYCERCAPGYTGSPRSPGGSCQECECDPHGSLPVPCDPVTGFCTCRPGATGKKCDGC
EHLHAREGTECIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYGLNTTQELKH
LLSPQRAPERLIQLAEGNLNTLVTEMNELLTRATKVTADGEQTGQDAERSNTRANALGEF
IKELIRDAEAVNEKAIKLNETLGTQDKAFERNIQELQKEIDQMMSELRRKNLDSQKEVAE
DELVXXXXXXXXXXXXXXXSRGKNEEMEMNLREKLTDYQNVDDAQDLLREATDKLRETNR
LSAANKQNMTILEXXXXXXXXXXXXXENTLKDGNNLLDEANRLADEINSVIDYVEDIQTK
LPSMSDELKDKIDDLSQKIKDQKLAEKVSQAESHAAQLKDSSAILDRILDEAKNISFNAT
AAFNAYSNIKNYIDEAEKLAMEAKDLANEATKLXXXXXXXXXXDAKTSLQKSFGINEARK
LANDVKENDHHLNGLTTRLENADARNGNLLRALNDTLGKLSAIPSDTADKLQAVKDKARQ
ANDTAKSVLAQIRDLHQNLDGLKKNYKQLADSVAKTNAVVKDPLKNKVXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIKVSVSSGGDCIRTYKPEIK
KGSYNNIVVNVKTPVADNLLFYLGSAKFVDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLT
IDDSYWYRIEASRTGRNGTISVRALDGPKASILPSVYHSASPPGYTILDVDANAMLFVGG
LTGKIKKADAVRVITFTGCMGETDFDKHRWNFREXXXXXXXXXXXPQVEDSEGTIQFDGE
GYALVSRPIRWFPNISTVTLKFRTSSNXXXXXXXXXXXRDFMSVELADGHIKVSYDLGSG
MASVVSNQNHNDGKWKSFTMSRTQKQANISIIDIDTNQEENIVTSSPGNNFGLDLKADDK
IYFGGLPTLRNLSLIKKPEVNLKKYSGCLRDIEISRTPYNILSSPDYVGITKGCSLENIY
TVSFPKPGFVELSPVAIDVGTEINLSFSTRNESGLILLGSGGTFKRKRRQTGQAYYAIFL
NKGRLEVHLSTGARTMRKIVIKPEPSLFHDGREHSVHVERTRGVLIVQVDEDRRHMQDLT
TEQAVEVKKLFVGGAPSEFQPFPLRNIPPFEGCIWNLVINSVPMDFAQPVSFKNADIGRC
THQRPGEDEEGVGPAEAVIEPQPVPTPGFPTHIPFLVHGPCAAESEPALLIGSKQFGLSK
NSHIAIAFDDTKVKNRLTIEFEVRTEAESGLLFYMARINHADFATVQLRNGLPYFSYDLG
SGDTNTLIPTKINDGQWHKIKITRVKQEGILYVDDASNRTISPKKADILDVVGMLYVGGL
PINYTTRRIGPVTYSIDGCIRNLQMAEAPVDLEQPTSSFNVGTCFANAQKGTYFDGTGFA
KAVGAFKVGLELLVEFEFRTTRTTGVLLGISSQKMDGMGIEMIDEKLMVHVDNGAGRFTA
VYDAGIPGHLCDGKWHKVTASKIKHHIKLTVDGNQVEAKSPNPASTSADTNDPVFVGGFP
DGLKQFGLTTSVRFRGCIRSLRLTKGTGKPMEVNFTKALELRGVQPVSCPVN
Identical sequences: ENSSARP00000001073
