SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for ENSSARP00000000940 from Sorex araneus 76_1

Domain architecture


Domain assignment details

Strong hits

Sequence: ENSSARP00000000940

Domain  Region    Level        Classification            E-value
1       267-517   Superfamily  Kelch motif               1.96e-38
                  Family       Kelch motif               0.0071
2       644-777   Superfamily  C-type lectin-like        4.55e-24
                  Family       C-type lectin domain      0.001
3       2-108     Superfamily  Spermadhesin, CUB domain  1.31e-19
                  Family       Spermadhesin, CUB domain  0.0028
4       915-961   Superfamily  EGF/Laminin               2.59e-07
                  Family       Laminin-type module       0.015
 
Weak hits

Sequence: ENSSARP00000000940

Domain  Region    Level        Classification                     E-value
-       201-229   Superfamily  Kelch motif                        0.0183
                  Family       Kelch motif                        0.024
-       150-227   Superfamily  Galactose oxidase, central domain  0.0837
                  Family       Galactose oxidase, central domain  0.031
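
The hits listed above can be handled as simple records. The following is a minimal Python sketch, not part of the SUPERFAMILY server: the `Hit` class, the helper names, and the `1e-4` cutoff are assumptions made for illustration (the page itself does not state the threshold that separates strong from weak hits). It orders the strong superfamily-level hits along the sequence and counts how many of the 1280 residues they cover.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One superfamily-level assignment (hypothetical record type)."""
    superfamily: str
    start: int      # 1-based, inclusive
    end: int        # 1-based, inclusive
    evalue: float


# Superfamily-level rows copied from the strong and weak hit tables.
HITS = [
    Hit("Kelch motif", 267, 517, 1.96e-38),
    Hit("C-type lectin-like", 644, 777, 4.55e-24),
    Hit("Spermadhesin, CUB domain", 2, 108, 1.31e-19),
    Hit("EGF/Laminin", 915, 961, 2.59e-07),
    Hit("Kelch motif", 201, 229, 0.0183),
    Hit("Galactose oxidase, central domain", 150, 227, 0.0837),
]

SEQUENCE_LENGTH = 1280   # "Sequence length" reported on this page
STRONG_CUTOFF = 1e-4     # ASSUMPTION: illustrative cutoff, not stated on the page


def strong_hits(hits, cutoff=STRONG_CUTOFF):
    """Return hits at or below the cutoff, ordered by start position."""
    return sorted((h for h in hits if h.evalue <= cutoff), key=lambda h: h.start)


def covered_residues(hits):
    """Count distinct residue positions covered by the given hits."""
    covered = set()
    for h in hits:
        covered.update(range(h.start, h.end + 1))
    return len(covered)


if __name__ == "__main__":
    ordered = strong_hits(HITS)
    for h in ordered:
        print(f"{h.start:>4}-{h.end:<4} {h.superfamily} ({h.evalue:.2e})")
    n = covered_residues(ordered)
    print(f"{n} of {SEQUENCE_LENGTH} residues covered ({n / SEQUENCE_LENGTH:.1%})")
```

With the assumed cutoff, the four numbered domains are retained and the two unnumbered weak hits are dropped, which matches the split shown on the page.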
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSSARP00000000940   Gene: ENSSARG00000001014   Transcript: ENSSART00000001023
Sequence length 1280
Comment pep:known_by_projection genescaffold:COMMON_SHREW1:GeneScaffold_1568:1636:512880:1 gene:ENSSARG00000001014 transcript:ENSSART00000001023 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
LTEPSGYLTDGPINYKYKTKCTWLIEGYPNAVLRLRFNHFATECSWDHMYVYDGDIYAPL
IAVLSGLIVPETRGNETVPEVVTTSGYALLHFFSDAAYNLTGFNIFYSINSCPNNCSGHG
KCTTSTSVPSQVYCECDKYWKGEACDIPYCKANCGSPDHGYCDLTGEKLCVCNDSWQGPD
CSLNVPSTESYWILPNVKPFSPSVGRASHKAVLLGKLMWVIGGYTFNYSSFQMVLXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXEDIFMYGGRIETSDGNVTDELWIFNIQRQSWSTKT
PTVLGHGQQYAVEGHSAHIMELDSRDVVMIIIFGYSSIYGYTSSIQEYHISSNTWLVPET
KGGIVQGGYGHTSVYDEITKSIYVHGGYKALPGNKYGLVDDLYKYEVNTKTWTILKESGF
ARYLHSAVLINGAMLIFGGNTHNDTSLSNGAKCFSADFLAYDIACDEWKILPKPNLHRDV
NRFGHSAVVINGSMYIFGGFSSVLLNDILVYKPPNCKAFKDEELCKNAGPGIKCIWNKNH
CESWESGNTNNIIKAKCPPKTAASDDRCYRYADCASCTANTNGCQWCDDKKCISASSNCS
MXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHLCGEGWNHIGDAC
LRLNSSRESYDNAKLYCYNLSGNLASLTTSKEVEFVLDEIQKYTQQKVSPWVGLRKINIS
YWGWEDMSPFTNTSLQWLPGEPNDSGFCAYLERAAVAGLKANPCTSMADGLVCEKPVXXX
XXXXXXXXXXXXXXXXXXXXXXTGMECMWCSSTRRCVDSNAYIISLPYGQCLEWQTATCS
PQNCSGLRTCGQCLEQPGCGWCNDPSNTGRGHCVEGSSRGPMKLVGTHNSEMVLDTSLCP
KEKNYEWSFIQCPACQCNGHSTCINNNVCEQCKNLTTGKQCQDCMPGYYGDPTNGGQCTA
CTCSGHANICHMHTGKCFCTTKGIKGDQCQLXXXXXXXXXXXXXXXXXXSLLIDYQFTFS
LLQEDDRHHTAINFIANPEQSNKNLDISINASNNFNLNITWSVGSTAGTISGEETPIVSK
TNIKDYRDSFSYEFNFRSNPNITFYVYVSNFSWPIKIQIAFSQHNTIMDLVQFFVTSFSC
FLSLLLVAAVVWKIKQTCWASRRREQLLRERQQMASRPFASVDVALEVGAEQTDFLRGPL
EGAPKPIAIEPCAGNRAAVLTVFLCLPRGSSGAPPPGQSGLAIASALVDISQQKPSENKD
KTSGVRNRKHHLSTRQGTCV
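
The Comment line and the sequence can both be inspected with a few lines of Python. This is a hedged sketch: the function names are hypothetical, and the reading of the location field as `assembly:name:start:end:strand` is an assumption based on how the field appears in the Comment line above. The first helper splits the Comment into `key:value` fields, the second unpacks the location, and the third reports runs of `X`, which conventionally mark unknown residues.

```python
import re

# Comment line copied from the record above.
COMMENT = (
    "pep:known_by_projection "
    "genescaffold:COMMON_SHREW1:GeneScaffold_1568:1636:512880:1 "
    "gene:ENSSARG00000001014 transcript:ENSSART00000001023 "
    "gene_biotype:protein_coding transcript_biotype:protein_coding"
)


def parse_comment(comment):
    """Split whitespace-separated tokens into a dict keyed on the text before the first colon."""
    fields = {}
    for token in comment.split():
        key, _, value = token.partition(":")
        fields[key] = value
    return fields


def parse_location(value):
    """Unpack a location value, ASSUMED to be assembly:name:start:end:strand."""
    assembly, name, start, end, strand = value.split(":")
    return {
        "assembly": assembly,
        "name": name,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }


def unknown_runs(sequence, symbol="X"):
    """Return 1-based inclusive (start, end) spans of consecutive `symbol` residues."""
    pattern = re.escape(symbol) + "+"
    return [(m.start() + 1, m.end()) for m in re.finditer(pattern, sequence)]


if __name__ == "__main__":
    fields = parse_comment(COMMENT)
    print(fields["gene"], fields["transcript"])
    print(parse_location(fields["genescaffold"]))
    # Short illustrative fragment; positions are relative to the fragment,
    # not to the full 1280-residue sequence.
    print(unknown_runs("SFQMVLXXXXX"))
```

To get positions in the full protein, pass the complete sequence with line breaks removed instead of the fragment.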
Identical sequences: ENSSARP00000000940
