SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSMUSP00000023566 from Mus musculus 63_37 (longest transcript per gene)

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSMUSP00000023566
Domain Number 1 Region: 807-1068
Classification Level Classification E-value
Superfamily Trypsin-like serine proteases 4.66e-88
Family Eukaryotic proteases 0.000000127
Further Details:      
 
Domain Number 2 Region: 389-547
Classification Level Classification E-value
Superfamily Concanavalin A-like lectins/glucanases 7.28e-33
Family MAM domain 0.0037
Further Details:      
 
Domain Number 3 Region: 575-678
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 4.32e-28
Family Spermadhesin, CUB domain 0.00098
Further Details:      
 
Domain Number 4 Region: 53-165
Classification Level Classification E-value
Superfamily SEA domain 2.35e-19
Family SEA domain 0.0074
Further Details:      
 
Domain Number 5 Region: 276-380
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 1.22e-16
Family Spermadhesin, CUB domain 0.0028
Further Details:      
 
Domain Number 6 Region: 718-819
Classification Level Classification E-value
Superfamily SRCR-like 0.00000000000000445
Family Scavenger receptor cysteine-rich (SRCR) domain 0.011
Further Details:      
 
Domain Number 7 Region: 684-723
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.0000000000681
Family LDL receptor-like module 0.00094
Further Details:      
 
Domain Number 8 Region: 228-264
Classification Level Classification E-value
Superfamily LDL receptor-like module 0.00000445
Family LDL receptor-like module 0.0027
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSMUSP00000023566   Gene: ENSMUSG00000022857   Transcript: ENSMUST00000023566
Sequence length 1069
Comment pep:known chromosome:NCBIM37:16:78953253:79091337:-1 gene:ENSMUSG00000022857 transcript:ENSMUST00000023566
Sequence
MKSSRDEAVGHHSISSFEVMLSALFIMLMVFSIGLIAVSWLAVKESEGDAALGKSHEVRG
TFKITSGVTYNPNLQDKHSVDFKVLAFDLQQMIDEIFESSSLKNEYEKSKVFQFEKGSVI
VLFDLFFAQWVSDKNVKEELIQGIEANISSQLVTLHIDLNSIDITASLSDFTTAVPVTTS
DKLTTSSPMTTSASLGNLSTTVAATTSAPLCNLSTATFATTSGHVSIECQPGSRPCAHAW
NCVATDLFCDGEVNCPDGSDEDTGLCATACDGRFLLTGDSGVFQADRYPRPDESGVVCRW
IIRVNQGLSIRMNFGSFIPHYTDVLDIYEGIGPSKILRGSFWETDPGTIRIFSNLVTVTF
LIKSDEYDYIGFNATYSTFNNSELNNYEKIDCTFDDGFCFWTQDLDDDNEWERIQVTTFP
CYTGPRFDHTYGNGSGFYISTPTEQGWRSERVGLSSLSLDLTSEPVCLHFWYYMCCENVY
NLNIHISSAETTDKIVFQRKGNYGRNWNYGQVTLNETGEFKVVFNAFRNRGCSTIALDDI
SLTNGICSQSPYPEPTLVPTPPPELPTDCGGPFELWEPNSTFSSPNFPDKYPNQASCIWN
LNAQRGKNIQLHFQEFDLENINDVVEVRDGGEFDSLLLAVYTGPGPVKDLFSTTNRMTVI
FTTNMETRRKGFKANFTSGYYLGIPEPCQDDEFQCKDGNCIPLGNLCDSYPHCRDGSDEA
SCVRFLNGTRSNNGLVQFNIHSIWHIACAENWTTQISNEVCHLLGLGSANSSMPISSTGG
GPFVRVNQAPNGSLILTPSLQCSQDSLILLQCNHKSCGEKKVTQKVSPKIVGGSDAQAGA
WPWVVALYHRDRSTDRLLCGASLVSSDWLVSAAHCVYRRNLDPTRWTAVLGLHMQSNLTS
PQVVRRVVDQIVINPHYDRRRKVNDIAMMHLEFKVNYTDYIQPICLPEENQIFIPGRTCS
IAGWGYDKINAGSTVDVLKEADVPLISNEKCQQQLPEYNITESMICAGYEEGGIDSCQGD
SGGPLMCQENNRWFLVGVTSFGVQCALPNHPGVYVRVSQFIEWIHSFLH
Download sequence
Identical sequences P97435
NP_032967.1.92730 10090.ENSMUSP00000023566 ENSMUSP00000023566 ENSMUSP00000023566 ENSMUSP00000023566

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]