SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000021572 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000021572
Domain Number 1 Region: 142-337
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 7.34e-60
Family Astacin 0.0000675
Further Details:      
 
Domain Number 2 Region: 769-876
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 4.71e-38
Family Spermadhesin, CUB domain 0.00022
Further Details:      
 
Domain Number 3 Region: 458-562
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 2.62e-37
Family Spermadhesin, CUB domain 0.0002
Further Details:      
 
Domain Number 4 Region: 344-453
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 3.53e-36
Family Spermadhesin, CUB domain 0.00045
Further Details:      
 
Domain Number 5 Region: 609-719
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 8.9e-36
Family Spermadhesin, CUB domain 0.00023
Further Details:      
 
Domain Number 6 Region: 900-993
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 1.83e-27
Family Spermadhesin, CUB domain 0.00047
Further Details:      
 
Domain Number 7 Region: 720-762
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000000977
Family EGF-type module 0.0083
Further Details:      
 
Domain Number 8 Region: 574-606
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000129
Family EGF-type module 0.011
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000021572   Gene: ENSGGOG00000000148   Transcript: ENSGGOT00000031348
Sequence length 1004
Comment pep:known_by_projection chromosome:gorGor3.1:4:176380840:176615950:1 gene:ENSGGOG00000000148 transcript:ENSGGOT00000031348 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MLVWLVASGIVFYGELWVCTGLDYDYTFDGNEEDKTETIDYKDPCKAAVFWGDIALDDED
LNIFQIDRTIDLTQNPFGNLGHTTGGLGDHVMSKKRGALYQLIDRIRRIGFGLEQNNTVK
GKVPLQFSGQNEKNRVPRAATSRTERIWPGGVIPYVIGGNFTGSQRAMFKQAMRHWEKHT
CVTFIERSDEESYIVFTYRPCGCCSYVGRRGNGPQAISIGKNCDKFGIVVHELGHVIGFW
HEHTRPDRDNHVTIIRENIQPGQEYNFLKMEPGEVNSLGERYDFDSIMHYARNTFSRGMF
LDTILPSRDDNGIRPAIGQRTRLSKGDIAQARKLYRCPACGETLQESNGNLSSPGFPNGY
PSYTHCIWRVSVTPGEKIVLNFTTMDLYKSSLCWYDYIEVRDGYWRKSPLLGRFCGDKLP
EVLTSTDSRMWIEFRSSSNWVGKGFAAVYEAICGGEIRKNEGQIQSPNYPDDYRPMKECV
WKITVSESYHVGLTFQSFEIERHDNCAYDYLEVRDGTSENSPLIGRFCGYDKPEDIRSTS
NTLWMKFVSDGTVNKAGFAANFFKGNSQNSKFQKXGCEQRCLNTLGSYQCACEPGYELGP
DRRSCEAACGGLLTKLNGTITTPGWPKEYPPNKNCVWQVVAPTQYRISVKFEFFELEGNE
VCKYDYVEIWSGLSSESKLHGKFCGAEVPEVITSQFNNMRIEFKSDNTVSKKGFKAHFFS
DKDECSKDNGGCQHECVNTMGSYMCQCRNGFVLHENKHDCKEAECEQKIHSPSGLITSPN
WPDKYPSRKECTWEISATPGHRIKLAFSEFEIEQHQECAYDHLEVFDGETEKSPILGRLC
GNKIPDPLVATGNKMFVRFVSDASVQRKGFQATHSTECGGRLKAESKPRDLYSHAQFGDN
NYPGQVDCEWLLVSERGSRLELSFQTFEVEEEADCGYDYVELFDGLDSTAVGLGRFCGSG
PPEEIYSIGDSVLIHFHTDDTINKKGFHIRYKSIRYPDTTHTKK
Download sequence
Identical sequences ENSGGOP00000021572 ENSGGOP00000000147

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]