SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000000147 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000000147
Domain Number 1 Region: 151-346
Classification Level Classification E-value
Superfamily Metalloproteases ("zincins"), catalytic domain 7.34e-60
Family Astacin 0.0000675
Further Details:      
 
Domain Number 2 Region: 768-875
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 4.71e-38
Family Spermadhesin, CUB domain 0.00022
Further Details:      
 
Domain Number 3 Region: 467-572
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 2.09e-37
Family Spermadhesin, CUB domain 0.0002
Further Details:      
 
Domain Number 4 Region: 353-462
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 3.53e-36
Family Spermadhesin, CUB domain 0.00045
Further Details:      
 
Domain Number 5 Region: 608-718
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 8.77e-36
Family Spermadhesin, CUB domain 0.00023
Further Details:      
 
Domain Number 6 Region: 899-992
Classification Level Classification E-value
Superfamily Spermadhesin, CUB domain 1.83e-27
Family Spermadhesin, CUB domain 0.00047
Further Details:      
 
Domain Number 7 Region: 719-761
Classification Level Classification E-value
Superfamily EGF/Laminin 0.000000000234
Family EGF-type module 0.0083
Further Details:      
 
Domain Number 8 Region: 573-605
Classification Level Classification E-value
Superfamily EGF/Laminin 0.0000000101
Family EGF-type module 0.011
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000000147   Gene: ENSGGOG00000000148   Transcript: ENSGGOT00000000149
Sequence length 1003
Comment pep:known_by_projection chromosome:gorGor3.1:4:176381005:176615950:1 gene:ENSGGOG00000000148 transcript:ENSGGOT00000000149 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MGLGTLSPRMLVWLVASGIVFYGELWVCTGLDYDYTFDGNEEDKTETIDYKDPCKAAVFW
GDIALDDEDLNIFQIDRTIDLTQNPFGNLGHTTGGLGDHVMSKKRGALYQLIDRIRRIGF
GLEQNNTVKGKVPLQFSGQNEKNRVPRAATSRTERIWPGGVIPYVIGGNFTGSQRAMFKQ
AMRHWEKHTCVTFIERSDEESYIVFTYRPCGCCSYVGRRGNGPQAISIGKNCDKFGIVVH
ELGHVIGFWHEHTRPDRDNHVTIIRENIQPGQEYNFLKMEPGEVNSLGERYDFDSIMHYA
RNTFSRGMFLDTILPSRDDNGIRPAIGQRTRLSKGDIAQARKLYRCPACGETLQESNGNL
SSPGFPNGYPSYTHCIWRVSVTPGEKIVLNFTTMDLYKSSLCWYDYIEVRDGYWRKSPLL
GRFCGDKLPEVLTSTDSRMWIEFRSSSNWVGKGFAAVYEAICGGEIRKNEGQIQSPNYPD
DYRPMKECVWKITVSESYHVGLTFQSFEIERHDNCAYDYLEVRDGTSENSPLIGRFCGYD
KPEDIRSTSNTLWMKFVSDGTVNKAGFAANFFKEGCEQRCLNTLGSYQCACEPGYELGPD
RRSCEAACGGLLTKLNGTITTPGWPKEYPPNKNCVWQVVAPTQYRISVKFEFFELEGNEV
CKYDYVEIWSGLSSESKLHGKFCGAEVPEVITSQFNNMRIEFKSDNTVSKKGFKAHFFSD
KDECSKDNGGCQHECVNTMGSYMCQCRNGFVLHENKHDCKEAECEQKIHSPSGLITSPNW
PDKYPSRKECTWEISATPGHRIKLAFSEFEIEQHQECAYDHLEVFDGETEKSPILGRLCG
NKIPDPLVATGNKMFVRFVSDASVQRKGFQATHSTECGGRLKAESKPRDLYSHAQFGDNN
YPGQVDCEWLLVSERGSRLELSFQTFEVEEEADCGYDYVELFDGLDSTAVGLGRFCGSGP
PEEIYSIGDSVLIHFHTDDTINKKGFHIRYKSIRYPDTTHTKK
Download sequence
Identical sequences ENSGGOP00000000147

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]