SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000020870 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000020870
Domain Number 1 Region: 8-343
Classification Level Classification E-value
Superfamily Cysteine proteinases 8.34e-116
Family Calpain large subunit, catalytic domain (domain II) 0.0000000881
Further Details:      
 
Domain Number 2 Region: 348-496
Classification Level Classification E-value
Superfamily Calpain large subunit, middle domain (domain III) 6.8e-37
Family Calpain large subunit, middle domain (domain III) 0.0000713
Further Details:      
 
Domain Number 3 Region: 513-672
Classification Level Classification E-value
Superfamily EF-hand 1.12e-26
Family Penta-EF-hand proteins 0.0013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000020870   Gene: ENSGGOG00000003021   Transcript: ENSGGOT00000027769
Sequence length 684
Comment pep:known_by_projection chromosome:gorGor3.1:2a:31743835:31787077:-1 gene:ENSGGOG00000003021 transcript:ENSGGOT00000027769 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSLWPPFRCRWKLAPRYSRRASPQQPQQDFEALLAECLRNGCLFEDTSFPATLSSIGSGS
LLQKLPPRLQWKRPPELHSNPQFYFAKAKRLDLCQGIVGDCWFLAALQALALHQDILSRV
VPLNQSFTEKYAGIFRFWFWHYGNWVPVVIDDRLPVNEAGQLVFVSSTYKNLFWGALLEK
AYAKLSGSYEDLQSGQVSEALVDFTGGVTMTINLAEAHGNLWDILIKATYNRTLIGCQTH
SGEKILENGLVEGHAYTLTGIRKVTCKHRPEYLVKLRNPWGKVEWKGDWSDSSSKWELLS
PKEKILLLRKDNDGEFWMTLQDFKTHFVLLVICKLTAGLLSQEAAQKWTYTMREGRWEKR
STAGGQRQLLQDTFWKNPQFLLSVWRPEEGRRSLRPCSVLVSLLQKPRHRCRKRKPLLTI
GFYLYRYPQYHDDQRRLPPEFFQRNAPLSQPDRFLKEKEVSQELCLEPGTYLIVPCILEA
HQKSEFILRVFSRRHIFYEIGSNSGVVFSKEIEDENERQNEFFTKFFEKHPEINAVQLQN
LLNQMTWSNLGSRQPFFSLEACQGILALLDLNASGTMSIQEFRDLWKQLKLSQKVFHKQD
RGSGYLNWEQLHAAMREAGIMLSDDVCQLMLICYGGPRLQMDFVSFIHLMLRVENMEDVF
QNLTQDGKGIYLQKPEWMMMALYS
Download sequence
Identical sequences ENSGGOP00000020870

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]