SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000001053 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000001053
Domain Number 1 Region: 38-294
Classification Level Classification E-value
Superfamily Cysteine proteinases 1.55e-36
Family Calpain large subunit, catalytic domain (domain II) 0.0066
Further Details:      
 
Weak hits

Sequence:  ENSGGOP00000001053
Domain Number - Region: 784-854
Classification Level Classification E-value
Superfamily Globin-like 0.000385
Family Globins 0.011
Further Details:      
 
Domain Number - Region: 627-659
Classification Level Classification E-value
Superfamily Cysteine proteinases 0.00174
Family Calpain large subunit, catalytic domain (domain II) 0.034
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000001053   Gene: ENSGGOG00000001069   Transcript: ENSGGOT00000001077
Sequence length 1667
Comment pep:known_by_projection chromosome:gorGor3.1:6:147532247:147738425:1 gene:ENSGGOG00000001069 transcript:ENSGGOT00000001077 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MASKQTKKKEVHRINSAHGSDKSKDLYPFGSNIQSGSIEQKKGKFPIWPEWSEADINSEK
WDAGKGAKEKDKTGKSPVFHFFEDPEGKIELPPSLKIYSWKRPQDILFSQTPVVVKNEIT
FDLFSANEHLLCSELMRWIISEIYAVWKIFNGGILSNYFKGTSGEPPLLPWKPWEHIYSL
CKAVKGHMPLFNSYGKYVVKLYWMGCWRKITIDDFLPFDEDNNLLLPATTYEFELWPMLL
SKAIIKLANIDIHVADRRELGEFTVIHALTGWLPEVISLHPGYMDKVWELLKEILPEFKL
SDEASSESKIAVLDSKLKEPGKEGKEGKEIKDGKEVKDVKEFKPESSLTTLKAPEKSDKV
PKEKTDARDIGKKRSKDGEKEKFKFSLHGSRPSSEVQYSVQSLSDCSSAIQTSHMVVYAT
FTPLYLFENKIFSLEKMADSAEKLREYGLSHICSHPVLVTRSRSCPLVAPPKPPPVPPWK
LVRQKKETVITDEAQELIVKKPERFLEISSPFLNYRMTPFTIPTETHFVRSLIKKGIPPG
SDLPSVSETDETATHSQTDLSQITKATSQGNTASQVILGKGTDEQTDFGLGDAHQSDGLN
LERDIVSQATATQEKSQEELPTTNNSVSKEIWLDFEDFCVCFQNIYIFHKPSSYCLNFQK
SEFKFSEERVSYYLFVDSLKPIELLVCFSALVRWGEYGALTKDSPPIEPGLLTAETFSWK
SLKPGSLVLKIHTYATKATVVRLPVGRHMLLFNAYSPVGHSIHICSMVSFVIGDEHVVLP
NFEPESCRFTEQSLLIMKAIGNVIANFKDKGKLSAALKDLQTAHYPVPFHDKELTAQHFR
VFHLSLWRLMKKVQITKPPPNFKFAFRAMALDLELLNSSLEEVSLVECLDVKYCMPTSDK
EYSAEEVAAAIKIQAMWRGTYVRLLMKARIPDTKENISVADTLQKVWAVLEMNLEQYAVS
LLRLMFKSKCKSLESYPCCQDEETKIAFADYTVTYQEQPPNSWFIVFRETFLVPQDMILV
PKVYTTLPICILHVVNNDTMEQVPKVFQKVVPYLYTKNKKGYTFVAEAFTGDTYVAASRW
KLRLIGSSAPLPCLSRDSPCNSFAIKEIRDYYIPNDKKILFRYSVKVLTPQPATIQVRTS
KPDAFIKLQVLENEETMVSSIGKGQAIIPAFHFLKSEKGLSSQSSKHILSFHSASKKEQE
VYVKKKAAQGIQKSPKGRAVSSIQDIGLPLVEEETTSIPTREDSSSTPLQNYKYIIQCSV
LYNSWPLTESQLTFVQALKDLKKSNTKAYGERHEELINLGSPDSLTISEGQKSSVTSKIT
RKGKEKSSEKEKTAKEKQALRFEPQISTVHPQQEDPNKPYWILRLVTEHNESELFEVKKD
TERADEIRAMKQAWETTEPGRAIKASQARLHYLSGFIKKTSDAESLPISESQTKPKEEVE
TAARGVKEPNSKNSAGSESKEMTQTGSGSAVWKKWQLTKGLRDVAKSTSSESGGVSSPGK
EEHEQSTRKENIQTGPRTRSPTILETSPRLIRKALEFMDLSQYVRKTDTDPLLQTDELNQ
QQAMQKAEEIHQFRQYRTRVLSIRNIDQEERLKLKDEVLDMYKEMQDSLDEARQKIFDIR
EEYRNKLLEAERLKLEALSAQEAATKLETEKTTPAPDTQKKKKGKKK
Download sequence
Identical sequences ENSGGOP00000001053 ENSGGOP00000001053

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]