SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for ENSGGOP00000014542 from Gorilla gorilla 76_3.1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  ENSGGOP00000014542
Domain Number 1 Region: 590-782,1159-1314
Classification Level Classification E-value
Superfamily Cysteine proteinases 3.92e-107
Family Ubiquitin carboxyl-terminal hydrolase, UCH 0.00000386
Further Details:      
 
Domain Number 2 Region: 372-491
Classification Level Classification E-value
Superfamily HSP20-like chaperones 9.59e-37
Family GS domain 0.000000593
Further Details:      
 
Domain Number 3 Region: 119-205
Classification Level Classification E-value
Superfamily HSP20-like chaperones 0.000000000000279
Family GS domain 0.032
Further Details:      
 
Domain Number 4 Region: 888-936
Classification Level Classification E-value
Superfamily HIT/MYND zinc finger-like 0.00000000279
Family MYND zinc finger 0.0085
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Protein: ENSGGOP00000014542   Gene: ENSGGOG00000014895   Transcript: ENSGGOT00000014955
Sequence length 1419
Comment pep:known_by_projection chromosome:gorGor3.1:3:50365140:50377032:-1 gene:ENSGGOG00000014895 transcript:ENSGGOT00000014955 gene_biotype:protein_coding transcript_biotype:protein_coding
Sequence
MSGGASATGPRRGPPGLEDTTSKKKQKDRANQESKDGDPRKETGSRYVAQAGLELLASGD
PSASASHAAGITGSRHHTRLFFPSSSGSASTPQEEQTKEGACEDPHDLLATPPPELLLDW
RQSAEEVIVKLRVGVGPLQLEDVDAAFTDTDCVVRFAGGQQWGGVFYAEIKSSCAKVQTR
KGSLLHLTLPKKVPMLTWPSLLKKPLGTQELVPGLQCQENGQELSPIALEPGPEPHRAKQ
EARNQKRAQGRGEVGSGAGPGAQAGPSAKRAVHLCRGPEGEGSRDDPGPRGDAPPFVADP
ATQAEADEQLCIPPLNPQTCLLGSEENLAPLAGEKAVPPGNDPVSPAMVRSRNPGKDDCA
KEEMAVAADAATLVDEPESMVNLAFVKNDSYEKGPDSVVVHVYVKEICRDTSRVLFREQD
FTLIFQTRDGNFLRLHPGCGPHATFRWQVKLRNLIEPEQCTFCFTASRIDICLRKRQSQR
WGGLEAPAARVGGAKVAVPTGPTPLDSTPPGGAPHPLTGQEEARAVEKDKSKARSEDTGL
DSVATRTPMEHVTPKPETHLASPKPTCMVPPMPHSPVSGDSVEEEEEEEKKVCLPGFTGL
VNLGNTCFMNSVIQSLSNTRELRDFFHDRSFEAEINYNNPLGTGGRLAIGFAVLLRALWK
GTHHAFQPSKLKAIVASKASQFTGYAQHDAQEFMAFLLDGLHEDLNRIQNKPYTETVDSD
GRPDEVVAEEAWQRHKMRNDSFIVDLFQGQYKSKLVCPVCAKVSITFDPFLYLPVPLPQK
QKVLPVFYFAREPHSKPIKFLVSVSKENSTASEVLDSLSQSVHVKPENLRLAEVIKNRFH
RVFLPSHSLDTVSPSDVLLCFELLSSELAKERVVVLEVQQRPQVPSVPISKCAACQRKQQ
SEDEKLKRCTRCYRVGYCNQLCQKTHWPDHKGLCRPENIGYPFLVSVPASRLTYARLAQL
LEGYARYSVSVFQPPFQPGRMALESQSPGCTTLLSTGSLEAGDSERDPIQPPELQLVTLM
AEGDTGLPRVWAAPDRGPVPSTSGISSEMLASGPIEVGSLPAGERVSRPEAAVPGYQHPS
EAMNAHTPQFFIYKIDSSNREQRLEDKGDTPLELGDDCSLALVWRNNERLQEFVLVASKE
LECAEDPGSAGEAARAGHFTLDQCLNLFTRPEVLAPEEAWYCPQCKQHREASKQLLLWRL
PNVLIVQLKRFSFRSFIWRDKINDLVEFPVRNLDLSKFCIGQKEEQLPSYDLYAVINHYG
GMIGGHYTACARLPNDRSSQRSDVGWRLFDDSTVTTVDESQVVTRYAYVLFYRRRNSPVE
RPPRAGHSEHHPDLGPAAEAAASQASRIWQELEAEEEPVPEGSGPLGPWGPQDWVGPLPR
GPTTPDEGCLRYFVLGTVAALVALMLNVFYPLVSQSRWR
Download sequence
Identical sequences ENSGGOP00000019802 ENSGGOP00000014542

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]