SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc00_g043842 from Theobroma cacao B97-61/B2 v1


Domain assignment details

Strong hits

Sequence: Tc00_g043842

Domain Number 1 Region: 407-539
Classification Level  Classification                          E-value
Superfamily           Ribonuclease H-like                     1.52e-23
Family                Retroviral integrase, catalytic domain  0.018

Domain Number 2 Region: 217-252
Classification Level  Classification                          E-value
Superfamily           Retrovirus zinc finger-like domains     4.36e-08
Family                Retrovirus zinc finger-like domains     0.0052

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s): Tc00_g043842
Sequence length: 983
Comment: Putative uncharacterized protein
Sequence:
MWDVTTDGPFMPSTLNVVTNDMIPKPMSEWIEAETKKVQTNFKAINTLHCVWDKLRIIHE
GTSQVKDSKIALLTHNYEMFKMEPGEDITSMLDRFTNITNKLSQLGQPIPEHEIVKRLLK
KAKDLNVITLDEICGSLLTHELELKEEEEEDRREAKEKKKSIALKASILEEELDELSYDD
DEELALVARRFRKLMGKKDQRLARRGFKRDQGFSWRARNKNDSNKMKERTCFECKKPGHF
KFECPLLKKETPKRNNKSMKAMVAVTCSDNDTSSSEVEEEKAEERENLCLMALDDESKLD
KRKGGTVSFGDDSKGRIHGIGTVGKNSQTQISHLLLVKGLKHNLLSISQLCDKRFKVCFD
LTKCEVIDISTNKVLFIGKRIKNMYIVFLKNLEMDYETCLVAKLKMMVGYGKSYGFVIVD
DYSRYTWVYFLAHKNDALPAFISHCRKVENEKGLAIVSIRSDHGGEFEEDEFENFCNEKG
LDHNFSTPRTPHNIPKYFWAEVVNIAPYIFNRVSIRAMISKTPYELYKGRKPNISHLRSF
SCKCFVLNNGKQPLGKFDAKSDKAIFLGYALNSKAYRVFNKRSLTVEESIHVVFNESNTL
QKEISDDENDTDILEKQMEEMSLDDKKNSEENTLDRDIKPPPIETLYNKDHPQKQIISDI
SHGVKTRRATRETCEFSAFISQIEPTKELDQFKRNRVWSLVSRPLNHPIVGTKRVFRNKV
DEQGNVVRNKARLVAQGYNQEEGIDYDETFAPVARIEAIRLLLAFECFMNFKLFQMDVKS
ERLSKFLVEIGYVRGSIDTTLFIKRYLNDLIVVQIYVDDIIFGVTNEALCKKFKGIFINQ
ERYIQDMLKRFDMLKLKSICTPMSPSTRLDVDEKEKDVDQKLYRGMIGSLLYLTVSRPDI
QFSVCLCACFQSQPKESHLTTIKRSFRYLIDTQTLGIWYPRESSFSLIGYSNVDFASSKI
DRKSTSRTCQFLGSMLVSWSIKK
Identical sequences: Tc00_g043842
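
The domain regions listed above (407-539 and 217-252) can be cut out of the protein sequence with a few lines of Python. The following is a minimal sketch, not part of the SUPERFAMILY server: the function names are hypothetical, and it assumes that region coordinates are 1-based and inclusive, which this page does not state explicitly. The toy input is only the first 20 residues of the sequence; the full 983-residue sequence would be passed in the same way.

```python
def join_sequence_lines(lines):
    """Join wrapped sequence lines into one string, dropping whitespace."""
    return "".join(line.strip() for line in lines)


def region_length(start, end):
    """Length of a region such as 407-539 (assumed 1-based, inclusive)."""
    if start < 1 or end < start:
        raise ValueError(f"invalid region {start}-{end}")
    return end - start + 1


def extract_region(sequence, start, end):
    """Return residues start..end of sequence (assumed 1-based, inclusive)."""
    region_length(start, end)  # validates the coordinates
    if end > len(sequence):
        raise ValueError(f"region {start}-{end} exceeds sequence length {len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Toy input: the first 20 residues of Tc00_g043842, wrapped over two lines.
    toy = join_sequence_lines(["MWDVTTDGPF", "MPSTLNVVTN"])
    print(extract_region(toy, 3, 6))   # DVTT
    print(region_length(407, 539))     # 133 (Domain Number 1)
    print(region_length(217, 252))     # 36  (Domain Number 2)
```

With the full sequence joined into one string, `extract_region(sequence, 407, 539)` would return the stretch assigned to the Ribonuclease H-like superfamily under the same coordinate assumption.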
