SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc09_g023671 from Theobroma cacao B97-61/B2 v1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  Tc09_g023671
Domain Number 1 Region: 227-330
Classification Level Classification E-value
Superfamily Ribonuclease H-like 4.17e-17
Family Retroviral integrase, catalytic domain 0.028
Further Details:      
 
Weak hits

Sequence:  Tc09_g023671
Domain Number - Region: 440-564,633-788
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 0.000129
Family Reverse transcriptase 0.048
Further Details:      
 
Domain Number - Region: 62-90
Classification Level Classification E-value
Superfamily Retrovirus zinc finger-like domains 0.00126
Family Retrovirus zinc finger-like domains 0.0061
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Tc09_g023671
Sequence length 806
Comment Putative uncharacterized protein
Sequence
MLQIELKKSKADFELLKLELESKNIALQKVMDENVALKASMKENLKLNLENEEKPKKITM
FKRKNHASSQHCYVCVKKGHLSYNCYHNGNRLPKTKLVWVPKESHVLVNPQGPIKVWERE
PWYLDSGCSRQMTGNENLFVNLDKNKSGSVFFGDDSKSIIQRIGIIGLKHNLLSTSQLCD
RGFKVCFDSHGCQVIDIDTNKVAFIGKRIKNMYVIFLDEIESSESCLIANDVCDSWLWCK
RVRYVSLTTVNVRSDYGVKFENDDFEIFCNENDFDHNFSALRTLQQNGIAERKNKTLKEI
ARIMLCENNLPKYFWAKVVNTVAYILNRVSIRLRSFGCKCFVLNNRKHSLGKFDAKSDEG
IFLGYPLNSKAYRVFNKRTLVIEESIHIVFYETNAAQRKVVFNDDDAEDIEKKMEKINLD
NKVDDGKIIAMQEELGQFERNKVWTLVPRTTNHPIVGTKWVFKTKMDKLGNVVRNKARLV
FQGYNQEEGIDYDETFALVARLETIRLLLAYALYVKQPPSFEVFNKIDHMYKLHKALYGL
KQAPRAWYERLPKFLIEKGYSRGSVDNVVTCSPPRNATVPPLLYTRFSGSGGGTPHCLVP
GHPGASSTNLPYAKRGAWQDNTFFIKKNETDLIVIQTYVDDIVFGVTNNRLCENFIKEMH
GEFEMSMMGELKFFLGLQINGVNMKVIRIPMSLSKKLDKDDRGKNMDQKLYRGVIGSLFY
LTESHFTAMKRIFRYLLDTQGLGLWYPKSSSFNLLGYFNADFAVPIYRDNMSAINISKNS
VQHSKTRYIEIKHHFIIDHVPKGDVK
Download sequence
Identical sequences Tc09_g023671

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]