SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc00_g083401 from Theobroma cacao B97-61/B2 v1

Domain architecture


Domain assignment details

Strong hits

Sequence:  Tc00_g083401
Domain Number 1 Region: 473-856
Classification Level   Classification          E-value
Superfamily            DNA/RNA polymerases     9.78e-24
Family                 Reverse transcriptase   0.07
 
Domain Number 2 Region: 216-335
Classification Level   Classification                           E-value
Superfamily            Ribonuclease H-like                      3.79e-08
Family                 Retroviral integrase, catalytic domain   0.025
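
The two strong hits above can be handled programmatically. The following is a minimal Python sketch, not part of the SUPERFAMILY server: the `DomainHit` class, its `extract` method, and the `coverage` helper are hypothetical names introduced here for illustration. It assumes that region coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One superfamily-level hit (hypothetical container, not a SUPERFAMILY API)."""
    superfamily: str
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive
    evalue: float

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count.
        return self.end - self.start + 1

    def extract(self, sequence: str) -> str:
        # Convert 1-based inclusive coordinates to a Python slice.
        return sequence[self.start - 1:self.end]


# Values copied from the "Strong hits" section for Tc00_g083401.
SEQUENCE_LENGTH = 868
HITS = [
    DomainHit("DNA/RNA polymerases", 473, 856, 9.78e-24),
    DomainHit("Ribonuclease H-like", 216, 335, 3.79e-08),
]


def coverage(hits, sequence_length):
    """Fraction of residues covered by at least one hit."""
    covered = set()
    for hit in hits:
        covered.update(range(hit.start, hit.end + 1))
    return len(covered) / sequence_length


if __name__ == "__main__":
    # List hits in order along the protein, N-terminus first.
    for hit in sorted(HITS, key=lambda h: h.start):
        print(f"{hit.start}-{hit.end}\t{hit.length} aa\t{hit.superfamily}\t{hit.evalue:.2e}")
    print(f"coverage: {coverage(HITS, SEQUENCE_LENGTH):.1%}")
```

To pull out a domain's residues, pass the full protein sequence from the "Protein sequence" section (with line breaks removed) to `extract`.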
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Cellular Component IC (bits) H-Score
Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s): Tc00_g083401
Sequence length: 868
Comment: Putative Retrovirus-related Pol polyprotein from transposon TNT 1-94
Sequence:
MEQLEQLQILLTQHQLSAQVPSSNCSIAQKGNYPFALSANSESNGPWIMYSGASDHMTRC
HRFFSTYALCSKNLKISVVDGVLSSVVGKGFVNLANMTLKYMLHVPNLKCNLVSISKLTN
ALNCVTKFYPSFSEFQDLLSRRTIGNAKMHDRLYYLEDKGQVNKQTLLSHPSFPYLKNLF
PSLFKNKNINSFCCETCIFVKHTRTCYPNQPYKSSKPFSLIRCDIWGPSSVNNISGTKWL
ITFIDDHSRACWVYLLKEKSDAVLVSKPHNKMVLLNVKIAIYWRLYMHLCSQLMFLNSFD
EIILTTSYLINRLLSPILGFKTPLSILLKTYPQTRLFTFLSPKVFGCTSFVHNTSPTCEK
LYPKAIKCIFVRYFPNKKRYNYYYPTTKKMFVSLYVFFLEDQCFYPNVTLQKEILREKKL
WDCIIPLPVVADILETPPRNKNPPIMASTNFSLETREDPNGVVVLEEMRALKRNETWDVM
ELLEGKSSVWCKWVFTIKYKSNGEIERYKAHLITKGFTQVFGVDYTEKFTPVAKLNTIRV
LLSLTANFDLALHQMDVKNAFLNKELDEEVYMDLPLGFEGAIGNKKVCRLKKSLYGLKQS
RRAWFDRFAKTIKRYGYQQGQTDHTLFFNHSQDGKKTILIVHVDDIILTENDIEEMERLK
KTLKSEFEIKDLGKYTLDLLKDIGMLGCKLAATPIVMNMKLGRTRSGIPVDKGRYQRLVG
RLIYLSYTRLDIYLKSTLDKGLFFKKNELRSVEAFTDVDWASSVEDKRSTFGYCTKVWGN
LVTWRSKKQPVVACSSAKVELRALAQGTCELIWLKRLMEELKVFSMGPMKLYCDNKAAIN
TAHNLVHHDRTKHVEIDRHFIKKKIEYG
Identical sequences Tc00_g083401
