SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc00_g061050 from Theobroma cacao B97-61/B2 v1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  Tc00_g061050
Domain Number 1 Region: 457-817
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 1.67e-117
Family Reverse transcriptase 0.00015
Further Details:      
 
Domain Number 2 Region: 970-1079
Classification Level Classification E-value
Superfamily Ribonuclease H-like 0.00000000000000109
Family mu transposase, core domain 0.056
Further Details:      
 
Domain Number 3 Region: 305-401
Classification Level Classification E-value
Superfamily Acid proteases 0.00000205
Family Retroviral protease (retropepsin) 0.043
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Tc00_g061050
Sequence length 1183
Comment Putative retrotransposon protein, identical
Sequence
MSLKTTHALATFFVALTGQAQAGPVPPVVFPVTTSIPPPPIPPQTPNVSISKKLKEARQL
GCILFTGELDATAARDWVIQVSETLTDMELDDEMKLKVATRLFEKRARTWWSSVKSRSPI
QLTWMDFLREFDGQYYTYFHQKEKNREFFSLKQGNMTIEDYETRFNELMSYVPELVRLEQ
DQVNYFKKRLSNEIRERMIVTGKESYKEVMHMALQAEKLVTENRRIRAEFTKRKNLRGTI
VTSTPLTRLSILRRDTSGSQSRQGPVIQSGMESNTPKQPSSRPQLEILTKVFAVTEDEAR
VRPSTMTCTMIVFDRDTHILIDSGSDISYVSISFASFLDRNLSPLEEEIVVHIPLGEWLV
RNTYYSDCGIKVGEEEFMGDLIPLEIRDFDFILGMDWLSTHLVKGAKVVFTRERQVLPFC
VISTLKALKLVRKGYPAYLAHVIDISREEPKLENVPVVSEFPDVFPDELPELPPDCEFEF
TIELLLSTAPISIPPYKMASVELKELKVQLKELVDKGFIRPSTSPWGALVLFEKKKDSTL
RLCLDYRQLNRMTIKNKYPLTRYDHYEFLVMPFGLTNTPVVFMVLMNRLFHPYLDKFVIV
FIDDILVYSRDNDEHTTHLRKVLQTLRERQLYAKFSKCEFLLQEVVFLGHVVSGQPKTVT
EIRSFLGLAGYYQKFVQGFSLIAAPLTRLTRKGVKFKWDNVCESRFQELKNWLTSTSVLT
LPVSGKGFVVYSDASTLGLGCILMQDEKVVAYASRQLKRHEANYPTHDLELATVVFALKI
WRHYLYGEHCRIFKDHKSLKYLLTQKELNLRKANVVADALSYEISWGQLRNGEDGSLLAS
FIVRPSLLNQIKNIQRFDDELRKKIQKLTDGGVSEFRFREDNILMFRDRVFVPEGNQLRQ
AIMEEAHSSAYALHPRSTKMYRTIRENYWWPGMKRDVAEFIVKCLVCQQVKEEHQRPVGT
LQSLPILEWKWEGKDSIWVIVDRLTKSTHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVSI
VSNRDPRFTSPLWPKFQEALKTKLKFSTTMEDMLRACVMDFLGSWDRHLPLVEFAYNNMG
ERKLVSVELIDLTNDKIKVIQERLKVAQDRQKSYADKRRKDLEFEIDDRVFLKVSPWKYV
IRFTKRGKLNPRYIGPFRIIERIGLVAYRLKLPLELDPIHNVX
Download sequence
Identical sequences Tc00_g061050

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]