SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc00_g063251 from Theobroma cacao B97-61/B2 v1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  Tc00_g063251
Domain Number 1 Region: 559-908
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 3.67e-132
Family Reverse transcriptase 0.0000223
Further Details:      
 
Domain Number 2 Region: 416-508
Classification Level Classification E-value
Superfamily Acid proteases 0.00000957
Family Retroviral protease (retropepsin) 0.063
Further Details:      
 
Weak hits

Sequence:  Tc00_g063251
Domain Number - Region: 328-358
Classification Level Classification E-value
Superfamily Retrovirus zinc finger-like domains 0.00195
Family Retrovirus zinc finger-like domains 0.0037
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Molecular Function IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) Tc00_g063251
Sequence length 908
Comment Putative retrotransposon protein, identical
Sequence
MSVYRDTVAVVTGSRGVPGCDNSIGIRITMPLQRGRPPLTRSVGRGRGRSQRHQPNTVEE
ESAASTIWAAPAAEQADSPSHPPSPQPPTGILAVPTKAAQALAAFFAAMTGQAQTGQVPP
VVPPTTLLVPPLVQDVSISKKLKEARQLGCVSFTGELDATVAKDWINQVSETLSDMGLDD
DMKLMVATRLLEKRARTWWNSVKSRFATLQTWSDFLREFDDLVKSEQDQASYFEEGLRNE
IRERMTVIGRERIQIEFAKRRNLGMSSSQPVKRGKDLATSGSTTSVSVTSPRPPFSQSQQ
RLLRFSRSAMTGSEKSSGGSDRCKNCENYHSGLCRGPTRCFQCGQTGHIRSNCPRLGRAT
VAASSPPARTDMQRRDSSRLSLRQGVAIRSDVESNTPTHPPLRPQTPVTGTMSLFDKDAY
VLIDSGSNRSYVSTTFASIVDRNLSPLEGEIVVHTPLGEQLIRNTCYRDCGVRVGEEEFR
GDLIPLEILDFDLILGMDWLTAHRVNMDCFQKEVVLQNSKGAEIVFVEEHQVLPSCVISA
IKASKLVQKGYPTYLDVPIVSEFPDVFPDDLRGLPPDRELGFPIDLLPGTAPISIPPYRM
APAELKELKVQLQDLVDKGFIRPSISPWGAPVLFVKKKDGTLRLCIDYHDLFDQLRGAMV
FSKIDLRSGYYQLRIKEQDVPKTAFRTRYGHYEFLVMSFGLTNAPAVFMDLMNRVFHPYL
DKFVIVFIDDILVYSKNDDENAAHLPIVLQTLHERQLYAKFSKCEFWLKEVVFLGHVVSG
AGIYLDSKKIEAILQWEQPRTVIEIRSFLGLAGYYRRFVQGFSLIAAPLTRLTRKGVKFE
WDDVCENRFQELKNRLTSAPVLTLPISGKEFVVYSDASKLGLGCVLMQDEKVIAYASRQL
KKHETNYP
Download sequence
Identical sequences Tc00_g063251

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]