SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc04_g007412 from Theobroma cacao B97-61/B2 v1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  Tc04_g007412
Domain Number 1 Region: 358-515
Classification Level Classification E-value
Superfamily Ribonuclease H-like 9.11e-37
Family Retroviral integrase, catalytic domain 0.0065
Further Details:      
 
Domain Number 2 Region: 728-1023
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 0.0000000000147
Family Reverse transcriptase 0.033
Further Details:      
 
Weak hits

Sequence:  Tc04_g007412
Domain Number - Region: 137-158
Classification Level Classification E-value
Superfamily Retrovirus zinc finger-like domains 0.0366
Family Retrovirus zinc finger-like domains 0.0083
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Tc04_g007412
Sequence length 1074
Comment Putative Retrovirus-related Pol polyprotein from transposon TNT 1-94
Sequence
MVATTWSDSDTSSSESEEEKVEERENLCLMAQDDETEISSSPYDISIDDLQDEYECLYDE
FEKLFSKYKILKKKVQRNFLQIELEHSKTDFEVLKLELDNKSEALQKTLDENITLKRLKN
ETLKRNVYYKKNSASVVSKCCICNKIGHLSYYCFTKRSIQKIRKIWVSKGSYVATNNQGP
IKVELHLKETYSRAQLKKKQPWYMDSGCSRHMTGDEMLFAKLDKRKGGTIFCGDDSKGRI
HGIRTDGKNSQTQISHVLLVKGLKHNLLSISQLCDKGFKVCFDSNKCEVIDISTNKVSFI
GRRLKNMYVVFLEDLEMNCEMCLVANAENDSWLWHRRLGYVRKQVRTSFKTKKIVSTTKP
LELLHIDLFGPISTTILEGKSYGFVIVDDYSRFTWVYFLAHKNDALPAFISHCTKVENEK
GLAIVSIKSDHGGEFESNEFEKFCNEKGLDHNFSTPKTPQQNGVVERKHRTLKEMAEAVN
TVAYILNRVSIRAMISRTPYKLYKGRKPNISHLRSFGCKCFVLNNGKQSLGKFDAKSDEV
IFLGYALNLKAYRVFNKRTLTVEESIHVVFDESNALQKEIHADDDDVEILEEQMEEMSLE
NNKNNEESSPKRENETPPLENLYAKNHPQEQVIGDIFEGVKTRKATRETCELLTFISQIE
PKTFEEAKKEENWILAMQEELDQFKRNRVWSLVSRPLNHPITDETFAPVARIEAKRLLLA
FACFMNFKLYQMDVKSAFLNGFIQEEVYVEQPPGFEDFEKSYHVFKFHKALYRLKQAPRA
SYERLSKFFIEKGYVRGSIDTTLFIKRYLKDLIVVQIYVDDIVFGATNEALCKNFAKEMQ
GEFEMSVMGELKYFLGLQIKQSEKGIFINQERYTQDMLKKFDILKLKSICTLTNPSTKLD
LDKKGKDVDQKLYKGMIGSLLYLTTRIWYSRESLFNLVGYSDADFARSKIDRESTSGTSL
FTVEAEYISLGSCCAQILWINQQLRDFGITVHNVPIFCDNISAINISKNPIQHSRTKHIE
IRHHFVRDHVVKGDIKIEFVNTFNQLADIFTKPLNKDRFCEIRRNLGMTNMQEL
Download sequence
Identical sequences Tc04_g007412

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]