SUPERFAMILY 1.75 HMM library and genome assignments server


Domain assignment for Tc02_g022471 from Theobroma cacao B97-61/B2 v1


Domain assignment details

Strong hits

Sequence: Tc02_g022471

Domain Number 1 Region: 627-791
Classification Level  Classification                          E-value
Superfamily           Ribonuclease H-like                     4.17e-40
Family                Retroviral integrase, catalytic domain  0.01

Domain Number 2 Region: 1076-1136,1166-1329
Classification Level  Classification                          E-value
Superfamily           DNA/RNA polymerases                     0.00000000231
Family                Reverse transcriptase                   0.035

Domain Number 3 Region: 223-262
Classification Level  Classification                          E-value
Superfamily           Retrovirus zinc finger-like domains     0.0000366
Family                Retrovirus zinc finger-like domains     0.0069

Domain Number 4 Region: 385-413
Classification Level  Classification                          E-value
Superfamily           Retrovirus zinc finger-like domains     0.0000541
Family                Retrovirus zinc finger-like domains     0.0041
 
Weak hits

Sequence: Tc02_g022471

Domain Number - Region: 313-381
Classification Level  Classification                          E-value
Superfamily           Inorganic pyrophosphatase               0.0068
Family                Inorganic pyrophosphatase               0.01
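
The region strings in the hits above can be turned into residue counts with a few lines of Python. The following is a minimal sketch, not part of the SUPERFAMILY server: the function names `parse_region` and `region_length` are hypothetical, and the code assumes that region coordinates are 1-based and inclusive and that a comma separates the segments of a discontinuous domain (as in Domain Number 2).

```python
from typing import Dict, List, Tuple


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Parse a region string such as '1076-1136,1166-1329' into (start, end) pairs."""
    segments = []
    for part in region.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        # Assumption: coordinates are 1-based and inclusive.
        if start < 1 or end < start:
            raise ValueError(f"invalid segment: {part!r}")
        segments.append((start, end))
    return segments


def region_length(region: str) -> int:
    """Total number of residues covered by all segments of a region."""
    return sum(end - start + 1 for start, end in parse_region(region))


# Region strings copied from the strong and weak hits listed above.
REGIONS: Dict[str, str] = {
    "Ribonuclease H-like": "627-791",
    "DNA/RNA polymerases": "1076-1136,1166-1329",
    "Retrovirus zinc finger-like domains (domain 3)": "223-262",
    "Retrovirus zinc finger-like domains (domain 4)": "385-413",
    "Inorganic pyrophosphatase (weak hit)": "313-381",
}

if __name__ == "__main__":
    for name, region in REGIONS.items():
        print(f"{name}: {region} covers {region_length(region)} residues")
```

Under the stated assumption, the two segments of Domain Number 2 are summed, so the gap between residues 1136 and 1166 is not counted towards the domain length.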
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.


Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Tc02_g022471
Sequence length 1362
Comment Putative Retrovirus-related Pol polyprotein from transposon TNT 1-94
Sequence
MLIYIRTIDYEMWYVIIDGPFMPSTMNVVTNELMPKPKSELTEAETKKVQINFKAINTLH
CALTPTEFNKVSSCTTTKQVWEKLRIIHERTSQVKESKMVLLTHSYEMFKMEPGEDITSM
FDRFTNITNKLSQLGKPIPEHELVKRLLRSLPKSWKPKVTAIREVKDLNIITLDEIYGSL
LTHELDITLKASILEEELEELSCDDDEELALVARKFRKLMNKRNRKLTRRGFRKDQGEEM
ICYEYKKPGHFKSECPLLKDETPKKNKKSKKAMMAAAWSDSDTSSSETVDEKSEERANIC
LMSQENETEVPSSPCINSYDDLQDEYECLYDEFENLFSKYKSLKKKAVLKQELENKFEAL
EITLDENTALKCLKNESSKRDVYHNKNSTKALPRCYNCGKHGHLSYECFKKRTVQKVKKI
WVPKGSFVPWYLDSGCSQHITGHEMLFVQLDKRKDGTVSFGNDSKGRIHGIGTVGKNFQT
QISHVLLVKGLKHNLLSISQLCDKGFRVFFDSTKCEVVKLNLINEEMIDMSTNKISFIGN
RLKNMYVIFLEDLEVNSEVCLIANAENDSWLWHRRLGHVSMNTMSKLIKKNLIVGLPKLK
FENDRICDACQLGKQVRTSFKSKKIVSTSRPLELLHVDLFGPISTTSLGGKSYGFVIVDD
YSRYSWVYFLAHKNDALHAFLSHCKKVENEKGLAIVSIRSDHGGEFENDEFEKFCNENGL
DHNFFALRTPQQNGFVEKKNRTLKEMARTMLCENNLPKYLWAEAVNTVTYILNRVSIRPF
ISKTPYEFYKGRKPNISHLRSFGCKCFVLNNRKQPLGKFDAKSDEAIFLGYALNSKAYRV
FNKRTLIVEESIHVVFDESNALQKEVHDDDDDVKVLEKQMEEMILENNKSNEESSPRRHD
DLPRSWRFVRDHPQDQIIGEISQGVKTRKATRETCEFSAFISQIEPKNFEEAEKEESWIM
IINKARFVVKGYNQEEGINYDKTFASVARIEAIRLLLAFAYFMNFKLFQMDVKNAFLNRL
IQEEVYVEQPPGFEDFENLIMFSNFIKLSKFLVEKGYDRGMIDTTLFIKRYLNDLIVVQI
YVDDIVFGATNEALCNNFAKEMQGEFEMSMMGELKYFIGLQIKQSKEGIFINQERYTYDM
LKKFDMLKLKPISTPMSLSTILDLDEKGKDVDQKLYRGMIGSFLYLIASRPDILFSVCLC
ARFQSQPKESQLTAIKRIFRYLIDIQELGIWYSRNSTLSLIGYSDADFVGRRMLVSWSSK
KQNSVTLSTAEAEYVSLGSYCAQILWIKQQLKDYGLTMHNVPIYCDNTSAINISKNHVQH
SRTKHIEIRHHFIKDHVMKNDIKIEFVNTLHQLTDIFTKPPE
Identical sequences Tc02_g022471
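
Because the sequence is shown wrapped at 60 residues per line, it has to be joined into a single string before region coordinates can be applied to it. The sketch below illustrates one way to do that. The helper names `join_wrapped_sequence` and `slice_residues` are hypothetical, coordinates are again assumed to be 1-based and inclusive, and only the first two lines of the sequence are included to keep the example short.

```python
from typing import Iterable


def join_wrapped_sequence(lines: Iterable[str]) -> str:
    """Join wrapped sequence lines into one string, dropping all whitespace."""
    return "".join("".join(line.split()) for line in lines)


def slice_residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"segment {start}-{end} is outside 1-{len(sequence)}")
    # Convert to Python's 0-based, end-exclusive slicing.
    return sequence[start - 1:end]


# First two lines of the sequence shown above (the full record has 1362 residues).
LINES = [
    "MLIYIRTIDYEMWYVIIDGPFMPSTMNVVTNELMPKPKSELTEAETKKVQINFKAINTLH",
    "CALTPTEFNKVSSCTTTKQVWEKLRIIHERTSQVKESKMVLLTHSYEMFKMEPGEDITSM",
]

if __name__ == "__main__":
    partial = join_wrapped_sequence(LINES)
    print(len(partial))
    print(slice_residues(partial, 1, 10))
```

With all lines of the record joined, comparing `len(sequence)` against the stated sequence length of 1362 is a quick check that nothing was lost in copying.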
