SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc02_g007220 from Theobroma cacao B97-61/B2 v1

Domain architecture


Domain assignment details

Strong hits

Sequence:  Tc02_g007220
Domain Number 1 Region: 14-172
Classification Level | Classification | E-value
Superfamily | NAD(P)-binding Rossmann-fold domains | 2.45e-44
Family | 6-phosphogluconate dehydrogenase-like, N-terminal domain | 0.00013
 
Domain Number 2 Region: 776-902
Classification Level | Classification | E-value
Superfamily | Ribonuclease H-like | 1.26e-29
Family | Retroviral integrase, catalytic domain | 0.012
 
Domain Number 3 Region: 174-305
Classification Level | Classification | E-value
Superfamily | 6-phosphogluconate dehydrogenase C-terminal domain-like | 2.04e-25
Family | Hydroxyisobutyrate and 6-phosphogluconate dehydrogenase domain | 0.0014
 
Domain Number 4 Region: 1026-1232,1261-1425
Classification Level | Classification | E-value
Superfamily | DNA/RNA polymerases | 4.23e-23
Family | Reverse transcriptase | 0.03
 
Domain Number 5 Region: 283-452
Classification Level | Classification | E-value
Superfamily | RmlC-like cupins | 6.07e-09
Family | Germin/Seed storage 7S protein | 0.01
 
Domain Number 6 Region: 578-608
Classification Level | Classification | E-value
Superfamily | Retrovirus zinc finger-like domains | 1.5e-06
Family | Retrovirus zinc finger-like domains | 0.0028
 
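The strong hits above are listed by E-value, not by position, and one region is split into two segments. The following is a minimal Python sketch, not part of the SUPERFAMILY server, that transcribes the six strong hits, parses the region strings, orders the domains from N-terminus to C-terminus, and reports which assigned regions overlap. The names STRONG_HITS, parse_region, order_along_sequence and overlapping_pairs are hypothetical helpers introduced only for this illustration.

```python
from typing import List, Tuple

# Strong hits transcribed from the listing above:
# (domain number, superfamily, region string, superfamily E-value)
STRONG_HITS = [
    (1, "NAD(P)-binding Rossmann-fold domains", "14-172", 2.45e-44),
    (2, "Ribonuclease H-like", "776-902", 1.26e-29),
    (3, "6-phosphogluconate dehydrogenase C-terminal domain-like", "174-305", 2.04e-25),
    (4, "DNA/RNA polymerases", "1026-1232,1261-1425", 4.23e-23),
    (5, "RmlC-like cupins", "283-452", 6.07e-9),
    (6, "Retrovirus zinc finger-like domains", "578-608", 1.5e-6),
]


def parse_region(region: str) -> List[Tuple[int, int]]:
    """Split a region string such as '1026-1232,1261-1425' into (start, end) segments."""
    segments = []
    for part in region.split(","):
        start, end = part.split("-")
        segments.append((int(start), int(end)))
    return segments


def order_along_sequence(hits):
    """Sort hits by the start of their first segment (N-terminus to C-terminus)."""
    return sorted(hits, key=lambda hit: parse_region(hit[2])[0][0])


def overlapping_pairs(hits):
    """Return pairs of domain numbers whose regions share at least one residue."""
    pairs = []
    for i, a in enumerate(hits):
        for b in hits[i + 1:]:
            if any(
                s1 <= e2 and s2 <= e1
                for s1, e1 in parse_region(a[2])
                for s2, e2 in parse_region(b[2])
            ):
                pairs.append((a[0], b[0]))
    return pairs


if __name__ == "__main__":
    for number, superfamily, region, evalue in order_along_sequence(STRONG_HITS):
        print(f"{region:>22}  domain {number}  {superfamily}  ({evalue:.2e})")
    print("Overlapping domain pairs:", overlapping_pairs(STRONG_HITS))
```

Ordered this way, the dehydrogenase-like domains and the cupin domain sit in the N-terminal part of the protein, followed by the zinc finger-like, integrase-like and reverse transcriptase regions. The only overlap among the strong hits is between domain 3 (174-305) and domain 5 (283-452).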
Weak hits

Sequence:  Tc02_g007220
Domain Number - Region: 1470-1525
Classification Level | Classification | E-value
Superfamily | RmlC-like cupins | 0.000267
Family | Germin/Seed storage 7S protein | 0.0059
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor


Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) Tc02_g007220
Sequence length 1559
Comment Putative Retrovirus-related Pol polyprotein from transposon TNT 1-94
Sequence
METPYPKPISPAETRVGWIGIGIMGAAMASHLLSAGYSLTIYARTPSKASSLQSQGAHIT
DSPQELARRCNVVFTMVGNPQDVKQIVLETNGVLSTLNPGAVLVDHTTSSPSLAREIYAS
ARKKGCWSIDAPVSGGDIGAREGKLAIFAGGESSVVEWLKPLFDLMGRVTYMGEAGCGQS
CKIGNQIMVGANLMGLSEGFVFAEKAGLDLRKYMEAVRGGGAASMAMELFGGRMIRRDFK
PGGFAEYMVKDMGLGVDVVKEDDDGKVVVLPGAALGKQMFSAMVANGDGKLGTQGLITVV
ERINGDIIPIPLGSVSWWYNHGRFDLVMLFPGEASKACLPCAISYFLLTGAIGHLSAFSL
EVIARTYHRCRKKHKNLPIDSFNTWTKNLESASPDVDVKKGGKSTTLTRADFPLLEEVGL
SANLLVLKANATRSPMYTADAQVFYVSYGSGTDKFLSLKYFEFKISDNLPIMDQVHNLQV
LVSKLKDLEVKVSDALQIGTILSKLPQSWNNYRKKVLHSMESLSIEQFMTQMQIECETRA
CDTLFQPSDSKVNLVSQNSKNFGKNSMKFKSSALKVSRQTFKKKDKKNIKCFHCGKKGHM
ISECRSRKVGKNFGSINPEKANIIEEAVNELVAMVSTMNIGMVTELNMVVDSNKSKDWWL
DSSATIHVCNDKNLFKTYMKINKSENVLMGNHVTTKIEGKGTVELNFTSGQKLTLLNVLH
VSEIRKNLVSASLLAKKGFKIVIESDNVIVSNGEKCEICIQAKMTKKPFRSVKRNSQLLD
IVHSDICELNEKLTRGGKRYFITFIDDYSKFTYVYLIKTKDETFQKFKEYKSVVENQKGR
KIKILRSVRGGEYFPIEFDNFCEENGLIHQKSAPYSPQQNGLAERKNKTLVDMTNAMLLN
LGPRAIKGIFVGYAQHSKAYRILDLQSNIIVESIHVEFVENKFMYNFNNKDCEQNVLIPS
CDQSNMTSHNTSSKRKRADTQIELKRNPILLNVEDDPKTFKEAMSSRDVAFWKEAINDEM
DSILSNKTWVLVDLSPGSKPIGCKWVFRRKYNTDGSVQTFKARLVAKGFTITSIRTLLAL
ASIHKLHVHQMNVKTAFLNGDLNEEIYMEQLEGFILPENERKVCRLIKSLYGLKQAPKQW
HEKFDSPILSNGFVHNNADKCIYSKFTKSYGVIICLYVDDLLIFSTNMLGIVETKKYLTS
VFKMKDLNEVDTILGIKVKRDQNVISLNQSHYIEKIIHKYSHLEIKEFNTPYDSSIKLTK
NCERVMAQLEDASVIGSIMYVAHYTKPDIAFAVCKLSRFTSKPSKDHWKAITRVLGYLKK
TKNLKICYNGFPSILKDFLILIFTLEGGAISWTSKKQTCITHSTMESKFLALAAAGKEAE
WLRNLLLDIKLWPQPMPAISIYCDSEATMCVAHNKIYNGKSRHISLRHAYIRELISNGII
TIVYVKSCKNLVDPLTKALPREITRKLVLDTKVKTGQLFVVPRFFMVSLLADREGMECFS
VMTSALPVIGELAGKDSMLNTIPSALQVCLNVTPKFTQEFKGIMETGTIIVPPMFGASI
Identical sequences Tc02_g007220
