SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for Tc00_g072080 from Theobroma cacao B97-61/B2 v1

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  Tc00_g072080
Domain Number 1 Region: 372-676
Classification Level Classification E-value
Superfamily DNA/RNA polymerases 1.89e-96
Family Reverse transcriptase 0.00039
Further Details:      
 
Domain Number 2 Region: 834-990
Classification Level Classification E-value
Superfamily Ribonuclease H-like 6.22e-35
Family Retroviral integrase, catalytic domain 0.074
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) Tc00_g072080
Sequence length 1130
Comment Hypothetical protein
Sequence
MSINRDVAAVVMGPREVPGRDNSIGIKAFGLKIIMPPRRELPPITGSTGRGRGRPRQSRS
DSIEEESAASSFRAVPTVESTDIPVPPPPPVATLSILAMSPEAAQALATFFATLTGQAQA
GPVSPAVSPVTALVPPPPIPPQTLDVSISKKLKEARQLGCVSFTGELDATAARDWVIQVL
ETLADMGLDDEMKLKVATQLFEKRACTWWSSVNYFEEGLRNGIRERMIMTGKESYKEVMQ
MALRVEKLIGTFSIRGRDCSAYSSRRRLVRNTYYRDCGIKVGEEKFMGDLIPLEIRDFDL
ILSMDWLSTHRAKVDCFKKEVILQSLEGAEVVFTGERRVLQFCVISAFKALKLVRKRYPT
YLAHVIDISREEPKLENVPIFEFTIDLLPGTAPISIPPYRMAPAELKLKVQLQELVDKGL
IRPSTSSWGAPSLFVKKKDGTLRLCIDYRQLNKMTIKNKYPLSEYHQLRIKEQDVPKIAF
RTRYGHYEFLVMPFGLTNAPIAFVDLMNRVFHPYLDKFVIVFIDDILIYSRDNDEHATHL
RIVLQTLRERQLYIVTEIRSFLGLAGYYQRFVKGFSLIAAPLTRLTRMGVKFEWDDVCEN
RFQKLKNRLTSFIVMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFTLKIWRHYLYREQ
RRWLELIKDYDLVIDYHPGKANVVADALSCKSSSSLAALQSCYFLALLEMKSLGVQLRNG
EDGSLLASFIVRPLLLNQMKDIQRSDDELRKKIQKFIDGGVSEFRFREDNVLVFRDRVCV
PQENQLRQAIMEKAHSSAYALHPGRDVAEFVAKCLVCQQVKAEHQKPAGTLQSLPIPKCK
WEHVTMDFVLGLPWTQRGKDAIWVIVDQLTKFAHFLVVRSTYSIGKLAQLYIDEIVRLHG
VPISIVSDRDPRFTSRFWSKFQEALRTKLKFSTAFHPQTDGQSERNIQTMEDMLRACVMD
FLGSWDGHLPLVKFAYNNNFQSSIGMAPYKALYGTKCHTPLCWDEVGERKLVSVELIDLT
NDKIKVIRERKDLEFEIDDKVFLNVSPWKGVIQFAKRGKLNSRYIGPFQIIERIRPVKYV
PDPSHILEAPPIELHDDLKFEVQPVIILDRKDRVLRNKSISMVKVLWKSA
Download sequence
Identical sequences Tc00_g072080

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]