SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for CGD0014632 from Theobroma cacao Matina 1-6 v0.9

Domain architecture


Domain assignment details

Strong hits

Sequence:  CGD0014632
Domain Number 1 Region: 567-961
Classification Level  Classification         E-value
Superfamily           DNA/RNA polymerases    5e-148
Family                Reverse transcriptase  0.0000334
 
Domain Number 2 Region: 1086-1245
Classification Level  Classification                          E-value
Superfamily           Ribonuclease H-like                     2.58e-40
Family                Retroviral integrase, catalytic domain  0.031
 
Domain Number 3 Region: 402-498
Classification Level  Classification                    E-value
Superfamily           Acid proteases                    0.00000087
Family                Retroviral protease (retropepsin) 0.069
 
Domain Number 4 Region: 289-325
Classification Level  Classification                       E-value
Superfamily           Retrovirus zinc finger-like domains  0.00000279
Family                Retrovirus zinc finger-like domains  0.0034
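
The strong hits above are listed by domain number rather than by position along the protein. As a minimal sketch, the Python below sorts them from N-terminus to C-terminus and reports how many residues each spans. The `Domain` type and the helper names `n_to_c_order` and `extract_region` are hypothetical and not part of SUPERFAMILY. The sketch also assumes that region coordinates are 1-based and inclusive, which this page does not state.

```python
from typing import List, NamedTuple


class Domain(NamedTuple):
    """One strong hit; coordinates are ASSUMED to be 1-based and inclusive."""
    superfamily: str
    start: int
    end: int
    evalue: float

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


# Values copied from the strong hits listed on this page.
DOMAINS = [
    Domain("DNA/RNA polymerases", 567, 961, 5e-148),
    Domain("Ribonuclease H-like", 1086, 1245, 2.58e-40),
    Domain("Acid proteases", 402, 498, 0.00000087),
    Domain("Retrovirus zinc finger-like domains", 289, 325, 0.00000279),
]

SEQUENCE_LENGTH = 1403  # "Sequence length" field of this record


def n_to_c_order(domains: List[Domain]) -> List[Domain]:
    """Return the domains sorted by start position (N- to C-terminus)."""
    return sorted(domains, key=lambda d: d.start)


def extract_region(sequence: str, start: int, end: int) -> str:
    """Slice a 1-based inclusive region out of a protein sequence string."""
    return sequence[start - 1:end]


if __name__ == "__main__":
    for d in n_to_c_order(DOMAINS):
        print(f"{d.start:>5}-{d.end:<5} {d.length:>4} aa  {d.superfamily}")
    # The four regions do not overlap, so a plain sum gives the coverage.
    covered = sum(d.length for d in DOMAINS)
    print(f"{covered} of {SEQUENCE_LENGTH} residues fall in assigned domains")
```

Read in this order, the architecture runs zinc finger-like domain, acid protease, DNA/RNA polymerase, then Ribonuclease H-like domain. Passing the full protein sequence to `extract_region` together with a domain's `start` and `end` would return that domain's residues, under the same coordinate assumption.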
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms in each namespace assigned to this domain architecture, as determined by dcGO Predictor.

Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score

Protein sequence

External link(s) CGD0014632
Sequence length 1403
Comment aalen=1403,91% Dbxref=vitis:XP_002268669.1,prunus:ppb023014m.218
Sequence
MRQPVQIPPKTPATSRRMGEQDASTEMADRPQASTLRGWGRRGRATRSVRADTPVSSQEE
GQSSGNVDRQPTKGITIEDLAASLQGVNRVVEMMATHMDDIQKAVEGRPTVQESPSSQGQ
VDRQHHKVERGPLEISLLDFLKLQPPSFSGSDASEKPKVFLDKMEKICKALGCFSARSIE
LVAFRLEDVAVRNVKAREFETLVQTSSMTMSEYDIKFTQLARYAPYLVSTEEMKIQRFVD
GLVEPIFRAVACRDFTTYSVAVDCAQRIEMRTSESWAARDRAKRAKTEGYQGHRDFSSGV
SSSGRVCFGCGQPGHTRRNCPMAYQSHDSTRGSTKPASSAPSVIASSNREASGSRGRGAG
ASSQGRPSRSKCQSSAGRGQARVFALIQHEAQTSNAVVSSILSICNMNARVLFNPGATHS
FISPCFASRLGRDHVRRKEQFVVSTPLKEVFVVEWEYESCVVQVKDKDTLVNLVVLDTLD
FDVILGMDWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNTSTNLISVMSVKRLFRQG
CIGYLAVVRDTQAKVGDISQVSVVNEFMDIFPKELPELEELKDQLEDLLNKGFIRPSVSP
WGAPVLFVNKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLQSGN
HQLRIQNEDVPKIAFRTRYGHSEFLVMTFGLTNAPAAFMDLMNRVFKPYLDKFVVVFIDD
ILIYLRSMGEHEQHPKIVLQILREHRLYAKFSKCQFWLKSVAFLGHVVSKDGVQIDPKKV
EAVEKWPRLTSVTKIKSFLGLDGYYRRFVIDSSKIAAPLTKLTRKDTKFEWSDACENSFE
KLKACLTTAPVLSLPQATRCYTVFCDASRVGLGCVLMQQGKLIVYASRQLKRHKQNYPVH
DLEMVAIVFALKIWRHYLYGETCEISMDHKSLKYIFQQRDLNLLQRRWMELLKDYDCIIL
YHPSKANPILMDRIKEAQGKDEFVAKALEDPQGRKGKMFTKGTDRVLRYGTKLCVPDGDG
LRREILEEAHMVAYVVHLGATKMYQDLKEVYWWEGLKRDVVEFISKCLVCQQVKAEHQRP
AGLLQPLLVPEWKWEHIAMDFVTGLPRTNGGYDSIWIKVDRLTKSVHFIPIKTTYGATQY
ARVYVDEIVQLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTI
QTLEDMLRACVIDLGIRWDQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIRWLEVG
ERKLLGPELVEDATKKIRMVLQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKRV
MRFGKKGKLNPRYIGPFEILEKVGAMAYRLALPPDLLNIHPVFHVSMLRKYNPNPSQLKD
DLTDEEQPVAILNLQVKSSIQKK
Identical sequences CGD0014632
