SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for CGD0033094 from Theobroma cacao Matina 1-6 v0.9

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  CGD0033094
Domain Number 1 Region: 390-678
Classification Level Classification E-value
Superfamily Protein kinase-like (PK-like) 4.78e-74
Family Protein kinases, catalytic subunit 0.00015
Further Details:      
 
Domain Number 2 Region: 23-96
Classification Level Classification E-value
Superfamily alpha-D-mannose-specific plant lectins 0.0000000000017
Family alpha-D-mannose-specific plant lectins 0.0056
Further Details:      
 
Domain Number 3 Region: 73-167
Classification Level Classification E-value
Superfamily alpha-D-mannose-specific plant lectins 0.00000183
Family alpha-D-mannose-specific plant lectins 0.0033
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Molecular Function IC (bits) H-Score
Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score

Protein sequence

External link(s) CGD0033094
Sequence length 697
Comment aalen=697,82% Dbxref=vitis:XP_002283186.1,prunus:ppa019044m.2
Sequence
MVIWFSMIQQASRSGLLVWLELGVSYAAMLDTGNFVLAREDSRILWQSFDNPTDTILPTQ
VMNQDSQLIARYTETNYSSGRFKFILQRDGNLLLYTTNFPFDDNVAAYWSAQTSIGSGFQ
VISNQSGNIYLTARKGSILNMVFSTQSSTQDFYLKAIVDYDGVFRQYAYPKSVSTSNGRW
PRSWTTLSLIPSNICMRIGRDNGSGACGYNSYCILGDDQRPICDCVPGYSFIDTNDIRKG
CRPNFTFCEETSQETDLYQFILMNNADWPDSSYESFKEVTEDWCRLACLNDCFCAVATFR
DGECRKKKTPLANGRVDPEIGGKALLKVRNNSTASKNSAKDKKDQSIVIRVVSVLLGGFV
FVNFLLLLVTLTLIFRLKRKQAEVQPQKVMPPMNLLSFPCSELDKATNGFQEELGCGAFG
TVYKGELASEPTELVAAKKLNKMERDGEQEFQAEVRAIGRTNHKNLVQLLGFCNEGQNRL
LVYEYMSNGSLAKFLFANARPNWYQRIQIAFGIAIRLFYLHEECSSQIIHCNIKPQNILL
DDSFSAKISDFGLAKLLKKDQTRTTTAIRGTKGYVAPEWFRNMPITVEVDVYSFGILLLE
LICCRKNFEPNVKEEDQMILVDWAYDCFMERKLQLLVENDEEATDEIKKVKKFVMIAVWC
IQEDPSLRPTMKKVVQMMEGVAEVPIPPNPALFPSSI
Download sequence
Identical sequences CGD0033094

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]