SUPERFAMILY 1.75 HMM library and genome assignments server

SUPERFAMILY 2 can be accessed from supfam.org. Please contact us if you experience any problems.

Domain assignment for gi|158316602|ref|YP_001509110.1| from Frankia sp. EAN1pec

Domain architecture


Domain assignment details

(
show help)
Strong hits

Sequence:  gi|158316602|ref|YP_001509110.1|
Domain Number 1 Region: 10-517
Classification Level Classification E-value
Superfamily Acetyl-CoA synthetase-like 1.03e-147
Family Acetyl-CoA synthetase-like 0.000000974
Further Details:      
 
Domain Number 2 Region: 3039-3405
Classification Level Classification E-value
Superfamily Thiolase-like 2.59e-98
Family Thiolase-related 0.002
Further Details:      
 
Domain Number 3 Region: 1143-1529
Classification Level Classification E-value
Superfamily Thiolase-like 2.39e-95
Family Thiolase-related 0.0013
Further Details:      
 
Domain Number 4 Region: 3580-3715,3777-3872
Classification Level Classification E-value
Superfamily FabD/lysophospholipase-like 3.92e-62
Family FabD-like 0.00018
Further Details:      
 
Domain Number 5 Region: 1699-1830,1892-1989
Classification Level Classification E-value
Superfamily FabD/lysophospholipase-like 9.42e-58
Family FabD-like 0.00041
Further Details:      
 
Domain Number 6 Region: 4460-4683
Classification Level Classification E-value
Superfamily NAD(P)-binding Rossmann-fold domains 4.09e-41
Family Tyrosine-dependent oxidoreductases 0.0000116
Further Details:      
 
Domain Number 7 Region: 2640-2888
Classification Level Classification E-value
Superfamily NAD(P)-binding Rossmann-fold domains 2.7e-40
Family Tyrosine-dependent oxidoreductases 0.0000407
Further Details:      
 
Domain Number 8 Region: 583-753
Classification Level Classification E-value
Superfamily NAD(P)-binding Rossmann-fold domains 3.74e-25
Family Tyrosine-dependent oxidoreductases 0.0088
Further Details:      
 
Domain Number 9 Region: 2921-3004
Classification Level Classification E-value
Superfamily ACP-like 1.14e-20
Family Acyl-carrier protein (ACP) 0.025
Further Details:      
 
Domain Number 10 Region: 4758-4837
Classification Level Classification E-value
Superfamily ACP-like 1.44e-19
Family Acyl-carrier protein (ACP) 0.059
Further Details:      
 
Domain Number 11 Region: 774-970
Classification Level Classification E-value
Superfamily NAD(P)-binding Rossmann-fold domains 4.38e-19
Family Tyrosine-dependent oxidoreductases 0.00053
Further Details:      
 
Domain Number 12 Region: 1055-1161
Classification Level Classification E-value
Superfamily ACP-like 2.62e-17
Family Acyl-carrier protein (ACP) 0.03
Further Details:      
 
Domain Number 13 Region: 4277-4439
Classification Level Classification E-value
Superfamily NAD(P)-binding Rossmann-fold domains 0.00000000000000181
Family Tyrosine-dependent oxidoreductases 0.012
Further Details:      
 
Domain Number 14 Region: 2427-2630
Classification Level Classification E-value
Superfamily NAD(P)-binding Rossmann-fold domains 0.00000000000305
Family Tyrosine-dependent oxidoreductases 0.042
Further Details:      
 
Domain Number 15 Region: 3714-3776
Classification Level Classification E-value
Superfamily Probable ACP-binding domain of malonyl-CoA ACP transacylase 0.00000000106
Family Probable ACP-binding domain of malonyl-CoA ACP transacylase 0.0054
Further Details:      
 
Domain Number 16 Region: 1829-1891
Classification Level Classification E-value
Superfamily Probable ACP-binding domain of malonyl-CoA ACP transacylase 0.00000000575
Family Probable ACP-binding domain of malonyl-CoA ACP transacylase 0.013
Further Details:      
 

Gene Ontology term assignment details

The top 10 most specific Gene Ontology terms for each namespace assigned to this domain architecture as determined by dcGO Predictor

(show help)

Biological Process IC (bits) H-Score
Cellular Component IC (bits) H-Score
Molecular Function IC (bits) H-Score

Protein sequence

External link(s) gi|158316602|ref|YP_001509110.1|
Sequence length 4930
Comment beta-ketoacyl synthase [Frankia sp. EAN1pec]
Sequence
MRMLRTELIAPLPKLLRGNAERLGDRIAFRDASRAVGWAELERRTGWLAGHLADLRLQPG
DRAAIVLGNCVEVVESYLGFARASVVGVPINPRVTETELAYLLDDCGARLVVTDPARIDM
VGRVLRDRPGLRVVVTGGHAPPPSAPAGTLSFAALAGAQPRSAARDDLGLDDVAWMLYTS
GTTGRPKGVLSTQRSCLWSVAACYAPIPGLSEQDRVLWPLPLFHSLSHIACVLGVTAVGA
SARLLDGFAASEVLAAIQEDGSTFLAGVPTMYHYLVRAARESGFSAPSLRMCLVGGAITT
ARLRRDFEEAFGAPLLDAYGSTETCGSITINWPTGARVEGSCGLPVPGLGVRLVDPETGL
DVGAGAEGEVWVRGPNVMVGYHNQPEATAAALRDGWYRTGDLARRDDAGYFTITGRIKEL
IIRGGENIHPGEVEEVLRGVPGVADVAVVARPHDLLGEVPVAFLVPGPEGLDPDRLLATC
RERLSYFKVPEELYEIDRIPRTASGKITRHVLLERPARLRAASSGHHDTLFRVDWIPRPS
VTSSSVRARPDHTSSPVQAGGPGRQPLSAPPVGTSVPPVGGERRWAIIGADAFGFAPVLT
EAGILVSQYPNLDAVRRAAADGDEVPDLAVLTCGSVLGKAGVLSDDAARAVTWLSGEIAG
WLAQERLAGVKLVIATRGAVAVGPDDDIEDLIRAPLWGLLRTLQTEHPDRFVLFDLTVDD
PAGAAALPAVVESGEPQVAVRQGVVLLPRLARVAAVPTDQDSPTGGNLFADPLRTVVVTG
ADGPVGAALARHLVANYGVRRLLLVSRPGVAGDLGDLGELGGAGGLGGATGEHSAEALRA
ELAHAGATVTHVPCDLADRAALAAVLHRHARSVCAVFHAQSAAGPVGGLRNDEQRLAATL
AGALNLHDLIGGPETRAFVLCSSAAGLLGGAGLAEPAAVSVFFGALVQHRAARGLPAASV
SWGPWEGSELPELPGSRALRVREGLAMFDAALGADQSVLAVLRPDAAVLADDAPPAPLRG
LIDVPVAHRPPDDAVAVELRTKLAGLPEADALRLLTTLVRTEVARVAGGSAGAAVEAGTA
FRDLGLTSVTTVELRNRLTASTGLRLPVTVAFDHPTPLELARELRRGLLGEASAAVAVRN
RRAVSDEPVAIVGMACRLPGGVVSAEGLWDVVAGGVDAVSGFPSDRGWDLAGLAGDGVGG
VGGVGGVGRVGSSVAGSGGFLRDVAGFDAGLFGVSPREALAMDPQQRLLLEVSWEVLERA
GIDPGSLRGEPVGVFTGLMHHDYARGNTAAAERLEGYLSIGTAGSVASGRVAYSFGFEGP
ALTVDTACSSSLVALHLAVGSLRSGECSLALAGGVAVMATPEVFVDFSRQGGLAVDGRCK
AYADAADGTGWAEGAGVLLLERLSDAERNGHRVLAVVRGSAVNQDGASNGLTAPSGRSQE
RVIRAALADAGLTTADVDVVEGHGTGTRLGDPIEVGALLATYGTGRSPDRPVLLGSLKSN
IGHTQAAAGVAGVIKTVQALRHGVVPRTLHVDAPSSRIDWSAGAVSLVTEPVVWPETGRP
RRAGVSSFGVSGTNAHVIIEQAPDESPDTATDEVADAAPDGVVAGEAVPWLLSAASPAAL
RAQADALAEFAAARPAPAAAEIGRSLATTRARLARRAVVVAGSHDERVTALRALGAGTPH
PDVIFEPSRPQVGADRGNVFVFPGQGAQWVGMGAGLLGGSSRLSEVFRGVVEEVSGVLAG
LVDWSLVDVLRGVGSDGVLERVEVVQPASFAVGLGLVRVWGELGVVPGAVLGHSQGEVVA
ACVAGALGVGDAVRVVVGRSRVVAERLSGRGGMVSVFLPVDEVVGLLPVGVEVAAVNGPG
VTVVSGERAGLVELVGVLEGRGVRVRWVAVDYASHSSQVDGVAGELRELLAGVRSVVPRV
PFFSTVEGRWVSGAGELEGDYWFRNLRSRVGFAGAVGVLAGEGFRSFVEVGAHPVLVGAV
GEVLEEVGVSDAVVVGSLRRGEGGGGRVLRSAAELFVRGVRVDWSGVFDGRGGVAVGVGV
GPDLPTYPFQHERYWLDPDPSGGDGLGDVSTAGLDPVDHPLLGAGVTVAGESFPDTSPDP
LAAGLTEGGPGGDGRLLYTGRLTTDRHPWLADHRVGGAVVLPGSALLDVLAWIGDRVGRP
VLAELTISAPILIDEPAENTAANPAAGTDIQIVVGPPDGERSRPVTVHSRTSPDEAWTEN
ARGILAQPAGTATGGPTDPTGRPGWAASWPPAAASPVDIDECYDRLPVDYGPAFHALRAA
WVAEGTVYAEVTLPDAAVAPAGDVSPASRWLRSGGVELSDGTDGTEPSDGYGLHPVLLDA
ALHPLGVAGFFPDPDQPRLAFAWSGVRLWATGARSLRVRIAPAGPDTMTISAVDDEGAAV
IDVEALVVRPVDPERLTAGRGAARRSHALFEVRWEQTDAAAVRGTEWAFHPDSAVRDSAV
PGGSGGVTGTPVAERAARPPFVVFVPASATGVDGGAGGDAGTVPGAAVAVRDVPGAVRAA
GLRTLRVLQDWLADPDSASSRLVVVTRDGDLAHGSVRGLVRAAQAEHPGRFGLLNLDRAY
HLPEPPSVGETPWLEAIAAGLGAPADEPWVAVRPAGAGGTEVLAARLRRAERAPVPARSL
LEGGTVLVTGATGGLGRLAADHLARVHRVRELVLVARSVAGEEQVAELRATGVTVRAVAA
DVADRAAMAEIVASVADRLTAVVHIAGIVEDGVIEALDEPRWHAVLRTKADAAWHLHELT
AGLDLAAFVLYSSAASVFGGAGQGNYAAGNGFLDALARHRRAAGLPAVSLAWGLWQEAAG
MGGRLSATDRARMTRDGTRALTAADGLALFDSALVDSRPELVPVLLDLGALRRRDSLPPL
LRGLVPAPAPARRRAAVSAGSAAPDRSTLRDRLAAASAGTRDEILLELVQATAAVVLGHT
DPAAVEPDRAFRDLGFDSLTSVELRNGLITAAGVRLSATVVFNHPTPRSLAGHLAAQLAP
DLPDLPDGPSASGDPPRAVTAVAAGTVRGSGGRTREQDRTDDPIVIVAMACRFPGGIGSP
ADLWQVASDGVDVVGPLPTDRGWNLTELYDPDPDRPGRTYVHAGGFLTDVTGFDAGLFGI
SPREAQAMDPQQRLLLETSWEVLERAGLDPTSLAGTPTGVYVGTHGQDYASEVSGERADE
GYLVIGRAASVLSGRVSYAFGFEGPALTVDTACSSSLVALHTAAAALRAGEIGLALVAGV
SIMSSPEGLLGFSRQRGLAADGRCKAYADAADGFGMAEGVGVLLVERLSDARRHGRRVLA
VVRGSAVNQDGASNGLTAPSGRSQERVIRAALADAGLTTADVDVVEGHGTGTTLGDPIEA
QALLATYGRRPADRPVLLGSVKSNIGHTQAAAGVAGIIKMVQALDHAVVPKTLHVDRPSG
HVDWSAGAVSLVTEPVAWPETGRPRRAAVSSFGVSGTNAHVIIEQAPVASTEPAETEAPA
APGESEVPAGPVVPVVLSAASREALRGQAGRLAEFVRTRTDVPVAAVAATLLTRARLGQR
AVAVAAERGELVAGLEALAGDLPDPAVVSGAASPRGRGPVFVFPGQGAQWVGMGAGLLSG
PSVPSSALSSVFRETVDEVAGALAGLVDWSLVDVLRGEGPDGALERVEVVQPASFAVALG
LARVWRELGVTPGAVVGHSQGEVAAACVAGALGTADAVRVVVARSRVVGARLAGRGGMVS
VSLPAAELAGLLPPGVEVAAVNGPGTTVISGASAALAELVAALEVRGVRARSVAVDYASH
SAQVDAVAEELAELLAGVRPESPRIPFFSTVEGRWIAGAELTGDYWVRNLRRTVGFAQAV
GVLAGEGFRSFVEVGAHPVLVPSIGEVLEEAGFGDTAVVGSLRRGEGGPERLLRSAAELF
VAGVPVDWTKAFPAAAPRAAGRLGAAAELPTYAFQHERYWLAPGAPGSGDVTAAGLDATG
HPLLGAAVDLPGSAPDAAEVAFTARLSARTHPWLADHAVRGVRLLPATAWIEIGLHAGDR
VGHPVLDELLIEAPLAVPAEGSVTLRVIVDGPDADGRRPVRLYARPDGAAAPGDGDGDGT
EGTGTRWTRHGTGLLSAEVTEPGHGYEAWPPADARPVDLADFYPRLADRGYDYGPAFTGL
RAAWTRGREVFAEVELPAAEASAEPPGAYGLHPALLDAALQATNLGAVPAAEEGHVLLPF
AWSGIRRFSSGVTALRVHATPSDLAAAPGSHGVSVRMSDRGGAPVAEIGSLVLRATPLAQ
LDRLDRSAGSGGAGTAEALFRVEWVGVPTGPAGTPLGRPAVSPAAGAEPPEADVLDVTGR
ASVDPTAVRALVADVLEALQKRLAPAGGAPAVTAPGHGWRVPDGPLVILTDDPAGEPASA
AVWGLVRSAQAEHPGRFVLLGGAPEESRAVLPTVLASGEPQAVVRDGQVLVPRLARAARP
SADTSAPDGASPLSGMSPVDGTTLVTGGTGTLGAIVARELVRTHGVRHLVLLSRTGAATP
GAPELVAELRHAGAAVEVVAADAADREAVRALLADIPPAHPLTAVVHVAGVVDDGLVTSL
DRGRLDSVFRPKADAAWNLHELTAELGLAAFVLFSSAAGAFGGAGQGNYAAANGYLDALA
EYRAGLGLPAVAVAWGLWERASSLTASLTPADRDRMARGGVRGLSDAEGAAFFGAALRSP
DAVLVAAAVDVPALRRRASAGGLPPLLRGLVPAPVPAADSAAPDSPGGITGTGSAGAGSA
SGQPGRALARRLAGQAEPERRRVLLDVVRAHTATVLAHRSGNAVGVGQTFQELGFDSLTG
VELRNRLAAAVGVRLAATMVFDHPTPAALAEHLLRLLDLDGPDDPDSHHGLDGPGTPAVR
NSSVLDQLARLERVLTGATDARIAVPDEVAARLRALAASLGPRRDDGAGLDLTSATDDEM
FELLDRGLGS
Download sequence
Identical sequences A8LG01
WP_020462324.1.9268 298653.Franean1_4838 gi|158316602|ref|YP_001509110.1|

Jump to [ Top of page · Domain architecture · Domain assignment details · Most Informative Gene Ontologies ]