Research article
Molecular evolution of the plant ECERIFERUM1 and ECERIFERUM3 genes involved in aliphatic hydrocarbon production

https://doi.org/10.1016/j.compbiolchem.2019.02.009Get rights and content

Highlights

  • Chlorophytes have a common homolog of the genes CER1 and CER3.

  • Chlorophytes may be one of the earliest plant taxa to contain CER1 and CER3.

  • CER1 and CER3 proteins are structurally similar, but CER1 proteins have more conserved histidine-containing motifs.

  • For CER1 proteins, there was no significant loss or gain of protein motifs after ancient and recent duplications.

  • CER1 proteins are highly conserved throughout evolution with no evidence of positive selection.

Abstract

The Arabidopsis ECERIFERUM1 (CER1) protein is a decarbonylase that converts fatty acid metabolites into alkanes. Alkanes are components of waxes in the plant cuticle, a waterproof barrier serving to protect land plants from both biotic and abiotic stimuli. CER1 enzymes can be used to produce alternative and sustainable hydrocarbons in eukaryotic systems. In this report we identified 193 CER1 and 128 CER3 sequences from 56 land plants respectively. CER1 and CER3 proteins have high amino acid similarity and both are involved in alkane synthesis in Arabidopsis. The common homologues of CER1 and CER3 genes were identified in three species of chlorophytes, which may be one of the earliest plant taxa that possess CER1 and CER3 genes. To facilitate potential applications, the 3-dimensional structure and conserved motifs of CER1 proteins were also characterized. CER1 and CER3 proteins are structurally similar, but CER1 proteins have more conserved histidine-containing motifs common to fatty acid hydroxylases and stearoyl-CoA desaturases. There was no significant loss or gain of protein motifs after ancient and recent duplications, suggesting that varied properties of CER1 proteins may be associated with less-conserved regions. Among 56 land plants, the codon-based assessments of selection modes revealed that neither entire proteins nor individual amino acids of CER1 proteins were significantly subjected to positive selection, indicating that CER1 proteins are highly conserved throughout evolution.

Introduction

Alkanes are acyclic saturated hydrocarbons and have the general chemical formula CnH2n+2 (Gutman, 2008). Alkanes are the major constituents of fossil fuels including petroleum and natural gas and exist in the form of a gas, liquid or wax, depending on the length and structure of the carbon backbone (Arora, 2006). Alkanes can also be naturally produced by living organisms, including bacteria, animals, fungi, and plants (Marsh and Waugh, 2013). In general, biological alkanes have an odd number of carbon atoms, indicating that they are derived by the loss of one carboxyl carbon from fatty acids with even numbers of carbons. The enzymatic decarbonylation processes in various living organisms convert fatty acid metabolites into alkanes (Marsh and Waugh, 2013).

The 2000s energy crisis triggered extensive research efforts in identifying genes that encode enzymes involved in enzymatic decarbonylation of fatty aldehydes, a promising biocatalytic conversion for the production of alternative and sustainable hydrocarbon fuels from renewable sources (Bhujade et al., 2017; Schirmer et al., 2010). As a result, an alkane biosynthesis pathway consisting of an acyl-acyl carrier protein reductase and an aldehyde decarbonylase in cyanobacterium Synechococcus elongatus PCC7942 was first identified and characterized (Schirmer et al., 2010). This microbial operon has been utilized to modify an Escherichia coli strain to produce C13 to C17 alkanes from renewable sources in a single fermentation step (d’Espaux et al., 2015; Schirmer et al., 2010). Although the first decarbonylase was discovered in bacteria, alkane biosynthesis in plants was investigated over three decades ago (Cheesbrough and Kolattukudy, 1984). The aerial epidermis of terrestrial plants produces and secretes alkane waxes, which are very-long-chain molecules (between 20–36 carbons) derived from very-long-chain fatty acids (Fu et al., 2015). The alkane waxes are deposited in the plant cuticle, a waterproof barrier that plays a preventive role against both biotic and abiotic stimuli in terrestrial plants.

The Arabidopsis thaliana ECERIFERUM1 (CER1) gene was proposed, over two decades ago, to encode a plant alkane-producing decarbonylase (Aarts et al., 1995). However, how the CER1 protein works was not well understood until recently. A successful reconstitution of alkane biosynthesis in yeast found that CER1 and A. thaliana CER3 proteins interact to produce alkanes together (Bernard et al., 2012). The CER3 protein serves as a fatty acyl reductase to produce metabolic precursors such as fatty aldehydes or other intermediates that are currently unknown. CER1, which serves as a decarbonylase, converts the precursors into alkanes. Interestingly, CER1 and CER3 proteins share 35% amino acid identity, indicating that they likely originated from a common ancestor gene (Bernard et al., 2012).

Research has attempted to improve the catalytic efficiency of these enigmatic alkane-producing enzymes based on the understandings of their catalytic mechanisms (Hayashi et al., 2015; Wang et al., 2014). Protein structure information would facilitate the efforts. Decarbonylases in cyanobacteria are soluble proteins and crystal structures of several cyanbacterial decarbonylases are now available (Buer et al., 2014; Jia et al., 2015; Park et al., 2016). In contrast, Arabidopsis CER1 and fruit fly decarbonylase CYP4G1 are membrane proteins, of which overexpression and purification are a considerable challenge (Marsh and Waugh, 2013). In addition, the decarbonylases from different organisms or species exhibit differential preferences of substrates in alkane production (Qiu et al., 2012; Schirmer et al., 2010). Various decarbonylases are required to produce gasoline (C4-C9), jet fuel (C8-C16), diesel (C10-C18), and lubricants (C16-C30). It would therefore be necessary to carry out further screening for the identification of efficient decarbonylases that produce alkanes in different chain lengths for various applications.

To our knowledge, the alkane-producing enzymes in plants have not been comprehensively investigated. In this report, we examined the phylogeny and evolution of plant CER1 and CER3 genes. To facilitate the potential applications, the protein structures and motifs of CER1 proteins were investigated as well. This in silico analysis would lay a foundation for further experimental study.

Section snippets

Acquisition of CER1 and CER3 sequences

The identifications of CER1 and CER3 sequences were conducted in Phytozome v12.1.5 (https://phytozome.jgi.doe.gov/), using the Arabidopsis CER1 (Locus AT1G02205) and CER3 (Locus AT5G57800) protein sequences in The Arabidopsis Information Resource (TAIR) (https://www.arabidopsis.org/) as the Basic Local Alignment Search Tool-protein (BLASTp) queries, respectively.

We identified and retrieved CER1 and CER3 sequences from 56 out of 57 land plant species, including Amaranthus hypochondriacus (Ahy),

CER1 and CER3 genes were obtained from 56 land plants and 3 chlorophyte species

193 CER1 proteins and their corresponding coding sequences were identified and downloaded for 56 land plant species (Additional file 1). Like the Arabidopsis CER1 gene, the identified CER1 sequences encode histidine-containing motifs and the putative WAX2 C-terminal domain. Histidine clusters are essential for the catalytic activity of hydroxylases and desaturases (Shanklin and Cahoon, 1998). We failed to identify a CER1 homologue in duckweed Spirodela polyrhiza. Since the aquatic species has

Conclusion

We identified 193 CER1 and 128 CER3 gene sequences from 56 land plant respectively, and three common homologues (i.e. CER1/3) of CER1 and CER3 from three chlorophytes. This is the first survey of CER1 and CER3 genes covering a wide range of plant taxa. The presence of common homologue of CER1 and CER3 in chlorophytes and the structural similarity of their encoding proteins shown in this study support the common origin of CER1 and CER3 genes. Although alkane synthesis already occurs in

Competing interest

We have no competing interests.

References (49)

  • R. Bhujade et al.

    Algae to economically viable low-carbon-footprint oil

    Annu. Rev. Chem. Biomol. Eng.

    (2017)
  • B.C. Buer et al.

    Insights into substrate and metal binding from the crystal structure of cyanobacterial aldehyde deformylating oxygenase with substrate bound

    ACS Chem. Biol.

    (2014)
  • Y. Cao et al.

    Phylogenetic relationships among eutherian orders estimated from inferred sequences of mitochondrial proteins: instability of a tree based on a single gene

    J. Mol. Evol.

    (1994)
  • T.M. Cheesbrough et al.

    Alkane biosynthesis by decarbonylation of aldehydes catalyzed by a particulate preparation from Pisum sativum

    Proc. Natl. Acad. Sci. U. S. A.

    (1984)
  • D. Darriba et al.

    ProtTest 3: fast selection of best-fit models of protein evolution

    Bioinformatics

    (2011)
  • D. Das et al.

    Oxygen-independent decarbonylation of aldehydes by cyanobacterial aldehyde decarbonylase: a new reaction of di-iron enzymes

    Angew. Chem. Int. Ed. Engl.

    (2011)
  • W.J. Fu et al.

    Hydrocarbons, the advanced biofuels produced by different organisms, the evidence that alkanes in petroleum can be renewable

    Appl. Microbiol. Biotechnol.

    (2015)
  • I. Gutman

    The chemical formula CnH2n+2 and its mathematical background

    Teach. Math.

    (2008)
  • Y. Hayashi et al.

    Role of cysteine residues in the structure, stability, and alkane producing activity of cyanobacterial aldehyde deformylating oxygenase

    PLoS One

    (2015)
  • W. He et al.

    Research progress in protein posttranslational modification site prediction

    Brief. Funct. Genomics

    (2018)
  • A.L. Hughes

    The evolution of functionally novel proteins after gene duplication

    Proc. Biol. Sci.

    (1994)
  • C. Jia et al.

    Structural insights into the catalytic mechanism of aldehyde-deformylating oxygenases

    Protein Cell

    (2015)
  • D.T. Jones et al.

    The rapid generation of mutation data matrices from protein sequences

    Comput. Appl. Biosci.

    (1992)
  • K. Katoh et al.

    MAFFT multiple sequence alignment software version 7: improvements in performance and usability

    Mol. Biol. Evol.

    (2013)
  • Cited by (15)

    • Origin and diversification of ECERIFERUM1 (CER1) and ECERIFERUM3 (CER3) genes in land plants and phylogenetic evidence that the ancestral CER1/3 gene resulted from the fusion of pre-existing domains

      2021, Molecular Phylogenetics and Evolution
      Citation Excerpt :

      CER1 and CER3 proteins have a characteristic bi-domain structure (ERG3/FAH and WAX2) that may show differential divergence (Chen et al., 2003; Bernard et al., 2012). Our phylogenetic analyses of these proteins from green plants provide insights into the origin, diversification and variation in copy number of the genes across land plants that are consistent with results of a recent study (Wang et al., 2019). Shedding new light on the question, we present phylogenetic evidence that: (a) a single ancestral gene, CER1/3, duplicated in the ancestral land plants to generate the paralogs; CER1 and CER3, supporting a critical role for cuticular waxes in early colonization of land (Niklas et al., 2017), (b) WAX2-WxAR sequences are limited to relatively few lineages across the tree of life, and the CER1/3 protein may have been assembled from the WAX2-WxAR and ERG3/FAH domains in green plants and two other lineages.

    • A single SNP in Brcer1 results in wax deficiency in Chinese cabbage (Brassica campestris L. ssp. pekinensis)

      2021, Scientia Horticulturae
      Citation Excerpt :

      The CER1 protein is localized on the endoplasmic reticulum. Compared with CER3 proteins, CER1 proteins have two more conserved motifs, fatty acid hydroxylases and stearoyl-CoA desaturases (Wang et al., 2019). CER1 is expressed in many organs, such as stems, flowers, siliques, seedlings, and leaves.

    View all citing articles on Scopus
    1

    Present address at: U.S. Department of Agriculture, Agricultural Research Service, Hard Winter Wheat Genetics Research, Throckmorton Hall, Manhattan, KS 66506.

    View full text