Research Article
Identification and functional prediction of stress responsive AP2/ERF transcription factors in Brassica napus by genome-wide analysis

https://doi.org/10.1016/j.compbiolchem.2017.09.004Get rights and content

Highlights

  • By implementing genome-wide analysis, 321 AP2/ERF transcription factors were identified in Brassica napus.

  • BnAP2/ERFs were formed through duplication events, occurring before Brassica napus formation.

  • The BnERF039 and BnERF245 are candidates for improving salt-tolerant Brassica napus.

  • BnERF227, BnERF228, BnERF234, BnERF134, BnERF132, BnERF176, and BnERF235 are candidates for improving Leptosphaeria-tolerant Brassica napus.

Abstract

Using homology and domain authentication, 321 putative AP2/ERF transcription factors were identified in Brassica napus, called BnAP2/ERF TFs. BnAP2/ERF TFs were classified into five major subfamilies, including DREB, ERF, AP2, RAV, and BnSoloist. This classification is based on phylogenetic analysis, motif identification, gene structure analysis, and physiochemical characterization. These TFs were annotated based on phylogenetic relationship with Brassica rapa. BnAP2/ERF TFs were located on 19 chromosomes of B. napus. Orthologs and paralogs were identified using synteny-based methods Ks calculation within B. napus genome and between B. napus with other species such as B. rapa, Brassica oleracea, and Arabidopsis thaliana indicated that BnAP2/ERF TFs were formed through duplication events occurred before B. napus formation. Kn/Ks values were between 0 and 1, suggesting the purifying selection among BnAP2/ERF TFs. Gene ontology annotation, cis-regulatory elements and functional interaction networks suggested that BnAP2/ERF TFs participate in response to stressors, including drought, high salinity, heat and cold as well as developmental processes particularly organ specification and embryogenesis. The identified cis-regulatory elements in the upstream of BnAP2/ERF TFs were responsive to abscisic acid. Analysis of the expression data derived from Illumina Hiseq 2000 RNA sequencing revealed that BnAP2/ERF genes were highly expressed in the roots comparing to flower buds, leaves, and stems. Also, the ERF subfamily was over-expressed under salt and fungal treatments. BnERF039 and BnERF245 are candidates for salt-tolerant B. napus. BnERF253-256 and BnERF260-277 are potential cytokinin response factors. BnERF227, BnERF228, BnERF234, BnERF134, BnERF132, BnERF176, and BnERF235 were suggested for resistance against Leptosphaeria maculan and Leptosphaeria biglobosa.

Introduction

APETALA2/Ethylene Responsive Factors (AP2/ERF) represent a large family of transcription factors (TFs) in plants. AP2/ERF TFs are identified by the presence of an AP2 DNA-binding domain consisting 60–70 highly conserved amino acids (aa) (Licausi et al., 2013, Sakuma et al., 2002). The AP2 domain was initially identified in Arabidopsis and tobacco AP2 and ERF proteins, respectively (Jofuku et al., 1994, Ohme-Takagi and Shinshi, 1995). This domain consists of 3 stranded-anti-parallel β-sheets and an α-helix parallel to β-sheets (Allen et al., 1998). Two regions designated as YRG and RAYD element, at the N- and C-terminal of the AP2 domain, play an important role in the DNA-binding activity. YRG element contains almost 20 residues that facilitates DNA-binding through the basic and hydrophilic moieties. RAYD element with about 40 residues interacts with the major groove of DNA through amphipathic α-helix formation (Okamuro et al., 1997). Although the AP2 domain had been initially considered as a plant specific domain, it was later found in viral and bacterial HNH endonuclease (A class of homing endonucleases). Hence, the gain of AP2 domain in plants was as a result of horizontal gene transfer (Magnani et al., 2004).

AP2/ERF superfamily has been divided into four major subfamilies according to DNA-binding domain and sequence similarity. These four subfamilies are Dehydration Responsive Element Binding (DREB) Factor, ERF, AP2, and a subfamily called Related to ABI3 and VP1 (RAV) (Riechmann et al., 2000, Sakuma et al., 2002). A fifth subfamily could be found in most of the higher plants nominated as the Soloist. The Soloist subfamily has a single AP2 domain with low conservation (Song et al., 2013). Within this manuscript, DREB, ERF, AP2, and RAV have been called subfamilies and their clusters have been defined as groups. The DREB and ERF subfamilies, both containing a single AP2 domain, are distinguished from each other based on their differences in the conserved residues of DNA-binding domain. Two classification methods have been suggested for DREB and ERF subfamilies. According to Sakuma et al., DREB and ERF subfamilies are subdivided into A1-A6 and B1-B6 groups, respectively, based on the DNA-binding domain (Sakuma et al., 2002). Nakano clusters ERF and DREB subfamilies together into 12 groups in A. thaliana and 15 groups in rice based on other conserved motifs outside of the AP2 domain and gene structure (Nakano et al., 2006). The majority of AP2 subfamily members have two AP2 domains. The RAV subfamily is identified by the presence of one extra DNA-binding domain, called B3 domain (Licausi et al., 2010a, Nakano et al., 2006).

AP2/ERF TFs are generally known to activate gene expression through Dehydration Responsive Element (DRE)/C-repeat (CRT) and Low Temperature Responsive Element (LTRE). However, there are some other regulatory elements attributed to abscisic acid (ABA)-dependent and independent pathways. The ERF subfamily binds to GCC box (AGCCGCC), while the DREB subfamily interacts with (A/GCCGAC) (Mizoi et al., 2012, Ohme-Takagi and Shinshi, 1995, Yamaguchi-Shinozaki and Shinozaki, 2005). The AP2 subfamily binds to a completely different sequence, called GCAC(A/G)N(A/T)TCCC(A/G)ANG(C/T) in spite of having the AP2 domain (Gong et al., 2008, Nole-Wilson and Krizek, 2000). The RAV subfamily generally trans-activates the CAACA or CACCTG elements (Sohn et al., 2006).

AP2/ERF superfamily could be considered as an activator or repressor. Generally, repressors in the AP2/ERF superfamily consist of two major motifs, including the ERF-associated amphiphilic repression (EAR) motif (DLNxxP or LxLxLx) in the C-terminal (Hiratsu et al., 2003, Ohta et al., 2001) and B3 repression domain (BRD) motif (RLFGV) in the RAV subfamily. Activators usually do not have a specific motif and merely have regions rich in acidic amino acids, such as Gln, Ser, Pro, and Thr (Liu et al., 1999). However, the EDLL motif has been recognized as a potent plant transcription activation within the AP2/ERF TFs (Tiwari et al., 2012).

AP2/ERF TFs have important functions in biological processes, including development, reproduction, primary and secondary metabolite biosynthesis, and adaptation to biotic and abiotic stresses (Licausi et al., 2013, Mizoi et al., 2012). They are mainly activated in response to stresses such as drought (Golldack et al., 2011, Yamaguchi-Shinozaki and Shinozaki, 2006), heat (Sakuma et al., 2006b), water logging (Hinz et al., 2010), high salinity (Abogadallah et al., 2011), freezing (Yang et al., 2005), osmotic stress (Zhu et al., 2010) and pathogen stimuli (Gutterson and Reuber, 2004). For example, the DREB subfamily is mainly induced in response to abiotic stresses (Mizoi et al., 2012). DREB A1/Cold Binding Factor (CBF) group is considered as a master regulator of cold-stress. Several genes being assigned to cold stress, including Late Embryogenesis Abundant (LEA) genes, fatty acid biosynthesis genes, and sugar metabolism genes are highly activated in transgenic plants, over-expressing DREB1/CBF TFs (Fowler and Thomashow, 2002, Maruyama et al., 2009). The increased numbers of CBF TFs in Eucalyptus grandis might be associated with well acclimation of Eucalyptus to different climate regions (Cao et al., 2015). However, DREB1/CBF TFs are not only induced in response to cold stress, but also to circadian clock, high salinity, dehydration, and growth (Mizoi et al., 2012). DREB A2 group (equal to group IV) plays a dual function in drought and heat shock adaptive pathways through activating genes encoding LEA proteins and heat-shock proteins (Sakuma et al., 2006b). DREB A6 group (equal to group I) is responsible for the biosynthesis of lipid and cell-wall components (Mizoi et al., 2012). Some other roles, including ABA and sugar signaling (Arenas-Huertero et al., 2000, Huijser et al., 2000, Song et al., 2005), increasing drought tolerance through changing architecture of roots and leaves (Abogadallah et al., 2011; Wilson et al., 1996c), regulation of DREB A1/A2 in a negative feedback (Mizoi et al., 2012) and binding to the promoters of genes, involved in the biosynthesis of antioxidants, have been attributed to the other members of DREB (Shaikhali et al., 2008). Group V is involved in the wax biosynthesis and secondary wall formation (Aharoni et al., 2004, Broun et al., 2004, Lasserre et al., 2008). Group V as a part of ERF B6 includes groups V, VI-L, and Xb-L according to Nakano’s classification method. Members of the ERF B1 group (equal to group VII) were activated under oxygen limitation, and conferred submergence tolerance (Licausi et al., 2010b, Xu et al., 2006). The ERF B3 group (equal to group IX) was responsible for secondary metabolite biosynthesis (Mizoi et al., 2012, Shen et al., 2016). The ERF subfamily is also associated with developmental processes (Vernié et al., 2008) and pathogen defensive procedures (Fischer and Dröge-Laser, 2004) along with drought and high salinity (Fujimoto et al., 2000, Park et al., 2001). The AP2 subfamily mostly carry out developmental roles entailing floral development (Elliott et al., 1996, Horstman et al., 2014), fruit size (Chialva et al., 2016), leaf shape and development (Luo et al., 2012, van der Graaff et al., 2000), embryogenesis (El Ouakfaoui et al., 2010, Heidmann et al., 2011), and plant height (Luo et al., 2012). The RAV subfamily is poorly studied. It has been shown that the RAV TFs are involved in postponing floral induction (Castillejo and Pelaz, 2008), growth inhibition (Zhao et al., 2008), bud outgrowth (Moreno-Cortés et al., 2012), leaf senescence (Woo et al., 2010) as well as tolerance to cold, osmotic stress (Zhuang et al., 2011) and pathogen infection (Sohn et al., 2006).

Brassica napus is the second oilseed crop in the world (Fareeha Zafar et al., 2015). It is considered as one of the crops of choice for biodiesel and biofuel production (Blackshaw et al., 2011, Hemmati et al., 2017).

B. napus (genome AnAnCnCn) is an allopolyploid formed by hybridization between B. oleracea (genome CoCo) and B. rapa (genome ArAr) as ancestors (Allender and King, 2010). Moreover, B. napus shares more than 86% similarity in protein coding sequence with the model plant A. thaliana (Cavell et al., 1998). As a consequence, having the knowledge of B. napus closest relatives allows us to investigate its genome.

Cultivation of B. napus is jeopardized by a number of biotic stresses, including fungal and viral pathogens, insects, and weeds. The fungal genus Leptosphaeria, including Leptosphaeria maculan and Leptosphaeria biglobosa are economically important fungal plant pathogens (Howlett, 2004, Van de Wouw et al., 2008). A number of abiotic stresses adversely affect B. napus growth and productivity, of which cold (Meza-Basso et al., 1986), heat (Yu et al., 2014), salinity, flooding, and drought are of special notice.

Since, AP2/ERF TFs are ideal candidates for crop improvement, elucidation of this superfamily in B. napus would have drastic effects on the crop yield and resistance toward biotic and abiotic stresses. In parallel with our study, another paper describing some aspects of genome wide analysis of AP2/ERF TFs in B. napus has been published by Song (Song et al., 2016). The major approach of the mentioned study was finding out the origination, expansion, and evolutionary relation of AP2/ERF TFs within the main land plants. However, in this manuscript, our main purpose is the elucidation of AP2/ERF functions based on various bioinformatic analyses which are quite different than the ones in the paper of Song et al. We report the identification of 321 AP2/ERF TFs in B. napus genome. Furthermore, cluster analysis of the identified proteins, motif-distribution recognition, gene structure analysis in conjunction with gene ontology annotation, evolutionary analysis, and protein characterization were carried out. Expression profile in different tissues and expression profile under biotic and abiotic stresses were monitored via digital RNA-seq data. Results from this study provide the basis for functional experiments associated with genetic engineering of B. napus.

Section snippets

Identification of BnAP2/ERF TFs

The AP2/ERF protein sequences of A. thaliana were retrieved from the database of Arabidopsis transcription factors (DATF) (http://datf.cbi.pku.edu.cn) (Guo et al., 2005). The protein sequences of B. rapa AP2/ERF TFs were retrieved from the Brassica database (http://brassicadb.org/brad/) (Cheng et al., 2011). These sequences were used as queries in the NCBI database (http://blast.ncbi.nlm.nih.gov/Blast.cgi) for PSI-BLAST. BLAST was performed with an e-value of 1e-3 against B. napus. BLAT

Identification and classification of BnAP2/ERF TFs

In this study, based on homology analysis and presence of the complete AP2 domain, 321 AP2/ERF TFs were identified in B. napus. Their sequence length ranged from 123 to 665 aa. The total number of recognized BnAP2/ERF TFs exceeds the number of AP2/ERF TFs in all other previous studied species (Fig. 1). This number was far smaller than the identified numbers of BnAP2/ERF TFs by Song (Song et al., 2016). The reason of this difference will be fully explained in the discussion part. The general

Significance of the identification of AP2/ERF TFs in B. napus

Plants are threatened by a number of biotic and abiotic stresses, affecting their growth and productivity. Previously, it was reported that TFs were accounted for more than 5% of the A. thaliana genome almost twice the ratio in animals and yeast genomes (Riechmann et al., 2000). Hence, TFs play pivotal roles in various processes, including defensive mechanisms in plants, of which AP2/ERFs are of special importance. The AP2/ERF TFs have a key function in response to biotic and abiotic stresses

Conclusions

AP2/ERF TFs carry out a diverse range of functions with the emphasis on coping with biotic and abiotic stresses. As a consequence, their identification and characterization is fundamental to develop stress-tolerant crops and new functionalities. B. napus, a tetraploid species formed by the hybridization between B. rapa and B. oleracea, is a high yielding crop extensively investigated because of its economic value in agriculture, nutrition, and industry. The present study was a wide comparative

Declaration of interest

The authors declare no conflict of interest.

Funding sources

This work was supported by Shiraz University of Medical Sciences, Shiraz, IRAN. [Grant number: 93-01-05-8275].

Acknowledgments

Authors would like to thank Shiraz University of Medical Sciences, Shiraz, IRAN. This work was part of the thesis of Hajar Owji.

References (219)

  • L.D. Hurst

    The Ka/Ks ratio: diagnosing the form of sequence evolution

    Trends Genet.

    (2002)
  • A. Iwase et al.

    The AP2/ERF transcription factor WIND1 controls cell dedifferentiation in Arabidopsis

    Curr. Biol.

    (2011)
  • H.-G. Kang et al.

    Overexpression of FTL1/DDF1, an AP2 transcription factor, enhances tolerance to cold, drought, and heat stresses in Arabidopsis thaliana

    Plant Sci.

    (2011)
  • B. Khraiwesh et al.

    Role of miRNAs and siRNAs in biotic and abiotic stress responses of plants

    Biochim. Biophys. Acta

    (2012)
  • M.J. Kim et al.

    The Arabidopsis AP2/ERF transcription factor RAP2. 11 modulates plant response to low-potassium conditions

    Mol Plant

    (2012)
  • J.-G. Kim et al.

    Xanthomonas type III effector XopD desumoylates tomato transcription factor SlERF4 to suppress ethylene responses and promote pathogen growth

    Cell Host Microbe

    (2013)
  • J. Kyte et al.

    A simple method for displaying the hydropathic character of a protein

    J. Mol. Biol.

    (1982)
  • Y. Lai et al.

    Overexpression of a Chinese cabbage BrERF11 transcription factor enhances disease resistance to Ralstonia solanacearum in tobacco

    Plant Physiol. Biochem.

    (2013)
  • G.M. Abogadallah et al.

    Overexpression of HARDY, an AP2/ERF gene from Arabidopsis, improves drought and salt tolerance by reducing transpiration and sodium uptake in transgenic Trifolium alexandrinum L

    Planta

    (2011)
  • E. Afgan et al.

    The Galaxy platform for accessible, reproducible and collaborative biomedical analyses : 2016 update

    Nucleic Acids Res.

    (2016)
  • A. Aharoni et al.

    The SHINE clade of AP2 domain transcription factors activates wax biosynthesis, alters cuticle properties, and confers drought tolerance when overexpressed in Arabidopsis

    Plant Cell

    (2004)
  • M.D. Allen et al.

    A novel mode of DNA recognition by a β-sheet revealed by the solution structure of the GCC-box binding domain in complex with DNA

    EMBO J.

    (1998)
  • C.J. Allender et al.

    Origins of the amphiploid species Brassica napus L. investigated by chloroplast and nuclear molecular markers

    BMC Plant Biol.

    (2010)
  • M.M. Ambavaram et al.

    Coordinated activation of cellulose and repression of lignin biosynthesis pathways in rice

    Plant Physiol.

    (2011)
  • S. Andrews

    FastQC a Quality-control Tool for High-throughput Sequence Data

    (2014)
  • F. Arenas-Huertero et al.

    Analysis of Arabidopsis glucose insensitive mutants, gin5 and gin6, reveals a central role of the plant hormone ABA in the regulation of plant vegetative development by sugar

    Genes Dev.

    (2000)
  • T.L. Bailey et al.

    Fitting a Mixture Model by Expectation Maximization to Discover Motifs in Bipolymers

    (1994)
  • T.L. Bailey et al.

    Combining evidence using p-values: application to sequence homology searches

    Bioinformatics

    (1998)
  • H. Banno et al.

    Overexpression of Arabidopsis ESR1 induces initiation of shoot regeneration

    Plant Cell

    (2001)
  • R. Bari et al.

    PHO2, microRNA399, and PHR1 define a phosphate-signaling pathway in plants

    Plant Physiol.

    (2006)
  • A. Bender

    An Evaluation of Cluster 3.0 as a General Tool for Principal Component Analysis

    Proceedings of the 47th ACM Technical Symposium on Computing Science Education, ACM

    (2016)
  • R. Blackshaw et al.

    Alternative oilseed crops for biodiesel feedstock on the Canadian prairies

    Can. J. Plant Sci.

    (2011)
  • K. Boutilier et al.

    Ectopic expression of BABY BOOM triggers a conversion from vegetative to embryonic growth

    Plant Cell

    (2002)
  • P. Broun et al.

    WIN1, a transcriptional activator of epidermal wax accumulation in Arabidopsis

    Proc. Natl. Acad. Sci U. S. A.

    (2004)
  • V. Busov et al.

    EARLY BUD-BREAK1 (EBB1) defines a conserved mechanism for control of bud-break in woody perennials

    Plant Signal. Behav.

    (2016)
  • S.B. Cannon et al.

    OrthoParaMap: distinguishing orthologs from paralogs by integrating comparative genome data and gene phylogenies

    BMC Bioinf.

    (2003)
  • S.B. Cannon et al.

    The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsisthaliana

    BMC Plant Biol.

    (2004)
  • P.B. Cao et al.

    Genome-wide analysis of the AP2/ERF family in Eucalyptus grandis: an intriguing over-representation of stress-responsive DREB1/CBF genes

    PLoS One

    (2015)
  • A.C. Cavell et al.

    Collinearity between a 30-centimorgan segment of Arabidopsis thaliana chromosome 4 and duplicated regions within the Brassica napus genome

    Genome

    (1998)
  • B. Chalhoub et al.

    Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome

    Science

    (2014)
  • T. Chen et al.

    Expression of an alfalfa (Medicago sativa L.) ethylene response factor gene MsERF8 in tobacco plants enhances resistance to salinity

    Mol. Biol. Rep.

    (2012)
  • T. Chen et al.

    An alfalfa (Medicago sativa L.) ethylene response factor gene, MsERF11, enhances salt tolerance in transgenic Arabidopsis

    Plant Cell Rep.

    (2012)
  • H.Y. Chen et al.

    ORA47 (octadecanoid‐responsive AP2/ERF‐domain transcription factor 47) regulates jasmonic acid and abscisic acid biosynthesis and signaling through binding to a novel cis‐element

    New Phytol.

    (2016)
  • F. Cheng et al.

    BRAD, the genetics and genomics database for Brassica plants

    BMC Plant Biol.

    (2011)
  • M.-C. Cheng et al.

    The Arabidopsis ETHYLENE RESPONSE FACTOR1 regulates abiotic stress-responsive gene expression by binding to different cis-acting elements in response to different stress signals

    Plant Physiol.

    (2013)
  • C. Chialva et al.

    Expression of grapevine AINTEGUMENTA-like genes is associated with variation in ovary and berry size

    Plant Mol. Biol.

    (2016)
  • A. Conesa et al.

    Blast2GO: A comprehensive suite for functional analysis in plant genomics

    Int. J. Plant Genomics

    (2008)
  • A. Conesa et al.

    A survey of best practices for RNA-seq data analysis

    Genome Biol.

    (2016)
  • X. Dai et al.

    psRNATarget: a plant small RNA target analysis server

    Nucleic Acids Res.

    (2011)
  • R.M. Davidson et al.

    Comparative transcriptomics of three Poaceae species reveals patterns of gene expression evolution

    Plant J.

    (2012)
  • Cited by (0)

    View full text