Research Article
Molecular evolution and structural variations in nuclear encoded chloroplast localized heat shock protein 26 (sHSP26) from genetically diverse wheat species

https://doi.org/10.1016/j.compbiolchem.2019.107144Get rights and content

Highlights

  • Identification and in silico characterization of choloroplast specific HSP26 allelic variants in wild relatives of wheat.

  • A novel HSP26 gene discovered in DD-genome assembly of diploid progenitor wheat, the orthologue of which -is not present in hexaploid wheat genome.

  • First report -on the isolation of a novel chloroplast specific sHSP26 protein from Ae. tauschiiaccession pau14376 (GenBank:MN119514).

  • A 9-bp indel in TmHSP26-1(GA) translated into a deletion of SPM amino acid segment in chloroplast specific conserved consensus region III.

  • High degree of divergence in nucleotide sequence between cultivated and wild species appeared in the form of higher ω values (Ka/Ks >1).

Abstract

Heat shock proteins are an important class of molecular chaperones known to impart tolerance under high temperature stress. sHSP26, a member of small heat shock protein subfamily is specifically involved in protecting plant’s photosynthetic machinery. The present study aimed at identifying and characterizing sequence and structural variations in sHSP26 from genetically diverse progenitor and non-progenitor species of wheat. In silico analysis identified three paralogous copies of TaHSP26 to reside on short arm of chromosome 4A while one homeologue each was localized on long arm of chromosome 4B and 4D of cultivated bread wheat. Wild DD-genome donor Aegilops tauschii carried an additional sHSP26 gene (AET4Gv20569400) which was absent in the cultivated DD genome of bread wheat. In vitro amplification of this novel gene in wild accessions of Ae. tauschii and synthetic hexaploid wheat but not in cultivated bread wheat validated this finding. Further, significant length polymorphism could be identified in exon1 from diverse sHSP26 sequences. Multiple sequence alignment of procured sequences revealed numerous sSNPs and nsSNPs. D3A, P125 L, Q242 K were designated as homeolog specific- while A49 G as non-progenitor specific amino acid replacements. A 9-bp indel in TmHSP26-1(GA) translated into a deletion of SPM amino acid segment in chloroplast specific conserved consensus region III. High degree of divergence in nucleotide sequence between cultivated and wild species appeared in the form of higher ω values (Ka/Ks >1) indicating positive selection during the course of evolution. Phylogenetic analysis elucidated ancestral relationships between wheat sHSP26 proteins and orthologous proteins across plant kingdom. Overall, data mining approach may be employed as an effective pre-breeding strategy to identify and mobilize novel stress responsive genes and distinct allelic variants from wider germplasm collections of wheat to enhance climate resilience of present day elite wheat cultivars.

Introduction

Wheat, a cool season crop, has been found to suffer enormously due to any change in ambient temperature in its growth environment. Global warming has resulted in a slow but gradual increase in daily atmospheric temperature. Lobell and Field (2007) reported that wheat yield decreased globally by ∼5.4% per 10C rise in mean minimum or maximum temperature from 1991 to 2002. Further, climate change is predicted to have a significant impact on temperature (estimated to increase by ±20C by 2050) and precipitation profiles in wheat growing regions around the world. Accordingly there is an urgent need to identify new sources of genetic variation and develop wheat cultivars that are resilient to climatic uncertainties.

Wheat experiences heat stress to varying degrees during different phenological stages of its development. Photosynthesis is one of the most sensitive physiological processes to elevated temperatures as any sudden spike in day temperature alters structural integrity and normal functioning of the chloroplast (Hugly et al., 1989.; Olmos et al., 2007). Oxidative burst, as one of the by-product of heat stress induced redox perturbation leads to lipid peroxidation in thylakoids membranes resulting in increased membrane fluidity and dislodging of light harvesting complex, especially photosystem II (Al‐Khatib and Paulsen, 1984; Prasad et al., 2008). This impaired steady state physiology is accompanied by protein degradation and inactivation of the enzymes involved in chlorophyll biosynthesis, tetrapyrrole metabolism (Tewari and Tripathi, 1998) and carbon assimilation (Xue et al., 2008). Accelerated leaf senescence reduces net supply of photo assimilates to the developing kernels affecting starch deposition. Accordingly, wheat grown in tropical climates of India is particularly prone to yield penalty on account of rising diurnal temperature and concomitant drought stress developed due to increased transpiration and surface evaporation especially during the grain filling period.

Of the diverse endogenous tolerance mechanisms developed by the plants during the course of evolution, production of heat shock proteins under high temperature stress is well known. Heat shock proteins are an evolutionary conserved class of proteins, divided into sub-families based on their apparent molecular weight i.e. HSP100, HSP90, HSP70, HSP60 and small HSPs (Boston et al., 1996). They have been found to play a very significant role in preventing misfolding of native proteins and aggregation of denatured proteins under stress (Nakamoto and Vigh, 2007). Of these, small heat shock proteins (size ranging from 15 kDa to 40 kDa) have been recognized as the first line of defense in the cell when proteins begin to misfold (Hilton et al., 2012). Sixteen sub-families of sHSPs have been identified in higher plants, with 11 cytoplasmic or nuclear localized (CI –CXI) and five organelle-localized, each of which is known to encode a protein destined for a specific cellular compartment (Waters et al., 1996). sHSP26 is a nuclear encoded protein that consists of a variable amino terminal region, a more conserved carboxyl terminal region (that harbors heat shock domain, consensus region I, consensus region II) and a consensus region III, also referred to as methionine bristles. The N-terminal region bearing the leader sequence (specific for each sub-cellular organelle) is highly diverse, with functional role implicated in substrate binding (Giese et al., 2005). The 90–100 amino acid long HSP20 domain (α-crystalline domain) is essentially conserved among different subfamilies of sHSP and is suggested to be important for oligomerization (Sun and MacRae, 2005; Basha et al., 2012). However, methionine rich amphipathic α-helix has been found to be exclusively present in sHSP polypeptides functional in chloroplast (Chen and Vierling, 1991; Waters, 1995). The ability of these methionine bristles to undergo oxidation dependent conformational changes may protect chloroplast from oxidative stress (Harndahl et al., 1999). sHSP26 is synthesized as a precursor polypeptide, which when transported to chloroplast gets associated with either soluble stroma portion or grana region of the thylakoid membrane and plays its part in enhancing the photochemical efficiency of PSII against heat and accompanied oxidative stress (Lee et al., 2000; Chauhan et al., 2012; Kim et al., 2012). The first complete sequence of this low molecular weight (LMW) chloroplast HSP from wheat was reported in 1991 (Weng et al., 1991), followed by the identification of its novel isoform in 1993 (Nguyen et al., 1993).

Up-regulation in the expression of heat stress responsive transcription factors (HSFs) and induction of heat shock proteins (HSPs) upon exposure to elevated temperature is well established (Chauhan et al., 2011; Waters, 2013; Abu-Romman, 2015). In order to understand the importance of HSP26 in acquired thermotolerance, a few studies concerned with development of transgenic (Chauhan et al., 2012), promoter analysis (Khurana et al., 2013), variability in the expression of heat shock proteins in durum (Yildiz and Terzi, 2008; Rampino et al., 2009) and wild species (Rampino et al., 2018) have been carried out in wheat. Hu et al., 2015a demonstrated ZmHSP26 to strongly interact with four chloroplastic proteins, namely ATP synthase subunit β, chlorophyll a–b binding protein, oxygen evolving enhanced protein I and PSI reaction center subunit IV to protect chloroplast under high temperature stress. Comastri et al., 2018 has recently isolated and molecularly characterized homologs of sHSP26 from T. durum cultivars and identified SNPs using TILLING online resource and KASP markers.

The structural-, functional- and evolutionary relationships of sHSPs across the plant kingdom have been extensively studied past two decades and are well documented (Waters 1195, 1996, Waters et al., 2008a; Safdar et al., 2012; Bondino et al., 2012). In recent years, several studies have reported genome wide identification of HSPs in general and sHSP gene family in particular in different plants like Arabidopsis (Scharf et al., 2001; Siddique et al., 2008), sugarcane (Borges et al., 2007), populus (Waters et al., 2008a), rice (Sarkar et al., 2009), CAM plants (Shakeel et al., 2012), soybean (Lopes-Caitar et al., 2013), pepper (Guo et al., 2015), barley (Pandey et al., 2015), setaria (Singh et al., 2016), wheat (Pandey et al., 2015; Muthusamy et al., 2017; Wang et al., 2017). Genome wide analysis of HSFs in T.aestivum (Xue et al., 2014), T.urartu and Ae. tauschii with spatial variation in expression has also been conducted (Yang et al., 2014). Though Muthusamy et al., 2017 carried out a comprehensive analysis of sHSP gene family in bread wheat genome and Wang et al., 2017 gave an evolutionary angle to the theme by including sHSPs sequences from related wild wheat species, but exploring the degree of sequence divergence and extent of structural variations in chloroplast specific sHSP26 as the molecular basis of the inherent thermo tolerant nature of wild wheat relatives did not emerge as the primary focus of these studies.

Wheat is an allohexaploid, combining genome of three diploid grass species, A-genome of T. urartu, B-genome from a species within the sitopsis section and related to Ae. speltoides and D-genome of Ae. tauschii (Dvorak and Zhang, 1992; Feldman et al., 1995). Wild wheat species have been found to be more tolerant to abiotic stress constraints (like high temperature, drought, and salinity stress) than cultivated wheat (Vierling and Nguyen, 1990, 1992, Ehdaie and Waines, 1992; Zaharieva et al., 2001; Dulai et al., 2006; Gupta et al., 2010; Pradhan et al., 2012; Awlachew et al., 2016; Suneja et al., 2015, 2017; Suneja et al., 2019). Wheat breeding efforts have generally been directed towards assessing acquired thermotolerance potential of genetically diverse accessions or genotypes in a segregating population using different empirical traits like cell membrane stability (Ibrahim and Quick, 2001), TTC based cell viability (Fokar et al., 1998), chlorophyll fluorescence, stay green habit, canopy temperature depression (Reynolds et al., 1998). Research into physiological and biochemical basis of crop adaptation to stress is well established (Fischer and Turner, 1978; Richards et al., 2002) with molecular technologies adding a new dimension to it (Chaves et al., 2003). Since the rice genome was completely sequenced more than ten years ago, the reference genome sequence of several other major crops- including barley, millet, maize, sorghum, pearl millet, brachypodium, secale cereale, soybean, chickpea, potato, tomato, brassica etc. have been reported (Bolger et al., 2014). International wheat genome sequencing consortium has already sequenced 16 Gb long genome of hexaploid wheat (IWGSC, 2014; Marcussen et al., 2014). Genome sequence of Ae. tauschii (Luo et al., 2012; Jia et al., 2013) and T. urartu (Ling et al., 2013) has also been published.

Successful implementation of international efforts like 1000 Genome project, HapMap Project (in humans) and 1001 Genomes for Arabidopsis (Weigel and Mott, 2009) have demonstrated the -significance of single nucleotide polymorphism (SNPs) as the basis of genetic variation within a species or a population and subsequently brought re-sequencing efforts to the forefront. With sequencing becoming cost effective (and molecular applications like Genotyping by Sequencing (GBS) proving their mettle in revealing intraspecific variation (Vikram et al., 2016; Singh et al., 2018), followed by parallel advancement in genome analysis tools, rapid inspection of sequence variation is now within the reach of crop scientists. Data mining approach may thus be used as a surrogate to identify new sources of genetic variability to be utilized in pre-breeding programs to improve plant performance for agronomic traits of economic importance. With the annotated genome sequence of wheat and its wild relatives becoming publically available, gene information can now be effectively utilized for dissecting specific traits of interest. Allelic richness of wild species and old landraces can accordingly be examined for the key genetic loci to identify a potentially adaptive allele, which if introduced into a useful genetic background, may add to the efficiency of persisting stress tolerance mechanisms and enhance overall performance of the elite cultivar (Bansal et al., 2014; Brozynka et al., 2016). Wide hybridization, alien introgression and cisgenics offer good prospects for using retrieved genomic information to breed heat-tolerant crops, particularly wheat.

Therefore, the present study was planned to identify genomic homeologs and orthologs from genetically diverse wild and cultivated relatives of wheat and explore the extent of sequence level variations in sHSP26 gene accumulated during the course of wheat polyploidization. Sequences from whole genome assemblies of released cultivars- Cadenza, Claire, Robigus, Paragon (hexaploid bread wheat) and Kronos, Svevo (tetraploid durum wheat) were also included (keeping in mind the importance of resequencing) to further confirm species specific SNPs identified from Chinese Spring database (and identify cultivar-specific differences, if any) . Allelic diversity in the coding region of sHSP26 sequences provided novel insights into molecular evolution of sHSP26 gene within the triticeae tribe. Amplification of in silico identified sub-genome specific copies of sHSP26 gene and its homologs across 18 wild and cultivated wheat accessions allowed validation of key results obtained from data mining approach. It also shed light on how diversity in the nucleotide sequence translated into differences in amino acid composition of conserved domains and motifs that might influence functionality of this heat shock protein under stress. Estimation of Ka/Ks between all gene pairs and phylogenetic analysis illustrated sub-genome specific clustering of sHSP26 proteins and their evolutionary diversification.

Section snippets

Data resources and identification of sHSP26 gene from wheat species

The exploration of nucleotide sequences for wheat sHSP26 in GenBank revealed presence of seven different cDNA sequences submitted for wheat chloroplastic sHSP26, namely HM802264 from C-306, and six variants from winter wheat cultivar Mustang i.e. four sequences designated as E-(AF097656), G-(AF097657), I- (AF097658), M-allele (AF097659), along with X58280 and X67328 in NCBI database. AF097657 was chosen as the reference sequence for the present study since it had the longest cDNA with an ORF

Identification, chromosome localization and structure analysis of sHSP26 gene sequences across wheat species

Database search enabled the identification of five homeologous copies of sHSP26 on chromosome number ‘4’ of the three sub-genomes of wheat (4AS, 4BL and 4DL) (Table 2). With respect to copy number differences within the respective sub-genomes, three copies of sHSP26 (in tandem) were identified on short arm of chromosome 4A as against one copy of the gene located on 4BL and 4DL. Gene structure analysis revealed sHSP26 to be made of two exons interspersed by an intron (Fig. 1A and Supplementary

Conclusions

The present study reports identification and in silico characterization of HSP26 allelic variants in wild relatives of wheat. Nuclear encoded sHSP26 is an important member of sHSP20 gene family, product of which is found to function exclusively in chloroplast. 77 sHSP26 gene sequences from 11 species of wheat were identified and analyzed. Three paralogous copies of sHSP26 were found on 4AS and one each on 4BL and 4DL of respective sub-genome of cultivated hexaploid bread wheat. An additional

References (99)

  • M. Siddique et al.

    The plant sHSP superfamily: five new members in Arabidopsis thaliana with unexpected properties

    Cell Stress Chaperones

    (2008)
  • E.R. Waters et al.

    Comparative analysis of the small heat shock proteins in three angiosperm genomes identifies new subfamilies and reveals diverse evolutionary patterns

    Cell Stress Chaperones

    (2008)
  • S. Abu-Romman

    Genotypic response to heat stress in durum wheat and the expression of samll HSP genes

    Rend. Fis. Acc. Lincei

    (2015)
  • K. Al‐Khatib et al.

    Mode of high temperature injury to wheat during grain development

    Physiol. Plant.

    (1984)
  • S.F. Altschul et al.

    Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

    Nuc. Acid Res.

    (1997)
  • Z.T. Awlachew et al.

    Transfer and mapping of the heat tolerance component traits of Aegilops speltoides in tetraploid wheat Triticum durum

    Mol. Breed.

    (2016)
  • K.C. Bansal et al.

    Genomic resources for breeding crops with enhanced abiotic stress tolerance

    Plant Breed.

    (2014)
  • H.G. Bondino et al.

    Evolution and functional diversification of the small heat shock protein/α-crystalline family in higher plants

    Planta

    (2012)
  • R.S. Boston et al.

    Molecular chaperones and protein folding in plants

    Post-transcriptional Control of Gene Expression in Plants

    (1996)
  • M. Brozynka et al.

    Genomics of crop wild relatives: expanding the gene pool for crop improvement

    Plant Biotechnol. J.

    (2016)
  • T. Cadalen et al.

    An inter- varietal molecular marker map in Triticum aestivum L. Em.THel and comparison with a map from a wide cross

    Theor. Appl. Genet.

    (1997)
  • K.S. Caldwell et al.

    Sequence polymorphism in polyploid wheat and their D genome diploid ancestor

    Genetics

    (2004)
  • H. Chauhan et al.

    The wheat chloroplastic small heat shock protein (sHSP26) is involved in seed maturation and germination and imparts tolerance to heat stress

    Plant Cell Environ.

    (2012)
  • H. Chauhan et al.

    Identification and characterization of high temperature stress responsive genes in bread wheat (Triticum aestivum L.) and their regulation at various stages of development

    Plant Mol. Biol.

    (2011)
  • M.M. Chaves et al.

    Understanding plant responses to drought- from genes to the whole plant

    Funct. Plant Biol.

    (2003)
  • Q. Chen et al.

    Analysis of conserved domains identifies a unique structural feature of a chloroplast heat shock protein

    Mol. Gen. Genet.

    (1991)
  • F. Choule et al.

    Structural and functional partitioning of bread wheat chromosome 3B

    Science

    (2014)
  • A. Comastri et al.

    Heat in wheat: exploit reverse genetic techniques to discover new alleles within the Triticum durum sHSP26 family

    Front. Plant Sci

    (2018)
  • F. Corpet

    Multiple sequence alignment with hierarchical clustering

    Nuc. Acid Res.

    (1988)
  • J. Dubcovsky et al.

    Genome plasticity a key factor in the success of polyploid wheat under domestication

    Science

    (2007)
  • S. Dulai et al.

    Effects of drought on photosynthetic parameters and heat stability of PSII in wheat and in Aegilops species originating from dry habitats

    Acta Biologica Szegediensis

    (2006)
  • J. Dvorak et al.

    Reconstruction of the phylogeny of the genus Triticum from variation in repeated nucleotide sequences

    Theor. Appl. Genet.

    (1992)
  • B. Ehdaie et al.

    Heat resistance in wild Triticum and Aegilops

    J. Genet. Breed.

    (1992)
  • M. Feldman et al.

    Wheats

  • R.A. Fischer et al.

    Plant productivity in the arid and semiarid zones

    Annu. Rev. Plant Physiol.

    (1978)
  • M. Fokar et al.

    Heat tolerance in spring wheat. I estimating cellular thermotolerance and its heritability

    Euphyt

    (1998)
  • K.C. Giese et al.

    Evidence for an essential function of the N terminus of a small heat shock protein in vivo, independent of in vitro chaperone activity

    PNAS

    (2005)
  • M. Guo et al.

    Genome-wide analysis of the CaHsp20 gene family in pepper: comprehensive sequence and expression profile analysis under heat stress

    Front. Plant Sci.

    (2015)
  • S. Gupta et al.

    Genotypic variation for cellular thermotolerance in Aegilops tauschii Coss., the D-genome progenitor of wheat

    Euphyt

    (2010)
  • G.R. Hilton et al.

    Small heat-shock proteins: paramedics of the cell

    Molecular Chaperon

    (2012)
  • B. Hu et al.

    GSDS 2.0: an upgraded gene feature visualization server

    Bioinformatics

    (2015)
  • S. Hugly et al.

    Enhanced thermal tolerance of photosynthesis and altered chloroplast ultrastructure in a mutant of Arabidopsis deficient in lipid desaturation

    Plant Physiol.

    (1989)
  • A.M.H. Ibrahim et al.

    Genetic control of high temperature tolerance in wheat as measured by membrane thermal stability

    Crop Sci.

    (2001)
  • International Wheat Genome Sequencing Consortium

    A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome

    Science

    (2014)
  • Z. Jia et al.

    Aegilops tauschii draft genome sequence reveals a gene repertoire for wheat adaptation

    Nature

    (2013)
  • D.T. Jones et al.

    The rapid generation of mutation data matrices from protein sequences

    Comput. Appl. Biosci.

    (1992)
  • N. Khurana et al.

    Wheat chloroplast targeted sHSP26 promoter confers heat and abiotic stress inducible expression in transgenic Arabidopsis plants

    PLoS One

    (2013)
  • K.H. Kim et al.

    Overexpression of a chloroplast-localized small heat shock protein OsHSP26 confers enhanced tolerance against oxidative and heat stresses in tall fescue

    Biotechnol. Lett.

    (2012)
  • S. Kumar et al.

    MEGA 7: molecular Evolutionary Genetic Analysis version 7.0 for bigger datasets

    Mol. Biol. Evol.

    (2016)
  • Cited by (0)

    View full text