- Open Access
Two sides of the same coin? Unraveling subtle differences between human embryonic and induced pluripotent stem cells by Raman spectroscopy
Stem Cell Research & Therapyvolume 8, Article number: 271 (2017)
Human pluripotent stem cells, including embryonic stem cells and induced pluripotent stem cells, hold enormous promise for many biomedical applications, such as regenerative medicine, drug testing, and disease modeling. Although induced pluripotent stem cells resemble embryonic stem cells both morphologically and functionally, the extent to which these cell lines are truly equivalent, from a molecular point of view, remains controversial.
Principal component analysis and K-means cluster analysis of collected Raman spectroscopy data were used for a comparative study of the biochemical fingerprint of human induced pluripotent stem cells and human embryonic stem cells. The Raman spectra analysis results were further validated by conventional biological assays.
Raman spectra analysis revealed that the major difference between human embryonic stem cells and induced pluripotent stem cells is due to the nucleic acid content, as shown by the strong positive peaks at 785, 1098, 1334, 1371, 1484, and 1575 cm–1, which is enriched in human induced pluripotent stem cells.
Here, we report a nonbiological approach to discriminate human induced pluripotent stem cells from their native embryonic stem cell counterparts.
Human pluripotent stem cells (hPSCs) include embryonic stem cells (ESCs), derived from the inner cell mass of the blastocyst , and induced pluripotent stem cells (iPSCs), generated from adult somatic cells by forced expression of a specific set of reprogramming factors [2, 3]. hPSCs have the ability to undergo self-renewal indefinitely while retaining the capability to differentiate into cells of all three germ layers, both in vivo and in vitro . These features make hPSCs effective and advantageous cell sources for many biomedical applications, such as cell transplantation therapy , regenerative medicine , drug testing , and disease modeling . The production and derivation of human ESCs (hESCs), however, engenders significant ethical limitations since one needs to sacrifice an embryo to harvest these cells. In addition to the ethical concerns, another crucial bottleneck for hESC-based therapy is the risk of potential allogeneic immune rejection of hESC-derived cells by recipients after cell transplantation . The use of hESCs for disease modeling often involves the generation of mutant cell lines by homologous recombination for in-vitro disease recapitulation . The discovery of iPSCs, as patient-specific PSCs, has overcome both of these ethical and immunological issues, making iPSCs attractive candidates to complement ESCs in research and clinical studies. Additionally, in the context of human disease modeling, patient-specific iPSCs offer advantages over human recombinant and genetically modified ESCs since hiPSCs carry the genetic and epigenetic background of the patient from which they were derived. From a morphological point of view, ESCs and iPSCs are indistinguishable; functionally, iPSCs can differentiate into cells of any of the three germ layers, like ESCs. However, despite the large similarity between ESCs and iPSCs, it is still debated whether they are molecularly equivalent . Here, we use Raman spectroscopy to perform a comparative analysis of a human iPSC (hiPSC) line reprogrammed from skin fibroblasts and the commercially available hESC line H9 in order to identify specific biochemical signatures capable of discriminating between reprogrammed and native pluripotent stem cells. Raman spectroscopy is a label-free optical technique relying on inelastic light scattering, and able to provide characteristic fingerprints of biomolecules. Recently, Raman micro-spectroscopy has been employed successfully for several biochemical studies, such as lipid droplet overexpression in colorectal cancer stem cells , drug screening during stem cell differentiation , discrimination of tumor/healthy tissues , grading of live osteosarcoma cells , detection of hepatic maturation in mesenchymal stromal cells undergoing hepatic differentiation , screening of hESCs , lipidomics in living cells and tissues [18, 19], tumor-derived exosomes , different proteomics issues , as well as comparison of hESCs and hiPSCs with their differentiated derivatives . Further applications of Raman spectroscopy in biochemistry and cell biology have been identified . The results of Raman analysis performed on human reprogrammed and ESC lines indicate that although the overall spectral behavior is very similar for both cell lines, the hiPSC spectra still exhibit distinctive Raman features which allow for their discrimination from the native counterpart.
Generation and characterization of human induced pluripotent stem cells
A dermal biopsy specimen was collected after obtaining written informed consent from a healthy individual and cultured on a Petri culture dish. Fibroblasts were isolated and expanded by the outgrowth method in DMEM supplemented with 10% FBS. Fibroblasts were passaged twice and then infected for iPSC generation. Reprogramming of fibroblasts to pluripotency was performed by nonintegrating Sendai-virus-mediated transfection of the four canonical transcriptional factors (OCT4, SOX2, KLF4, and c-MYC) (CytoTune2.0 Sendai vectors; Thermo Scientific). Briefly, 3 × 105 fibroblasts were infected at a multiplicity of infection (MOI) of 5, yielding different iPSC clones generated under feeder-independent conditions on Matrigel-coated dishes (BD Biosciences). Generated hiPSCs were stained for alkaline phosphatase (AP) activity (Additional file 1: Figure S1A) and subsequently picked manually for culture and propagation. Prior to performing pluripotency assays, generated hiPSCs were tested for loss of Sendai virus transgenes by RT-PCR (Additional file 1: Figure S1B). The pluripotency of generated hiPSCs and hESCs was evaluated by qRT-PCR for expression of the endogenous pluripotency genes OCT4, SOX2, c-MYC, REX1, and NANOG (Additional file 1: Figure S1C) and pluripotency markers Oct4 and Nanog by immunostaining (Additional file 1: Figure S1D). To further assess the pluripotency of both stem cell lines used in this study, we performed a genome-wide gene expression profile assay according to the PluriTest algorithm  (Additional file 1: Figure S1E). Additionally, generated hiPSCs and hESCs were tested for markers of the three germ layers, Nestin (ectoderm), Brachyury (mesoderm), and Sox17 (endoderm), on whole embryoid bodies (EBs) by immunostaining (Additional file 1: Figure S1F) and by qRT-PCR for endoderm (SOX7), mesoderm (HAND1, ACTA2, and MYL2), and ectoderm (NESTIN and BMP4) expression markers (Additional file 1: Figure S1G). hESCs (H9) were purchased from the WiCell Research Institute, and this cell line was used as a control throughout our experiments. Before performing experiments, all cell lines were tested for mycoplasma contamination.
Human iPSCs and ESCs were cultured on Matrigel-coated (BD Biosciences) dishes in mTeSR1 medium (STEMCELL Technologies, Vancouver, BC, Canada). Cells were maintained at 37 °C in 5% CO2 in a humidified incubator. The culture medium was changed daily, and cells were passaged every 4–6 days (at 70% confluence) with Gentle Cell Dissociation reagent (STEMCELL Technologies).
Reverse transcription PCR and quantitative real-time PCR
Reverse transcription PCR (RT-PCR) was used for Sendai viral transgene detection in infected parental fibroblasts (ipF) and loss in hiPSCs compared to their uninfected parental cells (pF). Quantitative reverse transcription PCR (qRT-PCR) was used to assess the expression of pluripotency genes as well as genes of the three germ cell layers. Total RNA was extracted using Trizol reagent (Thermo Fisher Scientific, Waltham, MA, USA) following the manufacturer’s instructions. One microgram of RNA was used for cDNA synthesis using a High-Capacity cDNA Reverse Transcription kit (Applied Biosystems). Gene expression was quantified by qRT-PCR analysis using 1 μl of the RT product and Power SYBR Green Master Mix (Applied Biosystems). qRT-PCR was performed in a StepOne Plus instrument (Applied Biosystems), and the gene expression levels were normalized to the glyceraldehyde 3-phosphate dehydrogenase (GAPDH) housekeeping gene. The gene expression and relative fold-change (Fc) patterns were assessed by the 2–ΔΔCt method. The primers used in this work are presented in Table 1.
Genome-wide gene expression profile
For PluriTest assays, RNA was extracted from hiPSCs and hESCs using the Stratagene Absolutely RNA kit. Total RNA (0.5 μg) was processed with an Illumina TotalPrep RNA Amplification Kit (Thermo Scientific) following the manufacturer’s instructions. The antisense RNA (aRNA) product was hybridized to the Human HT-12v4 Expression BeadChip Kit (Illumina) and run in an iSCAN system (Illumina). The raw data were uploaded to the PluriTest website (http://www.pluritest.org) and analyzed with the PluriTest algorithm .
For immunocytochemistry, hiPSCs and hESCs were fixed in 4% (vol/vol) paraformaldehyde (PFA) and subjected to immunostaining using the following primary antibodies: human Oct4 (1:400, mouse monoclonal; STEMCELL Technologies), human Nanog (1:1000, rabbit polyclonal; Abcam), human Nestin (1:1000, mouse monoclonal; STEMCELL Technologies), human Brachyury (1:20, goat polyclonal; R&D systems), and human Sox17 (1:20, goat polyclonal; R&D systems). Incubation with primary antibodies was performed overnight at 4 °C. After rinsing with Dulbecco’s phosphate-buffered saline (DPBS), goat anti-mouse Alexa-Fluor-647, donkey anti-Goat Alexa-Fluor-594, and goat anti-rabbit Alexa-Fluor-488-conjugated secondary antibodies (all from Thermo Scientific) were added, and cells were incubated for 1 hour at 37 °C. Nuclei were counterstained with 4′-6-diamidino-2-phenylindole (DAPI). Slides were mounted with Fluorescent mounting medium (Dako Cytomation), and microscopy was performed using imaging systems (DMi8), filter cubes, and software from Leica microsystems.
DNA and RNA analyses for nucleic acid quantification and gel electrophoresis
Genomic DNA (gDNA) from hiPSCs and hESCs was extracted using a GenElute Mammalian Genomic DNA Miniprep kit (Sigma Aldrich, Saint Louis, MO, USA), while total RNA was extracted using an Absolutely RNA Miniprep kit (Agilent Technologies). Prior to DNA/RNA extraction, hiPSCs and hESCs were counted, and 4 × 105 cells were processed for nucleic acid purification. DNA and RNA samples were eluted in an equal volume of elution buffer, and 1 μl of each DNA/RNA sample was used for quantification by a NanoDrop spectrophotometer (Thermo Fisher Scientific); 0.5 μg of each RNA and DNA sample were loaded onto 1% agarose gels for electrophoresis and mass quantification. Nucleic acid purification and agarose gel electrophoresis were performed in biological triplicate for each cell line tested.
For mitochondrial labeling and activity, hiPSCs and hESCs were incubated for 30 minutes at 37 °C with 100 nM MitoTracker Green FM (Thermo Fisher Scientific) diluted in growth medium (mTeSR1; STEMCELL Technologies). Fluorescence was measured with a Leica imaging system (DMi8), and the fluorescence intensity (magnification × 20) was analyzed using Leica LAS-X software. The results are presented as the mean ± standard deviation (SD) of three independent experiments.
Cell proliferation assay by CFSE
Cell proliferation assays of hiPSCs and hESCs were evaluated by the 5,6-carboxyfluorescein diacetate succinimidyl ester (CFSE) method. Briefly, 5 × 105 cells were labeled with 8 μM CellTrace CFSE (cell proliferation kit; Thermo Fisher Scientific) in mTeSR1 medium for 10 minutes at 37 °C. Labeling was quenched by adding cold PBS with 0.1% bovine serum albumin (BSA) to cells, followed by a 5-minute incubation on ice. Two hours later (T0) and after 4 days (T4) of culture in mTeSR1 medium, cells were harvested for CFSE fluorescence evaluation by flow cytometric analysis (BD LSRFortessa x-20). Cell proliferation was calculated by measuring the decrease in label intensity in successive daughter cell generations . The proliferation index and cell populations of parental or successive generations were calculated with Modfit LT Version 3.2 software.
Propidium iodide staining for cell cycle analysis
Analysis of the cell cycle status was performed by flow cytometry on cells labeled with propidium iodide (PI), a fluorescent intercalating agent that is used to assess the DNA content during the cell cycle. For this assay, hiPSCs and hESCs were treated by Accutase for single cell dissociation, and 5 × 105 cells were harvested in PBS and alcohol-fixed with 70% cold ethanol at 4 °C for 30 minutes. After fixation, cells were washed three times with cold PBS, spun, and treated with PBS containing 0.1% Triton, 5 μg/ml PI, and 5 μg/ml ribonuclease for 1 hour in the dark. PI-stained cells were then analyzed by flow cytometry (BD LSRFortessa x-20) for proliferation and cell cycle distribution estimation.
Karyotype analysis of hiPSCs and hESCs was performed by multiplex-fluorescence in-situ hybridization (M-Fish). Cells were treated with KaryoMAX Colcemid solution (Thermo Fisher Scientific) and processed with standard methods. Briefly, fixed cells dropped onto glass slides were hybridized with the 24XCyte Human Multicolor FISH Probe Kit (MetaSystems, Altlussheim, Germany), following the manufacturer’s instructions. Slides were denatured in 0.07 N NaOH and then rinsed in graded ethanol. Meanwhile, the probe mix was denatured in a MJ mini personal thermal cycler (Bio-Rad Laboratories, Hercules, CA, USA) with the following program: 5 minutes at 75 °C, 30 seconds at 10 °C, and 30 minutes at 37 °C. Samples were then hybridized in a humidified chamber at 37 °C for 48 hours, followed by one wash in saline–sodium citrate (SSC) buffer for 5 minutes at 75 °C and counterstaining with DAPI. Finally, metaphases were visualized and captured using an Axio-Imager Z2 microscope. Karyotyping analysis was performed by means of ISIS software. To determine the karyotype of the hiPSCs and hESCs, 50 metaphase spreads were analyzed.
Embryoid body formation
For EB formation, hiPSCs and hESCs were dissociated into single cells by Accutase (Thermo Fisher Scientific) and cultured on an ultralow attachment plate (Corning) with mTeSR1 medium supplemented with 10 μM Rho-kinase inhibitor Y-27632 (Selleckchem) for 3 days to enable cell aggregation. After 3 days, the medium was switched to DMEM/F12 containing a 20% knockout serum replacement (KSR), 2 mM l-glutamine, 1 × 10−4 M nonessential amino acids, 1 × 10−4 M 2-mercaptoethanol, and 0.5% penicillin and streptomycin (all from Thermo Fisher Scientific). The medium was changed every other day until day 8 . After 8 days in culture as floating EBs, cell aggregates were transferred onto 0.1% gelatin-coated plates (Sigma-Aldrich) and cultured in the same medium for an additional 8 days before collecting the EBs for immunofluorescence and qRT-PCR analyses.
hPSC culture for Raman spectroscopy measurements
For Raman spectroscopy, hiPSCs and hESCs (all at passage P40) were dissociated into single cells by Accutase (Thermo Fisher Scientific), and 4 × 105 cells per cell line were seeded on CaF2 slides because of its negligible Raman signal for 24 hours to allow the cells to adhere to the CaF2 surface in mTeSR1 medium. Prior to Raman measurements, cells were fixed with 3.7% formaldehyde (Sigma-Aldrich) for 15 minutes at room temperature. After incubation in a fixative solution, cells were rinsed with DPBS and kept in distilled water for analysis to reduce the background interference derived from the culture medium.
Raman mapping and spectra preprocessing
Raman imaging was performed with an Alpha-300R microscope from Witec GmbH (Ulm, Germany) equipped with a 532-nm laser source in a backscattering configuration. The total laser power applied to the sample was set to 10 mW to avoid cell photodamage, and light was focused on the sample through a 100×/0.9 NA objective. A 600 lines/mm grating was used for frequency analysis of the backscattered light, with a spectral resolution of approximately 3.0 cm–1. For each measured cell, Raman maps were recorded using a raster scan with a step size of 400 nm, which is close to the optical resolution of the system (≈360 nm) as calculated with the Rayleigh criteria. For each pixel, we used a typical integration time of 2.0 seconds, with a spectral window ranging from 400 to 3100 cm–1. The Raman shift was previously calibrated by measuring a silicon sample and using the sharp Si-peak at 520 cm–1 as a reference. After Raman measurements, the spectra were first divided into two spectral regions: one ranging from 400 to 1800 cm–1, which is the well-known fingerprint region; and a second region ranging from 2600 to 3100 cm–1, in which the CH2 and CH3 stretching vibrations were located. The spectra collected from the surrounding area of the cells were used as background spectra and subtracted from the cell signals. Finally, all of the spectra of one map were normalized to the maximum total spectral area recorded for that specific cell, allowing a comparison of Raman maps recorded from different cells at different times.
Principal component analysis (PCA) and K-means cluster analysis (KCA) were performed on the collected datasets. To compare the results of multivariate analysis between different maps, the spectra from all the probed cells were processed altogether as one single collection, and the computed principal components (PCs) were exactly the same for all maps. The first six PCs, which comprised more than 98% of the total variance, were used to perform KCA, imposing six clusters to be addressed inside the cells (plus one cluster collecting the empty areas outside the cells). Subsequently, pseudo-color images were generated to represent the multivariate results. A specific color was assigned to each cluster and the cluster’s spatial distribution was mapped in the xy space. A custom-developed software package, Raman Tool Set, freely available online (http://ramantoolset.sourceforge.net) , was used to perform all of the spectra preprocessing steps and multivariate analysis.
All experiments were performed at least three times, each in biological replicates. Data were analyzed using GraphPad Prism 6 software, and statistical analysis was performed by Student’s t test. All values are expressed as the mean ± standard error of the mean (SEM) in all figure panels in which error bars are shown, and differences with p < 0.05, p < 0.01, and p < 0.001 were considered statistically significant.
Figure 1 shows two typical cells probed by Raman micro-spectroscopy, one hESC and one hiPSC. The smaller insets display bright-field pictures of the cells recorded by the optical microscope equipping the Raman spectrometer, while the larger pictures show the corresponding color-reconstituted Raman images. The Raman intensities for peaks at 1670 cm–1 (proteins), 2850 cm–1 (lipids), 785 cm–1 (nucleic acids), and the combination of 748 cm–1 with 1585 cm–1 (cytochrome C) are mapped in blue, green, red, and magenta, respectively. The curves reported in the bottom panel represent the overall averaged spectra of the two cells, and the Raman bands used for the pseudo-color pictures are clearly indicated. It is immediately evident that the hiPSC exhibits a more intense Raman signal corresponding to nucleic acids and that this overexpression is localized in well-defined regions inside the cell. To achieve a semiquantitative comparison between hESCs and hiPSCs, multivariate analysis (PCA followed by KCA) was performed over the whole recorded dataset including all of the mapped cells. The loading curves of the first three principal components (PCs) are shown in Fig. 2a. Even if PCA is performed on the relative spectra (i.e., after subtraction of the overall mean spectrum), the PC1 loading curve (upper panel of Fig. 2a) correctly resembles the average cell spectrum because it discriminates between the inner and outer regions of the cells in the Raman maps. By contrast, the following principal components, PC2 and PC3, provide much more information regarding the cell biochemical composition. In the PC2 loading curve (middle panel of Fig. 2a), the positive peaks located at 748, 1127, and 1585 cm–1 are due to cytochrome C (cyt c) vibrations [27, 28]; their sharpness and relatively high intensities are due to the resonant Raman scattering that cyt c undergoes under 532-nm laser light . The peak at 1305 cm–1 could be assigned both to the Amide III vibration [30, 31] as well as to vibrations due to fatty acids [19, 31]. The signature at 1438 cm–1 likely arises in PC2 from the asymmetric behavior of the C–H Raman band at 1440–1450 cm–1 across all spectra: when this latter band is closer to 1440 cm–1, the corresponding C–H vibration is typical of lipids (vice versa for proteins). The 1073 cm–1 Raman feature is typical of a C–C stretch in gauche lipids . Consequently, the PC2 loads are dominated by the resonant scattering of cyt c, with a smaller contribution coming from lipids and disordered lipids (the gauche phase is typical of liquid-state lipids). The PC3 curve instead exhibits strong positive peaks at 785, 1098, 1334, 1371, 1484, and 1575 cm–1, all of which can be ascribed to DNA (and/or RNA) molecules as reported previously [32,33,34]. The Raman peaks at 785 and 1484 cm–1 are due, respectively, to pyrimidine vibrations [32, 33] and purine stretches  and, consequently, are ascribed to Cytosine and Thymine bases and Guanine and Adenine bases, respectively. The 1334 and 1371 cm–1 Raman peaks are caused by aromatic vibrations of DNA bases, while the 1575 cm–1 peak is due to Adenine and Guanine only. Finally, the smaller 1098 cm–1 band originates from the backbone vibration of DNA (PO2– group). Even using the scores of the first six PCs as inputs for KCA, we report a detailed comment on the Raman bands only for the first three PCs since they account for almost 98% of the overall signal variance, while PC4, PC5, and PC6 together account only for 0.5% of the overall signal variance. Furthermore, no noise reduction based on the first six PCs was performed on the data, and consequently KCA assigns spectra to different clusters according to the Euclidean distances in the PC1–PC6 space, but without removing potential features that come from other PCs. KCA is performed on all cells together, imposing a total number of seven clusters. One cluster always collects spectra outside of cells, which are easily discriminated by KCA because of their very small intensities. Consequently, six clusters of spectra are distinguished inside cells. Tests performed with a higher number of clusters resulted in redundant groups without any further information. By contrast, a lower number of clusters produced excessively large variances within each cluster, indicating that a relevant number of spectra are “forcibly” assigned to the groups. The spatial distribution of the six clusters is shown in Fig. 3. Except for the red clusters, the other colors have no direct correspondence with the colors shown in Fig. 1 since they show, respectively, the results of multivariate (Fig. 3) and univariate (Fig. 1) analyses. It is evident that the red and dark red clusters are only present in the hiPSC lines, while hESCs exhibit very few or no red pixels. Other color labels, such as blue, yellow, and green (dark and light), are detected in both cell lines. The different expression levels of the blue and yellow areas in the measured hESCs are likely due to the different cell cycle phases of each single cell before fixation and Raman analysis. Compared to hESCs, hiPSCs show a more uniform behavior. The bottom panel of Fig. 3 is a graphical representation of the average spectra of each cluster, with the color of the curves corresponding to the colors of the clusters. Dark green and dark red curves are not reported because their general behaviors and major peaks are exactly the same as those of the light green and light red curves, but with a smaller intensity. This panel also reports the Raman frequencies of those peaks having different intensities, from the top curve (red) to the bottom curve (green). The red spectrum is characterized by peaks at 785, 1098, 1334, and 1575 cm–1, which are DNA/RNA-base-related vibrations (see the earlier peak assignment discussion for the PC2 and PC3 curves). Moreover, in the red curve, two protein markers show differences from the other spectra: the shoulder at 1465 cm–1, on the right side of the C–H band at 1440–1450 cm–1, is typical of a higher protein content ; and the Amide I band at 1650–1680 cm–1 is centered at 1658 cm–1, at lower frequencies than the other spectra, indicating a higher expression of alpha-helix structures . No changes are detected in the behavior of the peak at 1003 cm–1, assigned to the breathing mode of phenylalanine, and the band at 1244 cm–1, assigned to the Amide III vibration . In the following yellow curve, peaks at 748, 1127, and 1585 cm–1 are highlighted and generated from cyt c molecules, along with a band at 1305 cm–1, likely derived from lipids [18, 19]. Moreover, a small shoulder at 1738 cm–1, compared to the other average spectra, is observed. This latter signal is also typical of lipids; that is, the C = O stretch in ester groups . In the average spectrum of the blue cluster, characteristic signatures are found at 1244 cm–1, with a higher intensity ratio I 1244/I 1305 compared to the other clusters spectra, and at 1608 cm–1, with a small band largely missing in the other curves. As already stated, the former signature (1244 cm–1) is a characteristic vibration of the Amide III band, while the latter signature (1608 cm–1) is the overlapping of ring-breathing modes of phenylalanine and tyrosine with a C = C vibration mode in proteins , which ascribes these signals to proteins. Finally, the green curve has an overall intensity smaller than those of the others, and the profile of the curve, as well as the relative ratios of the main peaks, closely resemble the PC1 loading curve (which is an overall average, as discussed earlier). The significant decrease in intensity is due to the scattering volume: since the green areas correspond to thinner regions of cells (see optical images in Fig. 1), the laser spot along the z axis could be partially outside the cell, thus leading to a smaller amount of scattering molecules. The green areas correspond to outer regions of the cells (such as external membranes) and/or to the most adherent (thin) areas, where organelles are unexpected. It is worth noting that the lack of red clusters in hESCs in Fig. 3 does not mean that DNA/RNA bases are missing in these cells. In fact, Fig. 1 shows DNA/RNA red regions for both cell lines. But the DNA/RNA Raman intensity is so high in the red cluster regions of hiPSCs that it overcomes the other Raman signals, while this is not occurring in the hESCs. A more quantitative evaluation was achieved from KCA by computing the percentage area of each cluster. Comparisons between the percentage values of hESCs and hiPSCs show that, along with a strong variation in nucleic acid content (comprised of the red cluster), no significant changes were observed in the cyt c intracellular distribution (composed of the yellow cluster) between the two cell lines. In sum, cluster analysis of the Raman images conclusively demonstrates that hESCs have much lower nucleic acid content than hiPSCs.
Biological validation of the Raman results
The results of the Raman spectra analysis revealed a specific fingerprint for hiPSCs with respect to hESCs. In particular, major differences were found in the nucleic acid content (DNA and/or RNA), as demonstrated by the spectral features at 785, 1098, 1334, and 1575 cm–1 (all DNA/RNA-base-related peaks), which are significantly higher in hiPSCs compared to their hESC counterparts. Based on the Raman spectroscopy results, we biologically validated these findings using different conventional approaches. First, we verified that the nucleic acid content was indeed higher in hiPSCs compared to hESCs. This validation step was performed by nucleic acid extraction from 4 × 105 cells per group, as described in Methods. After purification, 1 μl of DNA and RNA samples was used for spectrophotometric analysis by a NanoDrop instrument. As shown in Fig. 4a (left graph), the cumulative levels of DNA were significantly higher in hiPSCs compared to hESCs. Analogous results, albeit somewhat less striking, were obtained by measuring the RNA content (Fig. 4a, right graph). To confirm these findings by an independent approach, we loaded 0.5 μg of total DNA and 0.5 μg of total RNA onto a 1% agarose gel for mass quantification. As shown in Fig. 4b, the same trend was observed. Altogether, these results suggest that Raman spectroscopy represents an extremely accurate and sensitive method for detecting even subtle quantitative and qualitative differences among highly homogeneous cell lines. Differences in the DNA level may reflect a different proliferation rate that, in our case, is not age related since the hiPSCs and hESCs used in this study were all at the same passage (P40). Since the DNA level undergoes significant variations during the cell cycle, we further investigated the Raman findings by performing flow cytometry assays based on the dilution of CSFE fluorescence, which relies on intensity-halving at each cell division. In our experimental setting, we measured the CFSE intensity at time zero (T0) to define the first generation (G1, 5 × 105) and reanalyzed the cells at day 4 (T4) to monitor the rounds of cell cycles in subsequent generations. To characterize the CFSE distribution, flow cytometry data were analyzed by Modfit LT Version 3.2 software using a statistical tool known as the proliferation index, which corresponds to the average number of cell divisions that a cell in the original population has undergone . Graphs and proliferation indexes relative to the cell cycle events in each of the cell lines analyzed are shown in Fig. 5a, b. Intriguingly, hESCs occupied a higher position in terms of the proliferation rate, with 83% of cells in G9 after 4 days of culture, while the majority of hiPSCs (73%) were still in G8 at T4. To gain further insight regarding the DNA level and quantify the percentage of hiPSCs and hESCs within the G1, S, and G2/M phases, we performed a cell-cycle kinetic assay by PI staining, followed by flow cytometry. As shown in Fig. 5c, no significant differences in cell cycle progression were detected. More specifically, the cell cycle profiles of the hESC populations showed that 25.9% of cells were in G0/G1, 15.6% were in G2/M, and 51.8% were in the S phase, while the cell cycle profile of hiPSCs showed 21.2% of cells in G0/G1, 24.46% in G2/M, and 43.2% in the S phase. A graph relative to the cell-cycle kinetics is shown in Fig. 5d. Furthermore, we also determined the expression levels of proliferation-associated proteins, such as cyclin A (CCNA2), cyclin B1 (CCNB1), cyclin D (CCND1), and cyclin E (CCNE1), by qRT-PCR analysis (Fig. 5e). The cyclin profiling results followed the same trend in all of the tested pluripotent stem cells, with negligible differences between hiPSCs and hESCs. We further deepened our analysis by performing a karyotype study to exclude chromosomal aberrations in hiPSCs, which are often reported to be a consequence of cellular reprogramming . M-Fish analysis of 50 metaphases ruled out any such abnormalities (Fig. 6a). In addition to the differences in the DNA/RNA level, Raman spectra analysis also highlighted clear peaks at 748, 1127, and 1585 cm–1, all of which are cytochrome C-related and do not exhibit any appreciable cyt c difference between the cell lines. To validate these results, we performed MitoTracker staining to selectively label mitochondria, the inner membrane of which is associated with the cytochrome C complex. As shown in Fig. 6b, c, we could not detect any significant difference in the mitochondria staining intensity between hESCs and hiPSCs, thus supporting the Raman analysis results.
In the present study, Raman spectroscopy was used to perform a comparative analysis of hESCs and hiPSCs. The experimental results demonstrate that even though the pluripotent stem cell lines analyzed are largely equivalent, small but significant differences can be appreciated. Raman spectroscopy is well suited for detecting subtle differences in specific biochemical groups. In our case, a multivariate analysis of peak intensities highlights a dissimilarity between the two pluripotent stem cell lines in terms of the nucleic acid levels, with a higher amount detected in hiPSCs compared to hESCs. At the present date, we do not have a clear mechanistic explanation for these differences; with regards to nucleic acids, we hypothesize that the different epigenetic backgrounds between hiPSCs and hESCs might play at least in part a significant role. ESCs are known to be transcriptionally hyperactive, undergoing major silencing during differentiation ; moreover, differences in chromatin dynamics are likely to occur in iPSCs vs ESCs and have been reported by several groups [41, 42]. Tan et al.  have shown, using a similar approach, that hiPSCs closely resemble the spectral signatures of hESCs. In their elegant work, metabolic differences between the two groups were hypothesized on the basis of Raman analysis. Our study offers a further, more detailed description of the biochemical diversity between hiPSCs and hESCs, thereby providing additional insights into the molecular characteristics of reprogrammed and naïve stem cells.
Overall, our work confirms the usefulness of Raman spectroscopy for achieving a molecular fingerprint in hPSC discrimination, adding a label-free, optical technique to the available biochemical tools. We believe that the experimental method described in this study may contribute to increase the level of sensitivity offered nowadays by more conventional approaches. Furthermore, it may improve the capability to assess the true potential owned by hiPSCs for regenerative medicine, drug screening, and disease modeling, thus complementing other well-known in-vitro and in-vivo approaches.
Bovine serum albumin
5,6-Carboxyfluorescein diacetate succinimidyl ester
Dulbecco’s modified Eagle’s medium
Fetal bovine serum
Human embryonic stem cell
Human induced pluripotent stem cell
Human pluripotent stem cell
K-means cluster analysis
Multiplex-fluorescence in-situ hybridization
Principal component analysis
Polymerase chain reaction
Real-time quantitative PCR
Thomson JA, Itskovitz-Eldor J, Shapiro SS, Waknitz MA, Swiergiel JJ, Marshall VS, Jones JM. Embryonic stem cell lines derived from human blastocysts. Science. 1998;282:1145–7.
Takahashi K, Tanabe K, Ohnuki M, Narita M, Ichisaka T, Tomoda K, Yamanaka S. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell. 2007;131:861–72.
Yu J, Vodyanik MA, Smuga-Otto K, Antosiewicz-Bourget J, Frane JL, Tian S, Nie J, Jonsdottir GA, Ruotti V, Stewart R, Slukvin II, Thomson JA. Induced pluripotent stem cell lines derived from human somatic cells. Science. 2007;318:1917–20.
Wobus AM, Boheler KR. Embryonic stem cells: prospects for developmental biology and cell therapy. Physiol Rev. 2005;85:635–78.
Hanna J, Wernig M, Markoulaki S, Sun CW, Meissner A, Cassady JP, Beard C, Brambrink T, Wu LC, Townes TM, Jaenisch R. Treatment of sickle cell anemia mouse model with iPS cells generated from autologous skin. Science. 2007;318:1920–3.
Kriks S, Shim JW, Piao J, Ganat YM, Wakeman DR, Xie Z, Carrillo-Reid L, Auyeung G, Antonacci C, Buch A, Yang L, Beal MF, Surmeier DJ, Kordower JH, Tabar V, Studer L. Dopamine neurons derived from human ES cells efficiently engraft in animal models of Parkinson’s disease. Nature. 2011;480:547–51.
Matsa E, Rajamohan D, Dick E, Young L, Mellor I, Staniforth A, Denning C. Drug evaluation in cardiomyocytes derived from human induced pluripotent stem cells carrying a long QT syndrome type 2 mutation. Eur Heart J. 2011;32:952–62.
Gramlich M, Pane LS, Zhou Q, Chen Z, Murgia M, Schötterl S, Goedel A, Metzger K, Brade T, Parrotta E, Schaller M, Gerull B, Thierfelder L, Aartsma-Rus A, Labeit S, Atherton JJ, McGaughran J, Harvey RP, Sinnecker D, Mann M, Laugwitz KL, Gawaz MP, Moretti A. Antisense-mediated exon skipping: a therapeutic strategy for titin-based dilated cardiomyopathy. EMBO Mol Med. 2015;7:562–76.
Drukker M, Benvenisty N. The immunogenicity of human embryonic stem-derived cells. Trends Biotechnol. 2004;22:136–41.
Song H, Chung SK, Xu Y. Modeling disease in human ESCs using an efficient BAC-based homologous recombination system. Cell Stem Cell. 2010;6:80–9.
Chin MH, Mason MJ, Xie W, Volinia S, Singer M, Peterson C, Ambartsumyan G, Aimiuwu O, Richter L, Zhang J, Khvorostov I, Ott V, Grunstein M, Lavon N, Benvenisty N, Croce CM, Clark AT, Baxter T, Pyle AD, Teitell MA, Pelegrini M, Plath K, Lowry WE. Induced pluripotent stem cells and embryonic stem cells are distinguished by gene expression signatures. Cell Stem Cell. 2009;5:111–23.
Tirinato L, Liberale C, Di Franco S, Candeloro P, Benfante A, La Rocca R, Potze L, Marotta R, Ruffilli R, Rajamanickam VP, Malerba M, De Angelis F, Falqui A, Carbone E, Todaro M, Medema JP, Stassi G, Di Fabrizio E. Lipid droplets: a new player in colorectal cancer stem cells unveiled by spectroscopic imaging. Stem Cells. 2015;33:35–44.
Langenbach F, Handschel J. Effects of dexamethasone, ascorbic acid and β-glycerophosphate on the osteogenic differentiation of stem cells in vitro. Stem Cell Res Ther. 2013;4:117.
Amharref N, Beljebbar A, Dukic S, Venteo L, Schneider L, Pluot M, Manfait M. Discriminating healthy from tumor and necrosis tissue in rat brain tissue samples by Raman spectral imaging. Biochim Biophys Acta. 2007;1768:2605.
Chiang Y-H, Wu SH, Kuo Y-C, Chen H-F, Chiou A, Lee OK. Raman spectroscopy for grading of live osteosarcoma cells. Stem Cell Res Ther. 2015;6:81.
Wu H-H, Ho JH, Lee OK. Detection of hepatic maturation by Raman spectroscopy in mesenchymal stromal cells undergoing hepatic differentiation. Stem Cell Res Ther. 2016;7(6).
Chan JW, Lieu DK, Huser T, Li RA. Label-free separation of human embryonic stem cells (hESCs) and their cardiac derivatives using Raman spectroscopy. Anal Chem. 2009;81:1324–31.
Freudiger CW, Min W, Saar BG, Lu S, Holtom GR, He C, Tsai JC, Kang JX, Xie XS. Label-free biomedical imaging with high sensitivity by stimulated Raman scattering microscopy. Science. 2008;322:1857–61.
Wu H, Volponi JV, Oliver AE, Parikh AN, Simmons BA, Singh S. In vivo lipidomics using single-cell Raman spectroscopy. Proc Natl Acad Sci U S A. 2011;108:3809–14.
Tirinato L, Gentile F, Di Mascolo D, Coluccio ML, Das G, Liberale C, Pullano SA, Perozziello G, Francardi M, Accardo A, De Angelis F, Candeloro P, Di Fabrizio E. SERS analysis on exosomes using super-hydrophobic surfaces. Microelectron Eng. 2012;97:337–40.
Coluccio ML, Gentile F, Das G, Nicastri A, Perri AM, Candeloro P, Perozziello G, Proietti Zaccaria R, Gongora JS, Alrasheed S, Fratalocchi A, Limongi T, Cuda G, Di Fabrizio E. Detection of single amino acid mutation in human breast cancer by disordered plasmonic self-similar chain. Sci Adv. 2015;1:e1500487.
Tan Y, Konorov SO, Schulze HG, Piret JM, Blades MW, Turner RF. Comparative study using Raman microspectroscopy reveals spectral signatures of human induced pluripotent cells more closely resemble those from human embryonic stem cells than those from differentiated cells. Analyst. 2012;137:4509–15.
Diem M, Romeo M, Boydston-White S, Miljković M, Matthäus C. A decade of vibrational micro-spectroscopy of human cells and tissue (1994–2004). Analyst. 2004;129:880–5.
Müller FJ, Schuldt BM, Williams R, Mason D, Altun G, Papapetrou EP, Danner S, Goldmann JE, Herbst A, Schmidt NO, Aldenhoff JB, Laurent LC, Loring JF. A bioinformatic assay for pluripotency in human cells. Nat Methods. 2011;8:315–7.
Wallace PK, Muirhead KA. Cell tracking 2007: a proliferation of probes and applications. Immunol Invest. 2007;36:527–61.
Candeloro P, Grande E, Raimondo R, Di Mascolo D, Gentile F, Coluccio ML, Perozziello G, Malara N, Francardi M, Di Fabrizio E. Raman database of amino acids solutions: a critical study of extended multiplicative signal correction. Analyst. 2013;138:7331–40.
Read DS, Woodcock DJ, Strachan NJ, Forbes KJ, Colles FM, Maiden MC, Clifton-Hadley F, Ridley A, Vidal A, Rodgers J, Whiteley AS, Sheppard SK. Evidence for phenotypic plasticity among multihost Campylobacter jejuni and C. coli lineages, obtained using ribosomal multilocus sequence typing and raman spectroscopy. Appl Environ Microbiol. 2013;79:965–73.
Johannessen C, White PC, Abdali S. Resonance Raman optical activity and surface enhanced resonance Raman optical activity analysis of cytochrome c. J Phys Chem A. 2007;111(32):7771–6.
Sinibaldi F, Mei G, Polticelli F, Piro MC, Howes BD, Smulevich G, Santucci R, Ascoli F, Fiorucci L. ATP specifically drives refolding of non-native conformations of cytochrome c. Protein Sci. 2005;14:1049–58.
Xie C, Li YQ, Tang W, Newton RJ. Study of dynamical process of heat denaturation in optically trapped single microorganisms by near-infrared Raman spectroscopy. J Appl Phys. 2003;94:6138–42.
De Gelder J, De Gussem K, Vandenabeele P, Moens L. Reference database of Raman spectra of biological molecules. J Raman Spectrosc. 2007;38:1133–47.
Krafft C, Knetschke T, Siegner A, Funk RHW, Salzer R. Mapping of single cells by near infrared Raman microspectroscopy. Vib Spectrosc. 2003;32:75–83.
van Manen H-J, Kraan YM, Roos D, Otto C. Single-cell Raman and fluorescence microscopy reveal the association of lipid bodies with phagosomes in leukocytes. Proc Natl Acad Sci U S A. 2005;102:10159–64.
Ruiz-Chica J, Medina MA, Sánchez-Jiménez F, Ramírez FJ. Fourier transform Raman study of the structural specificities on the interaction between DNA and biogenic polyamines. Biophys J. 2001;80:443–54.
Krafft C, Knetschke T, Funk RHW, Salzer R. Identification of organelles and vesicles in single cells by Raman microspectroscopic mapping. Vib Spectrosc. 2005;38:85–93.
Zolea F, Biamonte F, Candeloro P, Di Sanzo M, Cozzi A, Di Vito A, Quaresima B, Lobello N, Trecroci F, Di Fabrizio E, Levi S, Cuda G, Costanzo F. H ferritin silencing induces protein misfolding in K562 cells: a Raman analysis. Free Radic Biol Med. 2015;89:614–23.
Monti P, Freddi G. Raman spectroscopic studies of silk fibroin from Bombyx mori. J Raman Spectrosc. 1998;29:297–304.
Chan JW, Taylor DS, Zwerdling T, Lane SM, Ihara K, Huser T. Micro-Raman spectroscopy detects individual neoplastic and normal hematopoietic cells. Biophys J. 2006;90:648–56.
Roederer M. Interpretation of cellular proliferation data: avoid the panglossian. Cytometry A. 2011;79(A):95–101.
Mayshar Y, Ben-David U, Lavon N, Biancotti JC, Yakir B, Clark AT, Plath K, Lowry WE, Benvenisty N. Identification and classification of chromosomal aberrations in human induced pluripotent stem cells. Cell Stem Cell. 2010;7:521–31.
Efroni S, Duttagupta R, Cheng J, Dehghani H, Hoeppner DJ, Dash C, Bazett-Jones DP, Le Grice S, McKay RD, Buetow KH, Gingeras TR, Misteli T, Meshorer E. Global transcription in pluripotent embryonic stem cells. Cell Stem Cell. 2008;2:437–47.
Apostolou E, Hochedlinger K. Chromatin dynamics during cellular reprogramming. Nature. 2013;502:462–71.
This research was supported in part by a grant from the Programma Operativo Nazionale 2007/2013: iCARE (Infrastruttura Calabrese per la Medicina Rigenerativa: Generazione di Biobanche per la Preservazione di Cellule Staminali Umane e di Tessuto Osseo per Uso Clinico e Design e Sviluppo di Bioscaffold Innovativi) (PON03PE_00009_2) and by the Programma Operativo Nazionale 2007/2013 PON: “Exchanger: Share Your Science” (PON04a3_00433).
Availability of data and materials
The datasets used and/or analyzed in this study are available from the corresponding author upon reasonable request.
Ethics approval and consent to participate
A skin biopsy was collected from a healthy individual after obtaining written informed consent from the donor. The study was approved by the Ethics Committee of the Magna Graecia University and the Azienda Ospedaliero-Universitaria “Mater Domini”.
Consent for publication
All of the authors have read and approved the manuscript for publication.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Showing generation and characterization of hiPSCs from skin fibroblasts. (A) Representative images of generated iPSCs showing ESC-like morphology and AP activity. (B) RT-PCR analysis showing loss of Sendai viral transgenes in hiPSCs (lane 1), their presence in infected parental fibroblasts (ipF, lane 2), and their absence in noninfected parental fibroblasts (pF, lane 3). (C) qRT-PCR analysis confirming upregulation of the endogenous pluripotency genes OCT4, SOX2, c-MYC, REX1, and NANOG in generated hiPSCs and hESCs, which are used as a control cell line. All expression values normalized to GAPDH and relative to the parental fibroblasts. Data represent mean ± SD. (D) Immunostaining of hESCs and iPSCs for pluripotency markers Oct4 (red) and Nanog (green), with costaining with DAPI (blue). Scale bar = 50 μm. (E) Genome-wide gene expression profiling by PluriTest of undifferentiated hESCs and iPSCs reveals that the lines have a high pluripotency score. (F) Immunostaining of whole EBs (day 25) for Nestin (ectoderm), Brachyury (mesoderm), and Sox17 (endoderm) and costaining with DAPI (blue). Scale bar = 50 μm. (G) qRT-PCR analysis showing potential of generated hiPSCs to differentiate into cells of all three germ layers (SOX7, endodermal marker; HAND1, ACTA2, and MYL2, mesodermal markers; NESTIN and BMP4, ectodermal markers) and downregulation of pluripotency-associated genes (OCT4 and NANOG). All expression values normalized to GAPDH and relative to the respective hiPSC clones (TIF 5709 kb)