Distinct gene expression dynamics in developing and regenerating crustacean limbs
Edited by Denis Duboule, Universite de Geneve, Geneve 4, Switzerland; received October 27, 2021; accepted April 14, 2022
Significance
Some organisms have the fascinating capacity to regenerate lost body parts. To which extent regeneration entails the redeployment of an embryonic developmental program is a long-standing question of regenerative studies, with implications for development, evolution, and regenerative medicine. In this study, we address this question by comparing the global transcriptional dynamics of leg regeneration and leg development in the crustacean Parhyale hawaiensis. We show that despite extensive overlaps in gene usage, the development and regeneration of Parhyale legs show distinct temporal profiles of gene expression that cannot be aligned in a coherent fashion. These results suggest that regeneration does not simply mirror development, but deploys some of the same gene modules in a different overall framework.
Abstract
Regenerating animals have the ability to reproduce body parts that were originally made in the embryo and subsequently lost due to injury. Understanding whether regeneration mirrors development is an open question in most regenerative species. Here, we take a transcriptomics approach to examine whether leg regeneration shows similar temporal patterns of gene expression as leg development in the embryo, in the crustacean Parhyale hawaiensis. We find that leg development in the embryo shows stereotypic temporal patterns of gene expression. In contrast, the dynamics of gene expression during leg regeneration show a higher degree of variation related to the physiology of individual animals. A major driver of this variation is the molting cycle. We dissect the transcriptional signals of individual physiology and regeneration to obtain clearer temporal signals marking distinct phases of leg regeneration. Comparing the transcriptional dynamics of development and regeneration we find that, although the two processes use similar sets of genes, the temporal patterns in which these genes are deployed are different and cannot be systematically aligned.
Sign up for PNAS alerts.
Get alerts for new articles, or get an alert when an article is cited.
Many animals have the capacity to regenerate body parts that have been lost after a severe injury. In some cases regeneration produces faithful replicas of the lost organs, which are indistinguishable from those originally developed in the embryo. This similarity in the outcome of development and regeneration suggests that the processes generating these structures could also be similar, i.e., that regeneration could mirror embryonic development. The fact that both take place within the same organism, relying on the same genome, makes it easy to envisage that the same molecular mechanisms and gene regulatory networks could be used in both cases.
Besides this evident connection, however, there are important ways in which development and regeneration are likely to differ. Development is a stereotypic process, unfolding from a defined starting point, in the stable and well-provisioned environment of the fertilized egg. In contrast, regeneration starts with an injury whose extent and timing are unpredictable. Regeneration also unfolds in the context of adult physiology, e.g., influenced by the nutritional status of the animal, exposure to microbes, as well as circadian, seasonal, and hormonal cycles. For instance, in arthropods, the molting cycle profoundly affects the physiology of the individual and imposes physical constraints on the growth of regenerating structures (1).
In the adult body, the cellular context in which regeneration takes place differs from the embryo: different pools of progenitors are available compared with development, and differentiated cell types such as immune cells and neurons (which are not yet formed at the onset of organ development) are known to play key roles in supporting regeneration (2–6).
Significant differences can also be seen in the scales in which development and regeneration unfold. Embryonic organ primordia are usually hundreds of micrometers to millimeters in size, while adult regenerating organs can be orders of magnitude larger (7). Such differences in size are likely to have an impact on the mechanisms that coordinate cell behavior and cell fate across developing tissues, such as diffusion-based morphogen gradients and long range cell–cell communication. Differences may also exist in the temporal scale over which developmental and regenerative processes unfold.
Despite these differences, numerous studies indicate that development and regeneration could share significant similarities (reviewed in ref. 8). For example, a classic study used reciprocal tissue grafts between developing limb buds and regenerating blastemas in axolotls to reveal similar patterning activities in those tissues (9). Later studies contributing to this debate have compared the roles played by specific regulatory genes (10, 11), the deployment of positional markers (12, 13), and the transcriptional profiles of regenerating tissues (14–16) during development and regeneration, reaching different conclusions.
The crustacean Parhyale hawaiensis presents an excellent system for exploring the relationships between embryonic and regenerative processes, for several reasons. First, Parhyale are able to regenerate their legs with high fidelity; regenerated legs are indistinguishable from the original, unharmed adult legs (17). Second, Parhyale are direct developers (do not undergo metamorphosis), so the adult legs directly derive from the legs developing in the embryo (18). Third, although adult legs are larger than embryonic legs, the leg primordia in embryos and regenerating adults develop on similar spatiotemporal scales. The primordia are in the order of 100 μm in size and consist of a few hundred cells (19, 20). The temporal scales of leg development and regeneration are also similar, spanning 4 to 5 d at 26 °C from primordium/blastema formation to fully patterned leg (19–21). These shared features provide a common framework for comparing development and regeneration in Parhyale and testing to which extent the dynamics of regeneration mirror those of development.
To compare gene usage during development and regeneration on a genome-wide scale we performed RNA sequencing (RNA-seq) on single legs, covering the time course of each process, from early limb buds or freshly amputated legs to fully patterned legs. Our working hypothesis was that some phases of leg regeneration, such as wound closure, may be specific to regeneration, but others like patterning, morphogenesis, growth, and cell differentiation could share significant similarities. Our goals were to: 1) compare expression dynamics on a global scale and determine whether specific phases of leg regeneration can be associated, on the basis of gene expression, with specific phases of leg development; 2) identify sets of genes that are coexpressed in distinct phases of leg development and regeneration and determine whether similar clusters of coexpressed genes are involved in these processes; and 3) determine whether these sets of coexpressed genes are deployed in the same temporal order in the embryo and in the regenerating adult leg.
We expected to recover common temporal patterns of gene expression underpinning the embryonic and regenerative time courses, consistent with the idea that some aspects of leg regeneration redeploy mechanisms used for leg embryonic development. A failure to detect a common temporal order of gene expression would suggest that development and regeneration follow distinct trajectories.
Results
Transcriptional Profiling of Leg Development Reveals Stereotypic Developmental Profiles.
To investigate transcriptional dynamics and assess individual variation in developing embryonic legs we performed RNA-seq on individual fourth thoracic (T4) legs during the time course of leg development, from young limb bud stages to fully patterned and differentiated legs (21) (Fig. 1A). We collected entire T4 legs every 6 h, from 96 to 192 h post fertilization (hpf) (Ef [full] series). In order to account for the progressive regionalization of the primordium, we also collected, when possible, the distal portion of the T4 leg: the distal one-third of the leg from 120 to 138 hpf, and the carpus, propodus, and dactylus from 144 to 192 hpf (Ed [distal] series). The full and the distal leg samples were collected in pairs, from contralateral T4 legs of the same embryos, yielding a total of 70 samples covering the time course of leg development (SI Appendix, Table S1).
Fig. 1.

Principal component analysis (PCA) on the complete RNA-seq dataset (Datasets S3 and S4) showed that the principal axis of gene expression variation, explaining 14% of the variance, strongly correlates with developmental time in both the Ef and the Ed leg samples (PC1, Fig. 1B). A weaker source of variation was linked to specific samples (PC2, see SI Appendix, Fig. S1.1). To probe the strength of the temporal signal in those data, we applied RAPToR, a method that allows predicting the developmental stage of a sample from its gene expression profile, relative to a reference time series (22). We built a reference using the Ef leg samples and used this to estimate the stage of each sample. The predictions of the model match the real developmental age of each sample accurately, not only for the Ef samples (which were used to train the model) but also for the Ed leg samples (Fig. 1C and SI Appendix, Table S2). These results indicate that the dynamics of gene expression in developing legs are highly stereotypic, and that the temporal dynamics captured in the Ef and Ed series are highly coherent with each other.
Further comparing the transcriptional dynamics in the Ef and Ed samples, we found 7,963 and 1,354 genes to be differentially expressed during embryogenesis in Ef and Ed samples, respectively (DESeq2, padj < 0.001, in a total of 43,212 gene models; SI Appendix, Fig. S1.2), with an overlap of 1,121 genes (differentially expressed genes given in Datasets S5 and S6). We attribute the higher number of differentially expressed genes in the Ef samples to the fact that this dataset spans a longer developmental period and includes additional tissues. The tissue dissections to collect the Ed samples were also more challenging, possibly contributing to lower sample quality (SI Appendix, Fig. S1.1 and Table S1). Despite these differences, we noticed a high similarity in gene expression dynamics between these two datasets (Fig. 1D). Overall, this analysis shows that the temporal signal of the distal part of the leg (Ed) is largely recapitulated in the full leg series (Ef).
Transcriptional Profiling of Leg Regeneration Reveals Temporal Dynamics with High Inter-individual Variation.
We performed a similar series of RNA-seq experiments to investigate the temporal dynamics of gene expression during the course of regeneration in adult T4 legs, amputated at the distal end of the carpus. Previous studies have shown that the cellular activity associated with regeneration occurs within 200 μm from the amputation site (19, 23), which in these experiments corresponds approximately to the distal half of the carpus. Samples were collected every 12 h, from the moment of amputation until 120 h post amputation (hpa), when the legs appear to be fully patterned (19). To ensure that we have sampled patterned and differentiated legs, we also collected samples at the onset of expression of a late distal leg marker (the DistalDsRed exon trap, collected ∼150 hpa; 23, 24) and after the first molt following regeneration.
From each of these legs we collected two fragments (Fig. 2A): one consisting of the distal-most end of the leg stump (the carpus, including the blastema and newly regenerating structures; Rd [distal] series) and one from a more proximal podomere (the distal part of the merus; Rp [proximal] series). The Rd samples capture the entire region that participates actively in regeneration (19), while the Rp samples serve as controls, intended to capture transcriptional variations associated with the physiological status of each individual (e.g., molting stage, nutritional state). Overall, we collected 120 samples from 37 individuals (paired samples were collected from the left and right T4 legs in 23 individuals), spanning 13 time points, yielding a total of 60 Rd and 60 Rp samples (Fig. 2A, listed in SI Appendix, Table S1; Datasets S7 and S8).
Fig. 2.

Principal component analysis including both the Rd and Rp series reveals several distinct sources of variation in these data. PC1 captures the difference between the Rd (regenerating) and Rp (control) samples, with the notable exception of the 0 hpa (pre-amputation) and the postmolt Rd samples, which group together with the Rp series (Fig. 2B). This distribution shows that tissues undergoing active regeneration are transcriptionally distinct from the non-regenerating samples.
PC2 reveals marked differences between the groups of samples collected from each individual (particularly in individuals marked in bold, Fig. 2C). This variation reflects real biological differences between individuals, as we find a much higher correlation of gene expression in samples collected from the same individual than among different individuals (Fig. 2C and SI Appendix, Fig. S2.1A). These samples were processed in a randomized order, so these correlations could not arise from postprocessing batch effects. We return to the source of this inter-individual variation in the next section.
PC3 and PC4 capture a temporal signal corresponding to the progress of regeneration in the Rd samples (Fig. 2D and SI Appendix, Fig. S2.2A). On PC3, the transcriptional profile of pre-amputation samples (0 hpa) matches the profile of samples collected at the end of the regenerative time course, consistent with our expectation that regeneration is largely completed by ∼120 hpa and after the following molt.
Principal component analysis on the Rd samples alone captures the temporal signal in PC1 (Fig. 2E) and both temporal and inter-individual variation in PC2 (SI Appendix, Fig. S2.2B). Overall, our analysis reveals that regeneration and inter-individual variation are the major sources of variation in the Rd samples.
Using a reference timeline based on the Rd samples, the stage of regenerating samples (Rd) can be partly predicted based on their transcriptome (linear regression r2 = 0.29) (Fig. 2F and SI Appendix, Table S3). In contrast, most of the control samples (Rp) are assigned to the fully differentiated state, confirming that these samples do not carry a substantial regenerative signal (linear regression r2 ∼ 0; Fig. 2F).
Impact of the Molting Cycle on the Transcriptional Profile of Adult Legs.
We hypothesized that the observed “individual signal” (PC2 in Fig. 2C) is linked to the physiological state of each animal, as it is shared by all the samples collected from each individual. Since molting is a major physiological variable in adult crustaceans, we decided to test directly the impact that the molting cycle might have on the leg transcriptome.
Selecting animals of the same age/size as in the regeneration RNA-seq experiments, we monitored the molting status of 66 animals over two successive molts; we observed that this cohort molted with a mean period of 27 d (SD 7.2 d). We then collected entire T4 legs from 20 of these animals at different stages of the molting cycle (Fig. 3A and SI Appendix, Table S1) and performed RNA-seq on these samples (Datasets S9 and S10). Principal component analysis on these 20 samples shows that different stages of the molt cycle are well separated on PC1 and PC2, representing almost half of the total variation (large circles in Fig. 3B). Major transcriptional changes can be observed in the 5 d that precede molting (orange, brown, and yellow circles in Fig. 3B), followed by more stable transcriptional profiles post molting (blue and purple circles in Fig. 3B).
Fig. 3.

On the principal components describing the molting cycle, we projected the expression data of our regeneration time series in order to assess the molting status of each sample. We observed that most samples are associated with the intermolt phase, except those highlighted in bold in Fig. 2C, which are associated with near-molt stages (Fig. 3B). Molt-associated genes are a major driver of the interindividual variation (seen in PC2, Fig. 2C) in the regenerating leg samples (SI Appendix, Fig. S3.1).
Applying a soft clustering approach (Mfuzz) on the molting cycle dataset, we defined eight distinct sets of coexpressed genes (SI Appendix, Fig. S3.2 for clustering parameters, SI Appendix, Fig. S3.3 for cluster content, and Datasets S11 and S12 for cluster data). The samples collected shortly before molting show the largest changes in gene expression (orange, brown, and yellow phases in Fig. 3C). We identified 131 transcription factors whose expression changes during the molt cycle (SI Appendix, Fig. S3.4 and Dataset S30). These factors, which include the ecdysone receptor and other known mediators of molt responses in arthropods, are prime candidates for future studies on the interplay between molting and regeneration in Parhyale.
These analyses confirm our hypothesis that molting status has a strong transcriptional influence on the regenerating leg transcriptomes. In particular, the imminence of molting deeply modifies the transcriptional state of an adult leg (Fig. 2C).
Disentangling the Transcriptional Signals of Physiology and Regeneration.
To investigate the transcriptional dynamics driven by the regenerative process independently of the physiological/molting status of each animal, we developed a Bayesian modeling approach (using JAGS, ref. 25; model outlined in Fig. 4A) to dissect the contributions of regeneration (R, bold red in Fig. 4A) and the individual’s physiology on the the transcriptome of Rd samples (gray circle in Fig. 4A). Based on the results presented earlier, we assumed that the variation due to an individual’s physiology would be shared by all the samples collected from each individual (Rd and Rp from contralateral T4 legs). In contrast, the variation in gene expression driven by the regenerative process should be specific to each Rd sample. Previous observations suggested that individual limbs can regenerate at different speeds (19); we therefore modeled the regenerative signal separately in each Rd sample, even when we collected them from the left and right T4 legs of the same individual.
Fig. 4.

The regenerative signal R was modeled as an enrichment value, similar to a fold change between Rd and the same individual’s control/physiological signal measured in the Rp samples, taking into account sampling errors/variation (Fig. 4A; R values given in Dataset S13). An R value of 1 conveys that there is no difference in the expression of a given gene between Rd and Rp, R > 1 means that the gene is up-regulated in the regenerating sample, and R < 1 that the gene is down-regulated. Comparing the temporal profiles of R and Rd shows that R values preserve the temporal signal of regeneration but largely reduce the inter-individual variation associated with molting (SI Appendix, Fig. S4.1 A and B).
This modeling approach is successful in extracting the expression dynamics of regeneration from the overall transcriptional variation, without introducing unintended distortions or artifacts in the data (Fig. 4B and C). The principal axes of variation correlate better with regeneration time in the modeled R data compared with the Rd data (SI Appendix, Fig. S4.2), and predictions on the regenerative stage of each sample using RAPToR are also more accurate using R instead of Rd (SI Appendix, Fig. S4.3 A and C; average distance 21 versus 30 h, respectively).
A more targeted approach for removing molt-related variation is to exclude from the Rd dataset the five samples collected close to molting and the genes whose expression is significantly affected by molting (>6,000 genes). RAPToR predictions made using this alternative approach have a similar accuracy with the predictions made using Bayesian modeling on the entire dataset (SI Appendix, Fig. S4.3 B and C). Given that the targeted approach excludes five samples and >6,000 genes from the analysis, we decided to pursue our study using Bayesian modeling.
The regenerative stage of each sample is predicted more accurately in early phases of regeneration (0 to 36 hpa) than in later phases (48 to 120 hpa). This mirrors observations in live imaging experiments, in which wound closure reliably takes place in the first 1 to 2 d after amputation, but the onset of later events varies (19). Given these variations, instead of sample collection time, we decided to use the predictions made by RAPToR to place the regenerating samples on a temporal scale (pseudotime) reflecting each leg’s progress in regeneration, based on its transcriptional profile.
Distinct Transcriptional Dynamics in Developing and Regenerating Legs.
Having captured the transcriptional profiles of leg development and regeneration, we turned our attention to comparing these profiles, to determine the extent to which the dynamics of leg regeneration mirror those of leg development. To render the embryonic data (Ef and Ed) comparable with the modeled data from regenerating legs (R), raw counts in the embryonic datasets were transformed into enrichment values (fold changes) by applying a similar Bayesian modeling as for the R values (see Methods; SI Appendix, Fig. S4.1 D–G, and Datasets S14 and S15); we refer to these transformed data as the E and D series (based on Ef and Ed, respectively). We find that a large proportion of genes showing temporal variation during regeneration also show dynamic expression during leg development (SI Appendix, Fig. S4.4 and Dataset S16).
A combined principal component analysis on the E and the R datasets reveals that, overall, gene expression varies more during development than regeneration: the two major axes of variation (PC1 and PC2) capture well the transcriptional dynamics of leg development, but not the dynamics of leg regeneration (Fig. 5A). There is no obvious alignment between the variation seen in the embryonic and the regenerating time series.
Fig. 5.

Next, we tried to temporally align these datasets using RAPToR. We built a reference time series based on the embryonic leg (E) data and tested whether the regenerating leg (R) samples can be aligned to this reference. Pre-amputation and postmolt samples are consistently assigned to the latest stages of the leg development series, as expected of fully differentiated leg tissues (Fig. 5B and SI Appendix, Table S4). The other samples are inconsistently placed on the developmental series, some matching early and some later phases of development with no obvious pattern, suggesting that there is no straightforward relation between the phases of leg regeneration and leg development.
When using PCA and RAPToR, global expression profiles could be dominated by specific groups of genes that show strong differential expression (e.g., terminal differentiation genes), concealing relevant expression dynamics that occur on a smaller scale (e.g., in genes involved in patterning, the control of cell proliferation, and morphogenesis). To dissect the temporal dynamics of genes associated with different phases of development and regeneration, we turned to a clustering approach, which classified genes into four major coexpression clusters in the E series and eight major coexpression clusters in the R series (the same analysis was also performed on the D series; SI Appendix, Fig. S5.1 and Datasets S17–S28; cluster sizes in SI Appendix, Table S5). All the E clusters and five of the eight R clusters appear to be associated with specific phases of development or regeneration (Fig. 5C). In the embryonic leg data, genes in cluster E2 are expressed predominantly in the early phases of leg development, genes in E4 and E1 in mid phases, and genes in E3 in the late phase (Fig. 5C, Left). In the regenerating leg data, genes in cluster R4 are expressed early, R1 in early–mid phases, R8 in mid–late phases, and R6 and R2 are associated with differentiated legs (both pre- and post-regeneration). The remaining three R clusters (R3, R5, and R7) do not show a consistent temporal signal (Fig. 5C, Right).
Having identified clusters of coexpressed genes in the embryonic and regenerative time series, we systematically compared the gene content of these clusters to determine whether similar sets of genes are coexpressed during development and regeneration. We measured their overlap in terms of enrichment (fold change) relative to random expectation (see Methods). The largest overlaps are observed between the early developmental cluster E2 and the mid regenerative clusters R1 and R8, and between the late developmental and regenerative clusters (E3, R2, and R6), which are associated with differentiated legs (Fig. 5E and SI Appendix, Fig. S5.4D and S5.5). Despite this enrichment, we find that these clusters show significant differences in gene content (Fig. 5F).
Examining the expression profiles of R cluster genes during leg development and E cluster genes during regeneration (SI Appendix, Fig. S5.3) did not yield any additional insights.
A finer classification of coexpressed genes yielded 32 and 37 clusters for E and R, respectively (SI Appendix, Fig. S5.9; Datasets S18, S20, S24 and S26). Similar overlaps in gene content were observed at this finer resolution (Fig. 5G). Despite these similarities, we did not detect a well-aligned temporal sequence of expression during development and regeneration (alternative strategies for temporally aligning the gene clusters were tested; see Fig. 5G and SI Appendix, Fig. S5.9 and S5.10).
Comparing the Deployment of Specific Functional Categories of Genes.
To associate the identified clusters with biological functions, we performed a gene ontology (GO) enrichment analysis on each cluster (SI Appendix, Fig. S5.7 and S5.8) and we categorized the enriched GO terms based on processes that contribute to leg development and regeneration (see Methods; Fig. 5D, and SI Appendix, Fig. S5.7). In the embryonic leg data, the most noticeable feature is the strong enrichment of the early gene cluster with GO terms associated with cell proliferation (cluster E2), followed by phases enriched in patterning and morphogenesis (cluster E4), and cell differentiation (cluster E3) (Fig. 5D, Left). In the regenerating leg, we observe an initial phase associated with stress, wounding, immune responses, and cell death (cluster R4), followed by a phase associated with cell dedifferentiation, cell proliferation, patterning, and morphogenesis (cluster R8), and then a phase associated with cell differentiation (cluster R6); a gene cluster that is associated with TOR signaling and growth is expressed throughout the process (cluster R5) (Fig. 5D, Right). This temporal sequence is in agreement with the regenerative phases identified by live imaging (19).
Overall, we observe that enriched GO terms have distinct temporal distributions in these two datasets. As expected, regeneration starts with a phase of wound healing (cluster R4) that is not represented in embryonic leg development. But notably, embryonic legs express genes associated with cell proliferation and patterning/morphogenesis in distinct phases (clusters E2 and E4), whereas in leg regeneration these processes occur simultaneously (cluster R8).
Taking a more targeted approach, we also examined the temporal profiles of specific genes that are likely to be involved in immunity, cell proliferation, leg patterning, and cell differentiation (Fig. 6; gene list in SI Appendix, Table S6). The genes associated with immunity, cell proliferation, and patterning were selected based on published information (particularly on Drosophila orthologs, see SI Appendix, Supplementary Methods) and on the GO term analysis; genes associated with differentiated neurons and muscles were identified from Parhyale single-cell transcriptomic data (17).
Fig. 6.

Immunity-associated genes are markedly up-regulated during the early phases of regeneration, following wounding (Fig. 6A and SI Appendix, Fig. S6.4B). The same genes are expressed only in the later phases of embryonic development, possibly connected to the differentiation of circulating hemocytes (Fig. 6A and SI Appendix, Fig. S6.1A and S6.3).
The expression profiles of genes expressed in the G2/M phase of the cell cycle indicate that cell proliferation occurs mainly during the early phase of leg embryogenesis, and the mid–late phases of regeneration (Fig. 6B and SI Appendix, Fig. S6.1B, S6.3, S6.4). This is consistent with data from live imaging (19) (SI Appendix, Fig. S6.5) and with the GO enrichment analysis (Fig. 5D).
Leg patterning genes were predominantly expressed in the mid–late phases of leg development (from 120 hpf onward), after the down-regulation of genes associated with cell proliferation (Fig. 6C and SI Appendix, Fig. S6.1, S6.2, and S6.3). During leg regeneration, this set of genes is predominantly expressed during later phases, overlapping with the expression of genes associated with cell proliferation (Fig. 6C and SI Appendix, Fig. S6.2 and S6.4).
Finally, genes associated with differentiated muscles and nerve cells are strongly expressed during the late phases of leg development in the embryo and in fully differentiated adult legs before and after regeneration (Fig. 6D and E and SI Appendix, Fig. S6.1, S6.3 and S6.4). They are down-regulated during the course of regeneration, possibly reflecting neuron and muscle dedifferentiation during these stages.
Discussion
Comparing the transcriptional profiles of developing and regenerating legs has allowed us to probe whether the process of leg regeneration recapitulates parts of embryonic development, in terms of transcriptional dynamics, on a global, genome-wide scale.
We find that the transcriptional dynamics of leg development are stereotypic and highly reproducible across individuals (Fig. 1). The developmental stage of a leg can be predicted from the transcriptome to within ∼8 h of developmental time (Fig. 1C). In contrast, the transcriptional dynamics of leg regeneration are embedded within strong inter-individual variation (Fig. 2), which is largely driven by the molting cycle (Fig. 3B). This is consistent with the fact that, unlike development which occurs in the relatively stable environment of the egg, regeneration takes place in the context of complex physiology in the adult. Even after correcting for inter-individual variation, through Bayesian modeling, regenerating legs show less stereotypic temporal profiles of expression than developing embryonic legs (compare Figs. 1C and 4C, and expression profiles in Fig. 6).
After filtering out inter-individual variation, we recover more clearly the distinct phases of gene expression that unfold during regeneration. Using GO term enrichment analysis we can assign putative gene functions to each of these phases. This analysis reveals distinct phases for wound healing, metabolic reprogramming (during a period previously described as a phase of quiescence) (19), cell proliferation, morphogenesis, and finally cell differentiation (Figs. 5C and 6).
We have tried to relate these phases of leg regeneration to the time course of leg development by comparing global transcriptional dynamics (Fig. 5A and B), sets of coexpressed genes (Fig. 5C–G and SI Appendix, Fig. S5.3 and S5.10), and the transcriptional dynamics of genes involved in cell proliferation, patterning, and cell differentiation (Fig. 6B–E and SI Appendix, Fig. S6.1–4). While we observe that overlapping gene sets are implicated in both leg development and regeneration, we find that the temporal order in which they are deployed is not the same. This is true not only in phases and processes that are likely to be unique to regeneration—e.g., wound healing, immune/stress responses, metabolic reprogramming—but also in processes like cell proliferation, patterning, and morphogenesis, which are shared between development and regeneration. We conclude that the time course of leg regeneration is not collinear with that of leg development.
A similar approach has been used recently to compare the transcriptional dynamics of development and regeneration in the sea anemone Nematostella vectensis (26) and in the zebrafish heart (27). Notwithstanding differences in experimental design (e.g., sample pooling masking inter-individual variation), the results of these studies echo some of the conclusions we present here. Similar to what we observe in crustacean legs, both in the body of Nematostella and in the zebrafish heart, the transcriptional dynamics of development are more pronounced than the dynamics of regeneration (e.g., compare Fig. 5A with figure 1D in ref. 26; Fig. 1D) and the comparison of transcriptional dynamics revealed both shared and divergent patterns of gene deployment over time.
Our analysis does not exclude that a core set of regulatory genes could coordinate leg development and regeneration in similar ways. For example, patterning mechanisms in development and regeneration could share common regulators and regulatory interactions, as suggested by previous studies (8, 9). Our work highlights, however, that despite any shared underlying regulators, the global transcriptional dynamics of development and regeneration are largely distinct. The similar results obtained in distant species—cnidarians, crustaceans, and fish—start to build a coherent picture in which regeneration is not a straightforward replay of development, but deploys some common modules in a different overall framework.
Methods
RNA-Seq Design and Sequencing.
Embryonic dataset.
Parhyale females of the Chicago-F inbred line (28) were collected after fertilization, and their embryos removed about 3 d post fertilization. Each brood was kept separately, in a temperature-controlled incubator set to 27 °C, in sterile six-well plates (Costar, #3516), in filtered artificial seawater (FASW) (salinity at 30 practical salinity units [PSU]) containing antibiotics (Gibco, #15240-062, at 1× final concentration). Embryos were staged 3 to 4 d post fertilization. Two series of single leg samples were collected from these embryos at 6-h time intervals. The Ef series consisted of entire developing T4 legs, collected from 96 hpf to 192 hpf (40 samples in total). These samples included the primordia of the basis, the ischium, the merus, the carpus, the propodus, the dactylus, and the coxal plates and gills. The Ed series of samples included only the distal-most part of the leg: the prospective merus, carpus, propodus, and dactylus from 120 to 138 hpf, when these podomeres cannot yet be morphologically distinguished (9 samples), and the developing carpus, propodus, and dactylus from 144 to 192 hpf (21 samples). At least 2 to 3 samples were collected for each time point per series. Ef and Ed samples were collected from contralateral T4 legs of the same individuals.
Embryos were dissected in the lid of a 5-mL Protein LoBind Tube (Eppendorf, #0030108302), in 1% bovine serum albumin (BSA) in FASW (80 μL). The eggshell was removed with fine forceps (Fine Science Tools, #11254-20), and legs were dissected with borosilicate needles (pulled capillaries; Sutter, #B100-50-15). Samples were transferred in 100 μL of ice-cold lysis solution (Agilent Absolutely RNA Nanoprep Kit, #400753), homogenized though brief pipetting, and flash frozen in liquid nitrogen. RNA extraction was performed with the Agilent Absolutely RNA Nanoprep Kit (#400753), following manufacturer’s instructions, and eluted in 10 μL of elution buffer. RNA extraction was performed in randomized batches to avoid shared batch effects in biological replicates. As the concentrations of RNA was too low to be directly quantified in these extracts, samples were treated as follows: 9 μL from each sample was directly used for cDNA amplification over 15 cycles of LD-PCR, using the SMART-Seq v4 Ultra Low Input RNA Kit for sequencing (Takara Bio, #634898) and the SeqAmp DNA Polymerase (Takara Bio, #638509); 1 μL of cDNA was then used for Qubit quantification (4.0 HS DNA), measuring in the range of 0.2 to 0.7 ng/μL. Libraries were synthesized from 1 ng of cDNA, using the Nextera XT DNA Library Preparation Kit (Illumina, #FC-131-1096; with dual indexing strategy, i7 and i5), and with a protocol that included an accelerated cooling step on ice after the 55 °C step. Quantification and validation of libraries were done with both Qubit 4.0 (HS DNA Kit, Thermo Fisher) and Tapestation 2200 equipment using the D5000 ScreenTape System (Agilent, #5067-5588 and #5067-5589). QC libraries were normalized and then loaded into an Illumina NextSeq 500 sequencing system using NextSeq 500 High Output Kit with 76-bp single-end sequencing, according to the manufacturer’s instructions (Illumina). Further details about the sequenced samples are provided in SI Appendix, Table S1. Sequencing results can be found in Datasets S3 and S4 (counts and tpm values respectively). Raw sequencing data are deposited at Gene Expression Omnibus (GEO), accession no. GSE196485.
Regeneration dataset.
Adult males of the Chicago-F line (28) and the DistalDsRed line (24), measuring ∼1 cm in length and with no damaged appendages, were selected and kept individually in homogeneous conditions—including photoperiod (12:12 h light:dark cycle), temperature (25 to 26 °C), and medium (individual containers separated by mesh, sharing artificial seawater at 30 PSU)— for 3 mo prior to experiment. The same conditions were kept during the course of sampling. Sampling was performed between 8:00 and 10:00 AM. In order to test for the inter- and intraindividual variability of the regenerative process, both T4 legs of each animal were amputated simultaneously, proximally to the carpus/propodus joint. Samples from the Chicago-F line were harvested pre- and post-regeneration, and every 12 h, from 0 to 120 hpa; samples from the DistalDsRed line were collected when the DsRed signal became visible (around 150 hpa). Samples from the same individual were collected at the same moment. Due to the observed variability in the regenerative sequence, samples were processed and sequenced individually, as follows: 1) From each animal, either both or only one T4 leg was harvested; and 2) from each leg, two fragments were isolated, one corresponding to the regenerating podomere(s) (Rd series, localized on the carpus) and one to its proximal control podomere (Rp series, localized on the distal part of the merus from the same leg). Five paired samples were collected per time point, with the exception of the DistalDsRed line and postmolt samples (2 samples each). In total, 60 regenerating and 60 control samples were collected; a scheme is provided in Fig. 2A, and more details about the samples are provided in SI Appendix, Table S1.
Leg fragments were immediately transferred in 1.5-mL LowBind tubes with 500 μL of ice-cold lysis buffer (Reliaprep RNA Tissue MiniPrep System, Promega, #Z6111), vortexed, and then transferred to a sterile multiwell plate for manual disruption of the cuticle with a clean surgical blade. The sample was then retransferred to the tube, vortexed, and frozen in liquid nitrogen, and then stored at −80 °C until further processing. RNA extraction was randomized, to avoid processing related samples at the same time. RNA was extracted with the Reliaprep RNA Tissue MiniPrep System (Promega, #Z6111) and eluted in 15 μL of nuclease-free water (Invitrogen, #AM9937). RNA quality was assessed with TapeStation 2200 (Agilent RNA ScreenTape High Sensitivity System: #5067-5579, #5067-5580, and #5067-5581), and 1 ng of RNA was used for cDNA synthesis (SMART-Seq v4 Ultra Low Input RNA Kit; Takara Bio, #634898). Verification of library quality and sequencing were done as for the embryonic dataset (see above). Sequencing results can be found in Datasets S7 and S8 (counts and tpm values respectively). Raw sequencing data are deposited at GEO, accession no. GSE196485.
Molting dataset.
Sixty-six Chicago-F animals, selected as previously described, were individually kept and monitored over two successive molts (∼3), in order to determine their molting cycles: We observed that this cohort molted with a mean period of 27 d (SD 7.2). One entire T4 leg was harvested per animal, and RNA was extracted as described above. For the premolt samples, individual animals were monitored for molting every day after harvesting the leg; based on the time of molting, the harvested legs were assigned to one of the premolt categories (1-2, 3, 4, or 5 d before molting). The postmolt samples were collected 1, 2, 9, or 10 d post molting. A total of 20 legs derived from animals at different stages of their molting cycle was sequenced. Library preparation and sequencing were done as described above. Details about the sequenced samples are provided in SI Appendix, Table S1; sequencing results can be found in Datasets S9 and S10 (counts and tpm values respectively). Raw sequencing data are deposited at GEO, accession no. GSE196485.
Adult entire leg dataset.
In order to help build a new reference transcriptome (see below), two additional full T4 leg samples from nonregenerating Chicago-F males were collected and processed as described above. Details about the sequenced samples are provided in SI Appendix, Table S1; sequencing results can be found in Datasets S7 and S8 (counts and tpm values, respectively). Raw sequencing data are deposited at GEO, accession number GSE196485.
Reference Transcriptome Assembly.
Transcriptome assembly and annotation.
Sequenced reads were mapped to a modified version of the available P. hawaiensis genome assembly Phaw_5.0 (https://www.ncbi.nlm.nih.gov/assembly/GCA_001587735.2/, see SI Appendix, Methods), using hisat2 v2.1.0. Gene models were built from the RNA-seq data generated in this study, combined with a previous gene annotation; overlapping genes were removed and split genes were identified based on sequence similarity with PacBio long reads (SI Appendix, Methods). After a final manual curation step, the final gtf file contains 54,718 genes (Dataset S1) and the final list used for further analysis includes 52,759 genes (Dataset S2). Orthology annotation was performed using blastp (results are given in Dataset S29). See SI Appendix, Methods for more details.
Analyses of the RNA-Seq Datasets.
Read mapping and quantification.
For all RNA-seq datasets, reads were mapped to the 54,718 gene models (see above), using kallisto v. 0.42.5 (29). Counts and transcripts per million (tpm) values are provided in Datasets S3, S4, and S7–S10. See SI Appendix, Methods for further details.
Genes for which more than two reads mapped on average on each sample of the embryonic dataset or the regeneration dataset, were kept for further analysis (43,212 and 43,968 genes for the embryogenesis and regeneration datasets, respectively).
Time series analysis.
Rather than comparing gene expression among specific time points, our strategy was to use time as a continuous variable. Thus, rather than binning samples on discrete time points and considering these as replicates, we investigated temporal changes in gene expression by comparing individual samples over a continuous time course. To show that sampling density across the developing and the regenerating leg time course is sufficient for a robust analysis of transcriptional dynamics, we used a subsampling approach (SI Appendix, Methods); the results are presented in SI Appendix, Fig. S5.11.
Normalization and visualization of transcriptional dynamics.
Counts and tpm matrices were first quantile normalized (limma R package, v. 3.48.0) (30), then log transformed [log(x+1)]. The JAGS-transformed values were log transformed. Details on principal component analysis and heatmaps are provided in SI Appendix, Methods.
Differential expression analysis.
We used the R package DESeq2 (31) on raw counts for identifying genes differentially expressed during embryogenesis —separately in the Ef and the Ed series— and using hpf as the explanatory variable. Genes with a p.adj <0.001 were selected (Datasets S5 and S6). In order to determine the list of molting-related genes (which we used at different steps to assess the efficiency of the removal of the physiological signal from the R values, SI Appendix, Fig. S4.1B), we also applied DESeq2: Molting-related and unrelated genes were identified as genes significantly differentially expressed between time windows “1 to 2 d before molt” and “9 to 10 d after molt” (p.adj <0.001) while genes unrelated to molting were not differentially expressed (p.adj >0.5). The results of the DESeq2 analysis are provided at https://zenodo.org/record/6420694/files/DESeq2_lresO_molt.gz, as an R software object (rds format).
Identification of coexpressed gene sets.
We applied a soft clustering approach, using the R package Mfuzz (v. 2.52.0) (32), setting SD = 0.2, and membership = 0.8 for calculating the eset object. The optimal number of clusters was estimated from the inflection points of the Dmin function (iterations = 100). In order to plot expression dynamics, we extracted the cluster centroid values, which were used to build an input matrix for the heatmaps. For the molting cycle dataset, we considered the entire transcriptome; we found that 15,646 genes were assigned to clusters, identifying eight coexpressed gene sets (Fig. 3C, SI Appendix, Fig. S3.1, and Dataset S11). For the rest of the analysis, we considered the union of the 20,000 most variable genes within each E, D, and R dataset [var function of base R], which resulted in 27,709 genes (SI Appendix, Fig. S4.4 and Dataset S16). We identified 8 (19,731 genes were clustered) or 37 (15,665 genes) clusters for the R dataset, 4 (20,014 genes) or 32 (15,529 genes) for the E dataset, and 4 (17,033 genes) or 12 (14,883 genes) for the D dataset. Data are available in SI Appendix, Fig. S5.1 and Datasets S17–28.
Comparison of clusters.
Fold enrichment scores were computed as follows: We took an approach inspired from ref. 33, where a hypergeometric distribution on gene counts is used to estimate an enrichment score between gene sets. The enrichment fold was calculated as the ratio of the observed number of genes that overlapped between two clusters over the expected number. The overlap between clusters was further assessed with chord diagrams (Fig. 5F) using the circlize package (v. 0.4.13) (34), and Venn diagrams (SI Appendix, Figs. S4.4, S5.5) using the RVenn package (v. 1.1.0) (35).
The clustree package (v. 0.4.3) (36) was used to assess the correspondence between the clusters (SI Appendix, Figs. S5.6 and S5.9).
GO enrichment analysis.
Enriched GO terms were identified using the packages clusterProfiler (v. 4.0.0) (37), org.Dm.eg.db (v. 3.13.0) (38), and enrichplot (v. 1.12.0) (39), based on the GO terms assigned to the best blastp hit of each Parhyale gene in Drosophila (for 14,741 Parhyale genes, e-value <0.001). The parameters used for the GO term analysis were: ont = BP, pvalueCutoff = 0.05, and qvalueCutoff = 0.05, minGSSize = 4. Results are presented in Fig. 5D and SI Appendix, Figs. S3.3, S5.7, and S5.8 (dotplots for biological process, categories to display = 50). The list of significant GO terms (P value <0.005) was further trimmed for display (Fig. 5D and SI Appendix, Fig. S5.7) using the Revigo algorithm (revigo.irb.hr/) (40). We set the following parameters: allowed similarity = tiny, semantic similarity measure = SimRel.
Bayesian Modeling of Regenerating and Embryonic Datasets.
Computations were performed using JAGS via the R package rjags (25). Details on modeling are provided in SI Appendix, Methods. The resulting R, E, and D transformed values are provided in Datasets S13–S15 (values have been further log transformed).
Predicting Regeneration and Developmental Stages Using RAPToR.
In order to infer the progression of regeneration or development based on transcriptomic data, we used the R package RAPToR v1.1.4 (Real Age Prediction from Transcriptome staging on Reference), a recently developed tool to accurately predict individual samples’ developmental age from their gene expression profiles (22).
For building RAPToR references, we used the function ge_im with the formula “X ∼ s(hpa, bs = ‘ts’)” and parameter dim_red=“pca”. For the RAPToR reference based on Ef values for Fig. 1C, we used the normalized and log-transformed tpm values and selected the genes variable in the Ef samples (the top 20,000 variable genes as calculated by DESeq2), excluding the genes with a low expression (75th percentile count above 10, with a final set of 16,199 genes) and 30 principal components (values in SI Appendix, Table S2). To build the RAPToR reference based on Rd values for Fig. 2F, we used the top 20,000 most variable genes in samples Rd as calculated by DESeq2, excluding the genes with a low expression (75th percentile count above 10, with a final set of 12,438 genes). As regeneration is less synchronous than embryonic development (19), we were concerned that the sampling timing would not faithfully reflect regeneration progression and that our samples were an imperfect reference. To avoid overfitting, we made two changes to the standard RAPToR protocol for building a reference: We used only three PCs, and for each sample we built a separate reference excluding the sample being tested. Pre-amputation samples were given an arbitrary timing of 300 hpa that would place them at the end of the regenerative sequence, and postmolt samples were excluded from the reference (values are provided in SI Appendix, Table S3). To build the RAPToR references based on R values for Figs. 4C and 5B and SI Appendix, Figs. S5.2–4, S5.9, S5.10, S6.1, and S6.2, we used the 20,000 top genes with the most variable R values, further excluding the genes with a low expression (75th percentile count above 10, for a final set of 12,434 genes). To avoid overfitting, we followed the same procedure described above (values in SI Appendix, Table S7 for R samples, SI Appendix, Table S8 for E samples, and SI Appendix, Table S9 for D samples). For the RAPToR reference based on E values for Fig. 5B, we used a gene list that was as exhaustive as possible, consisting of the union of the 20,000 top genes with the most variable R, E, or D values, and excluding the genes with a low expression (75th percentile count above 10, for a final set of 16,759 genes) and 10 PCs (values in SI Appendix, Table S4). For the RAPToR reference based on E values for Figs. 5C, 5.9C, and 6 and SI Appendix, Table S6.2A, we used the 20,000 top genes with the most variable E values, excluding the genes with a low expression (75th percentile count above 10, yielding a final set of 14,632 genes), and 3 PCs. Correlation between predicted time of amputation and real time of amputation (Figs. 2F and 4C) was computed excluding postmolt and t0 samples, the real timing of the former being too uncertain and the later being considered as an end point to regeneration.
Data Availability
Datasets S1 to S13, R code and input files are provided in https://zenodo.org/record/6420694 (41). Raw sequencing results are deposited at GEO, accession no. GSE196485. All other study data are included in the article and/or supporting information.
Acknowledgments
We thank Romain Bulteau for his help with RAPToR; Philippe Veber and Bastien Boussau for their help with rjargs; Nipam Patel for the Parhyale used for the Iso-seq experiment; the Salt Lake City sequencing facility for sequencing the Iso-seq libraries; and Enrique Arboleda, Pascale Roux, and Laetitia Lebre for technical support. We thank Margarida Cardoso, Nikos Konstantinides, and Jordi Casanova for critical feedback; and an anonymous reviewer for detailed and thoughtful suggestions on the data analysis. This work was funded by the European Research Council, under the European Union Horizon 2020 programme (project ERC-2015-AdG #694918). A.A. was supported by the Marie Curie training network ‘EvoCell’, under the European Union Horizon 2020 programme (project H2020-MSCA-ITN-2017 #766053).
Supporting Information
Appendix 01 (PDF)
- Download
- 11.48 MB
References
1
D. M. Skinner, “Molting and regeneration.” in The Biology of Crustacea: Integument, Pigments, and Hormonal Processes, D. E. Bliss, L. H. Mantel, Eds. (Academic Press, 1985) pp. 43–146.
2
M. Singer, The influence of the nerve in regeneration of the amphibian extremity. Q. Rev. Biol. 27, 169–200 (1952).
3
A. Kumar, J. W. Godwin, P. B. Gates, A. A. Garza-Garcia, J. P. Brockes, Molecular basis for the nerve dependence of limb regeneration in an adult vertebrate. Science 318, 772–777 (2007).
4
N. Kyritsis et al., Acute inflammation initiates the regenerative response in the adult zebrafish brain. Science 338, 1353–1356 (2012).
5
J. W. Godwin, A. R. Pinto, N. A. Rosenthal, Macrophages are required for adult salamander limb regeneration. Proc. Natl. Acad. Sci. U.S.A. 110, 9415–9420 (2013).
6
C. Sinigaglia, M. Averof, The multifaceted role of nerves in animal regeneration. Curr. Opin. Genet. Dev. 57, 98–105 (2019).
7
J. P. Brockes, A. Kumar, Appendage regeneration in adult vertebrates and implications for regenerative medicine. Science 310, 1919–1923 (2005).
8
E. Nacu, E. M. Tanaka, Limb regeneration: A new development? Annu. Rev. Cell Dev. Biol. 27, 409–440 (2011).
9
K. Muneoka, S. V. Bryant, Evidence that patterning mechanisms in developing and regenerating limbs are the same. Nature 298, 369–371 (1982).
10
C.-M. Fan, L. Li, M. E. Rozo, C. Lepper, Making skeletal muscle from progenitor and stem cells: Development versus regeneration. Wiley Interdiscip. Rev. Dev. Biol. 1, 315–327 (2012).
11
A. Czarkwiani, D. V. Dylus, L. Carballo, P. Oliveri, FGF signalling plays similar roles in development and regeneration of the skeleton in the brittle star Amphiura filiformis. Development 148, dev180760 (2021).
12
M. Bosch, S.-A. Bishop, J. Baguña, J.-P. Couso, Leg regeneration in Drosophila abridges the normal developmental program. Int. J. Dev. Biol. 54, 1241–1250 (2010).
13
K. Roensch, A. Tazaki, O. Chara, E. M. Tanaka, Progressive specification rather than intercalation of segments during limb regeneration. Science 342, 1375–1379 (2013).
14
T. Gerber et al., Single-cell analysis uncovers convergence of cell identities during axolotl limb regeneration. Science 362, 421 (2018).
15
M. A. Storer et al., Acquisition of a unique mesenchymal precursor-like blastema state underlies successful adult mammalian digit tip regeneration. Dev. Cell 52, 509–524.e9 (2020).
16
C. Aztekin et al., Secreted inhibitors drive the loss of regeneration competence in Xenopus limbs. Development 148, dev199158 https://doi.org/10.1242/dev.199158. (2021).
17
A. Almazán, Ç. Çevrim, J. M. Musser, M. Averof, M. Paris, Regenerated crustacean limbs are precise replicas. bioRxiv https://doi.org/10.1101/2021.12.13.472338 (2021).
18
M. Paris, C. Wolff, N. Patel, M. Averof, The crustacean model Parhyale hawaiensis. Curr Top Dev Biol 147: 199-230 (2022).
19
F. Alwes, C. Enjolras, M. Averof, Live imaging reveals the progenitors and cell dynamics of limb regeneration. eLife 5, 73 (2016).
20
C. Wolff et al., Multi-view light-sheet imaging and tracking with the MaMuT software reveals the cell lineage of a direct developing arthropod limb. eLife 7, e34410 (2018).
21
W. E. Browne, A. L. Price, M. Gerberding, N. H. Patel, Stages of embryonic development in the amphipod crustacean Parhyale hawaiensis. Genesis 42, 124–149 (2005).
22
R. Bulteau, M. Francesconi, Real age prediction from the transcriptome with RAPToR. bioRxiv https://doi.org/10.1101/2021.09.07.459270 (2021).
23
N. Konstantinides, M. Averof, A common cellular basis for muscle regeneration in arthropods and vertebrates. Science 343, 788–791 (2014).
24
Z. Kontarakis et al., A versatile strategy for gene trapping and trap conversion in emerging model organisms. Development 138, 2625–2630 (2011).
25
M. Plummer, JAGS: A Program for Analysis of Bayesian Graphical Models Using Gibbs Sampling, Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003), March 20–22, Vienna, Austria. ISSN 1609-395X (2003).
26
H. Johnston et al., Whole body regeneration deploys a rewired embryonic gene regulatory network logic. bioRxiv https://doi.org/10.1101/658930 (2021).
27
M. Weinberger, F. C. Simões, T. Sauka-Spengler, P. R. Riley, (2021). Distinct epicardial gene regulatory programmes drive development and regeneration of the zebrafish heart. bioRxiv 2021.06.29.450229.
28
D. Kao et al., The genome of the crustacean Parhyale hawaiensis, a model for animal development, regeneration, immunity and lignocellulose digestion. eLife 5, e20062 (2016).
29
N. L. Bray, H. Pimentel, P. Melsted, L. Pachter, Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
30
M. E. Ritchie et al., limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
31
M. I. Love, W. Huber, S. Anders, Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
32
L. Kumar, M. E Futschik, Mfuzz: A software package for soft clustering of microarray data. Bioinformation 2, 5–7 (2007).
33
M. Kowarsky et al., Sexual and asexual development: Two distinct programs producing the same tunicate. Cell Rep. 34, 108681 (2021).
34
Z. Gu, L. Gu, R. Eils, M. Schlesner, B. Brors, Circlize implements and enhances circular visualization in R. Bioinformatics 30, 2811–2812 (2014).
35
T. G. Akyol, RVenn: Set Operations for Many Sets. R package version 1.1.0. https://CRAN.R-project.org/package=RVenn (2019).
36
L. Zappia, A. Oshlack, Clustering trees: A visualization for evaluating clusterings at multiple resolutions. Gigascience 7, giy083 (2018).
37
G. Yu, L. G. Wang, Y. Han, Q. Y. He, clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
38
M. Carlson, org.Dm.eg.db: Genome wide annotation for Fly. R package version 3.8.2 (2019).
39
G. Yu, enrichplot: Visualization of Functional Enrichment Result. R package version 1.12.0, https://yulab-smu.top/biomedical-knowledge-mining-book (2021).
40
F. Supek, M. Bošnjak, N. Škunca, T. Šmuc, REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One 6, e21800 (2011).
41
C. Sinigaglia et al., Supplementary information for ‘Distinct gene expression dynamics in developing and regenerating crustacean limbs’, Zenodo doi https://doi.org/10.5281/zenodo.6420682 (2022).
Information & Authors
Information
Published in
Classifications
Copyright
Copyright © 2022 the Author(s). Published by PNAS. This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).
Data Availability
Datasets S1 to S13, R code and input files are provided in https://zenodo.org/record/6420694 (41). Raw sequencing results are deposited at GEO, accession no. GSE196485. All other study data are included in the article and/or supporting information.
Submission history
Received: October 27, 2021
Accepted: April 14, 2022
Published online: July 1, 2022
Published in issue: July 5, 2022
Keywords
Acknowledgments
We thank Romain Bulteau for his help with RAPToR; Philippe Veber and Bastien Boussau for their help with rjargs; Nipam Patel for the Parhyale used for the Iso-seq experiment; the Salt Lake City sequencing facility for sequencing the Iso-seq libraries; and Enrique Arboleda, Pascale Roux, and Laetitia Lebre for technical support. We thank Margarida Cardoso, Nikos Konstantinides, and Jordi Casanova for critical feedback; and an anonymous reviewer for detailed and thoughtful suggestions on the data analysis. This work was funded by the European Research Council, under the European Union Horizon 2020 programme (project ERC-2015-AdG #694918). A.A. was supported by the Marie Curie training network ‘EvoCell’, under the European Union Horizon 2020 programme (project H2020-MSCA-ITN-2017 #766053).
Notes
This article is a PNAS Direct Submission.
Authors
Competing Interests
The authors declare no competing interest.
Metrics & Citations
Metrics
Altmetrics
Citations
Cite this article
Distinct gene expression dynamics in developing and regenerating crustacean limbs, Proc. Natl. Acad. Sci. U.S.A.
119 (27) e2119297119,
https://doi.org/10.1073/pnas.2119297119
(2022).
Copied!
Copying failed.
Export the article citation data by selecting a format from the list below and clicking Export.
Cited by
Loading...
View Options
View options
PDF format
Download this article as a PDF file
DOWNLOAD PDFLogin options
Check if you have access through your login credentials or your institution to get full access on this article.
Personal login Institutional LoginRecommend to a librarian
Recommend PNAS to a LibrarianPurchase options
Purchase this article to access the full text.