The growth of business firms: Theoretical framework and empirical evidence

  1. Dongfeng Fu*,,
  2. Fabio Pammolli*,,§,
  3. S. V. Buldyrev,
  4. Massimo Riccaboni,§,
  5. Kaushik Matia*,
  6. Kazuko Yamasaki, and
  7. H. Eugene Stanley*,
  1. *Center for Polymer Studies and Department of Physics, Boston University, Boston, MA 02215; Faculty of Economics, University of Florence, 50127 Florence, Italy; §IMT Institute for Advanced Studies, Via S. Micheletto 3, 55100 Lucca, Italy; Department of Physics, Yeshiva University, 500 West 185th Street, New York, NY 10033; and Tokyo University of Information Sciences, Chiba City 265-8501, Japan
  1. Contributed by H. Eugene Stanley, November 2, 2005

Abstract

We introduce a model of proportional growth to explain the distribution Pg(g) of business-firm growth rates. The model predicts that Pg(g) is exponential in the central part and depicts an asymptotic power-law behavior in the tails with an exponent ζ = 3. Because of data limitations, previous studies in this field have been focusing exclusively on the Laplace shape of the body of the distribution. In this article, we test the model at different levels of aggregation in the economy, from products to firms to countries, and we find that the predictions of the model agree with empirical growth distributions and size-variance relationships.

Gibrat (1, 2), building on the work of the astronomers Kapteyn and Uven (3), assumed the expected value of the growth rate of a business firm's size to be proportional to the current size of the firm, which is called the law of proportionate effect (4, 5). Several models of proportional growth have been subsequently introduced in economics to explain the growth of business firms (68). Simon and co-workers (912) extended Gibrat's model by introducing an entry process according to which the number of firms rise over time. In the framework of Simon and co-workers, the market consists of a sequence of many independent “opportunities” that arise over time, each of size unity. Models in this tradition have been challenged by many researchers (1317) who found that the firm-growth distribution is not Gaussian but displays a tent shape.

Here we introduce a general framework that provides a unifying explanation for the growth of business firms based on the number and size distribution of their elementary constituent components (1825). Specifically, we present a model of proportional growth in both the number of units and their size, and we draw some general implications on the mechanisms that sustain business-firm growth (7, 11, 21, 2628). According to the model, the probability density function (PDF) of growth rates is Laplace in the center (13) with power-law tails (29, 30) decaying as Pg(g) ∼ g –ζ, where ζ = 3.

Also, because of data limitations, previous studies in this field focus on the Laplace shape of the body of the distribution, which, however, is an unconditional object (31). Using a database on the size and growth of firms and products, we characterize the shape of the whole growth-rate distribution.

We test our model by analyzing different levels of aggregation of economic systems, from the “micro” level of products to the “macro” level of industrial sectors and national economies. We find that the model accurately predicts the shape of the PDF of growth rate at all levels of aggregation studied.

The Theoretical Framework

We model business firms as classes consisting of a random number of units. According to this view, a firm is represented as the aggregation of its constituent units such as divisions (22), businesses (20), or products (21). Accordingly, on a different level of coarse graining, a class can represent a national economy composed of economic units such as firms. In this article, we study the logarithm of the 1-year growth rate of classes g ≡ log[S(t + 1)/S(t)], where S(t) and S(t + 1) are the sizes of classes in the year t and t + 1 measured in monetary values [gross domestic product (GDP) for countries, sales for firms and products]. Our model is illustrated in Fig. 1. Two key sets of assumptions in the model are that the number of units in a class grows in proportion to the existing number of units (Assumptions 1–4) and the size of each unit fluctuates in proportion to its size (Assumptions 5 and 6).

Fig. 1.

Schematic representation of the model of proportional growth. At time t = 0, there are N(0) = 2 classes (□) and n(0) = 5 units (○) (Assumption 1). The area of each circle is proportional to the size ξ of the unit, and the size of each class is the sum of the areas of its constituent units (see Assumption 5). At the next time step, t = 1, a new unit is created (Assumption 2). With probability b the new unit is assigned to a new class (class 3 in this example) (Assumption 3). With probability 1 – b the new unit is assigned to an existing class with probability proportional to the number of units in the class (Assumption 4). In this example, a new unit is assigned to class 1 with probability 3/5 or to class 2 with probability 2/5. Finally, at each time step, every unit i grows or shrinks by a random factor ηi (Assumption 6).


Assumption 1. Each class α consists of K α(t) number of units. At time t = 0 (time step measured by year, generally), there are N(0) classes consisting of n(0) total number of units. The initial average number of units in a class is thus n(0)/N(0).

Assumption 2. At each time step, a new unit is created. Thus, the number of units at time t is n(t) = n(0) + t.

Assumption 3. With birth probability b, this new unit is assigned to a new class so that the average number of classes at time t is N(t) = N(0) + bt.

Assumption 4. With probability 1 – b, a new unit is assigned to an existing class α with probability P α = (1 – b) K α(t)/n(t), so K α(t + 1) = K α(t) + 1.

For simplicity, we do not consider the decrease of the number of units in a class. In reality, elementary units enter and exit. Because we are considering the case of a growing economy, it is legitimate to assume that the entry rate is higher than the exit rate. On the average, the net entry rate of units can be simplified as a positive constant. In the model, the net entry rate of units is fixed at 1. Thus, at large t, it gives results equivalent to the ones that would have been obtained when considering a value for the exit rate of units.

Our goal is to find P(K), the probability distribution of the number of units in the classes at large t. This model in two limiting cases, (i) b = 0, K α = 1 (α = 1, 2,... N(0)) and (ii) b ≠ 0, N(0) = 1, n(0) = 1, has exact analytical solutions P(K) = N(0)/t{t/[t + N(0)]}K [1 + O(1/t)] (32, 33) and limt →∞ P(K) = (1 + b′)Γ(K)Γ(2 + b′)/Γ (K + 2 + b′), where b′ = b/(1 – b) (34), respectively.

In the general case, the exact analytical solution is not known, and we obtain a numerical solution by computer simulations and compare it with the approximate mean-field solution (see, e.g., chapter 6 of ref. 35 and Appendix A).

Our results are consistent with the exactly solvable limiting cases as well as with the empirical data on the number of products in the pharmaceutical firms and can be summarized as follows. In the limit of large t, the distribution of K in the old classes that existed at t = 0 converges to an exponential distribution (36), Formula where λ = 1 – 1/K(t) and K(t) is the average number of units in the old classes at time t, K(t) = {[n(0) + t]/n(0)}1– b·n(0)b/N(0). The distribution of units in the new classes created at t > 0 converges to a power law with an exponential cutoff, Formula where φ = 2 + b/(1 – b) and f(K) decays for K → ∞ faster than P old(K). The distribution of units in all classes is given by Formula The mean-field approximation for P new(K) is given by Formula where K′ = K{n(0)/[n(0) + t]}1– b.

Assumption 5. At time t, each class α has K α(t) units of size ξi(t), i = 1,2,... K α(t), where K α and ξi > 0 are independent random variables taken from the distributions P(K α) and P ξi), respectively. P(K α) is defined by Eq. 3 and P ξi) is a given distribution with finite mean μξ and standard deviation σξ . We also assume that lnξi has finite mean m ξ = 〈lnξiand variance Formula. The size of a class is defined as Formula.

Assumption 6. At time t + 1, the size of each unit is decreased or increased by a random factor ηi(t) > 0 so that Formula where ηi(t) > 0, the growth rate of unit i, is an independent random variable taken from a distribution P ηi), which has a finite mean μη and standard deviation ση . We also assume that lnηi has finite mean m η ≡ 〈ln ηi 〉, and variance Formula.

The growth rate of each class is defined as Formula

Here we neglect the influx of the new units, so K α = K α(t + 1) = K α(t). The resulting distribution of the growth rates of all classes is determined by Formula where P(K) is the distribution of the number of units in the classes, computed in the previous stage of the model, and Pg(g|K) is the conditional distribution of growth rates of classes with a given number of units determined by the distribution P ξ(ξ) and P η(η).

The analytical solution of this model can be obtained only for certain limiting cases, but a numerical solution can be computed easily for any set of assumptions. We investigate the model numerically and analytically (see Appendix B) and find:

  1. The conditional distribution of the logarithmic growth rates Pg(g|K) for the firms consisting of a fixed number of units converges to a Gaussian distribution for K → ∞, Formula where V is a function of parameters of the distribution P ξ(ξ) and P η(η), and is the logarithm of mean growth rate of a unit, = ln μη. Thus, the width of this distribution decreases as Formula. This result is consistent with the observation that large firms with many production units fluctuate less than small firms (7, 18, 22, 37).

  2. For gV η, the distribution Pg(g) coincides with the distribution of the logarithms of the growth rates of the units: Formula In the case of power-law distribution P(K) ∼ K –φ, which dramatically increases for K → 1, the distribution Pg(g) is dominated by the growth rates of classes consisting of a single unit K = 1; thus, the distribution Pg(g) practically coincides with P η(ln ηi) for all g. Indeed, our empirical observations confirm this result.

  3. If the distribution P(K) ∼ K –φ, φ > 2 for K → ∞, as happens in the presence of the influx of new units b ≠ 0, Pg(g) = C 1C 2|g|2φ–3, for g → 0, which in the limiting case b → 0, φ → 2 gives the cusp Pg(g) ∼ C 1C 2|g| (C 1 and C 2 are positive constants), similar to the behavior of the Laplace distribution P L(g) ∼ exp(–|g|C 2) for g → 0.

  4. If the distribution P(K) weakly depends on K for K → 1, the distribution of Pg(g) can be approximated by a power law of g: Pg(g) ∼ |g|–3 in wide range Formula, where K(t) is the average number of units in a class. This case is realized for b = 0, t → ∞ when the distribution of P(K) is dominated by the exponential distribution and K(t) → ∞ as defined by Eq. 1. In this particular case, Pg(g) for Formula can be approximated by Formula

  5. In the case in which the distribution P(K) is not dominated by one-unit classes but for K → ∞ behaves as a power law, which is the result of the mean-field solution for our model when t → ∞, the resulting distribution Pg(g) has three regimes: Pg(g) ∼ C 1C 2|g|2φ–3 for small g, Pg(g) ∼ |g|–3 for intermediate g, and Pg(g) ∼ P(ln η) for g → ∞. The approximate solution of Pg(g) is obtained by using Eq. 8 for Pg(g|K) for finite K, mean-field solution Eq. 4 in the limit t → ∞ for P(K), and replacing summation by integration in Eq. 7: Formula For b ≠ 0 the integral above cannot be expressed in elementary functions. In the b → 0 case, Eq. 11 yields the main result Formula which combines the Laplace cusp for g → 0 and the power-law decay |g|–3 for g → ∞. Note that because of replacement of summation by integration in Eq. 7, the approximation Eq. 12 holds only for Formula.

In Fig. 2a we compare the distributions given by Eq. 10, the mean-field approximation Eq. 11 for b = 0.1, and Eq. 12 for b 3 0. We find that all three distributions have very similar tentshape behavior in the central part. In Fig. 2b we also compare the distribution Eq. 12 with its asymptotic behaviors for g → 0 (Laplace cusp) and g → ∞ (power law) and find the crossover region between these two regimes.

Fig. 2.

The growth rate PDF, Pg(g). (a) Comparison of three different approximations given by Eq. 10, mean field approximation Eq. 11 for b = 0.1 and Eq. 12. Each Pg(g) shows similar tent shape behavior in the central part. We see there is little difference between the three cases, b = 0 (no entry), b = 0.1 (with entry), and Eq. 12. This means that entry of new classes (b > 0) does not perceptibly change the shape of Pg(g). Note that we use K(t)/V = 2.16 for Eq. 10 and V = 1 for Eq. 12. (b) The crossover of Pg(g) given by Eq. 12 between the Laplace distribution in the center and power law in the tails. For small g, Pg(g) follows a Laplace distribution Pg(g) ∼ exp(– |g|), and for large g, Pg(g) asymptotically follows an inverse cubic power law Pg(g) ∼ g –3.


The Empirical Evidence

To test our model, we analyze different levels of aggregation of economic systems, from the micro level of products to the macro level of industrial sectors and national economies.

First, we analyze a database, the pharmaceutical industry database (PHID), that records sales figures of the 189,303 products commercialized by 7,184 pharmaceutical firms in 21 countries from 1994 to 2004, covering the whole size distribution for products and firms and monitoring the flows of entry and exit at both levels (kindly provided by the epris program). Then, we study the growth rates of all U.S. publicly traded firms from 1973 to 2004 in all industries, based on Security Exchange Commission filings (Compustat). Finally, at the macro level, we study the growth rates of the GDP of 195 countries from 1960 to 2004 (World Bank).

Fig. 3 shows that the growth distributions of countries, firms, and products are well fitted by the distribution in Eq. 12 with different values of Vg. Indeed, growth distributions at any level of aggregation depict marked departures from a Gaussian shape. Moreover, even if the Pg(g) of GDP can be approximated by a Laplace distribution, the Pg(g) of firms and products are clearly more leptokurtic than Laplace. Based on our model, the growth distribution is Laplace in the body, with power-law tails. In fact, Fig. 4 shows that the central body part of the growth-rate distributions at any level of aggregation is well approximated by a double-exponential fit. Fig. 5 reveals that the asymptotic behaviors of g at any level of aggregation can be well fitted by power law with an exponent ζ = 3.

Fig. 3.

Empirical tests of Eq. 12 for the PDF Pg(g) of growth rates rescaled by Formula. Shown are country GDP (○), pharmaceutical firms (□), manufacturing firms (⋄), and pharmaceutical products (▵). The shapes of Pg(g) for all four levels of aggregation are well approximated by the PDF predicted by the model (dashed lines). Dashed lines are obtained based on Eq. 12 with Vg ≈ 4 × 10–4 for GDP, Vg ≈ 0.014 for pharmaceutical firms, Vg≈ 0.019 for manufacturing firms, and Vg ≈ 0.01 for products. After rescaling, the four PDFs can be fit by the same function. For clarity, the pharmaceutical firms are offset by a factor of 102, manufacturing firms by a factor of 104, and the pharmaceutical products by a factor of 106. Note that the data for pharmaceutical products extend from Pg(g) = 1 to Pg(g) ≈ 10–4, and the mismatch in the tail parts is because Pg(g) for large g is mainly determined by the logarithmic growth rates of units ln η.


Fig. 4.

Empirical test of Eq. 12 for the central part in the PDF P(g) of growth rates rescaled by Formula. Shown are four symbols: country GDP (○), pharmaceutical firms (□), manufacturing firms (⋄), and pharmaceutical products (▵). The shape of central parts for all four levels of aggregation can be well fit by a Laplace distribution (dashed lines). Note that Laplace distribution can fit Pg(g) only over a restricted range, from Pg(g) = 1 to Pg(g) ≈ 10–1.


Fig. 5.

Empirical tests of Eq. 12 for the tail parts of the PDF of growth rates rescaled by Formula. The asymptotic behavior of g at any level of aggregation can be well approximated by power laws with exponents ζ ≈ 3 (dashed lines). The symbols are as follows: country GDP (left tail, ○; right tail, •), pharmaceutical firms (left tail, □; right tail, ▪), manufacturing firms (left tail, ⋄; right tail, ♦), and pharmaceutical products (left tail, ▵; right tail, ▴).


Our analysis in The Theoretical Framework predicts that the power-law regime of Pg(g) may vary depending on the behavior of P(K) for K → 1 and the distribution of the growth rates of units. In the case of the PHID, for which P(1) ≫ P(2) ≫ P(3)..., the growth-rate distribution of firms must be almost the same as the growth-rate distribution of products (as we stated in The Theoretical Framework). Hence, the power-law wings of Pg(g) for firms originate at the level of products. Because the PHID does not contain information on the subunits of products, we cannot test our prediction directly, but we can hypothesize that the distribution of the product subunits (number of customers or shipping ways) is less dominated by small K but has a sufficiently wide power-law regime because of the influx of new products. These rather plausible assumptions are sufficient to explain the shape of the distribution of the product growth rates, which is well described by Eq. 12.

The PHID allows us to test the empirical conditional distribution Pg(g|K) and the dependence of its variance σ2 on K, where K is the number of products. We find that σ ∼ K –0.28, which is significantly smaller than Formula behavior. This result does not imply correlations among product growth rates on the firm level (21) but can be explained by the fact that for skewed distributions of product sizes P ξ(ξ) characterized by large V ξ, the convergence of Pg(g|K) to its Gaussian limit Eq. 8 is slow and the growth rates of the firms are determined by the growth of the few large products. Using the empirical values for the PHID μξ = 3.44, V ξ = 5.13, μη = 0.016, V η = 0.36 and assuming log-normality of the distributions P ξ(ξ) and P η(η), we find that the behavior of σ can be well approximated by a power law σ ∼ K –0.20 for K < 103. For this set of parameters, the convergence of Pg(g|K) to a Gaussian distribution takes place only for K > 105. This result is consistent with the observations of the power-law relationship between firm size and growth-rate variance reported earlier (13, 18, 19, 38).

Discussion

Business firms grow in scale and scope. The scope of a firm is given by the number of its products. The scale of a firm is given by the size of its products. A firm such as Microsoft gets few big products, whereas Amazon sells a huge variety of goods, each of small size in terms of sales. In this article we argue that both mechanisms of growth are proportional. The number of products that a firm can launch successfully is proportional to the number of products that it has already commercialized. Once a product has been launched, its success depends on the number of customers who buy it and the price they are willing to pay. To a large extent, if products are different enough, the success of a product is independent from other products commercialized by the same company. Hence, the sales of products can be modeled as independent stochastic processes. Moreover, sometimes, new products are commercialized by new companies. As a result, small companies with few products can experience sudden jerks of growth resulting from the successful launch of a new product.

In this article, we find that the empirical distribution of firm growth rates exhibits a central part that is distributed according to a Laplace distribution and power-law wings Pg(g) ∼ g –ζ, where ζ = 3. If the distribution of number of units K is dominated by single-unit classes, the tails of firm growth distribution are primarily due to smaller firms composed of one or few products. The Laplace center of the distribution is shaped by big multiproduct firms. We find that the shape of the distribution of firm growth is almost the same in the presence of a small entry rate and with zero entry. We also find that the predictions of the model are accurate in the case of product growth rates, which implies that products can be considered as composed of elementary sale units, which evolve according to a random multiplicative process (6). Although there are several plausible explanations for the Laplace body of the distribution, which can be considered as an unconditional object (18, 31), the power-law decay of the tails has not been observed previously. We introduce a simple and general model that accounts for both the central part and the tails of the distribution. The shape of the business growth-rate distribution is due to the proportional growth of both the number and the size of the constituent units in the class. This result holds in the case of an open economy (with entry of new firms) as well as in the case of a closed economy (with no entry of new firms).

Acknowledgments

We thank S. Havlin, J. Nagler, and F. Wang for helpful discussions and suggestions. We thank the National Science Foundation and Merck Foundation (epris program) for financial support.

Footnotes

  • To whom correspondence may be addressed. E-mail: dffu{at}bu.edu or hes{at}bu.edu.

  • Author contributions: F.P., S.V.B., and H.E.S. designed research; D.F., F.P., S.V.B., M.R., K.M., K.Y., and H.E.S. performed research; S.V.B. contributed new reagents/analytic tools; D.F., S.V.B., M.R., K.M., and K.Y. analyzed data; and D.F., F.P., S.V.B., M.R., and K.M. wrote the paper.

  • Conflict of interest statement: No conflicts declared.

  • This paper was submitted directly (Track II) to the PNAS office.

  • Abbreviations: PDF, probability density function; PHID, pharmaceutical industry database; GDP, gross domestic product.

References

« Previous | Next Article »Table of Contents
From the Cover