A Poissonian explanation for heavy tails in e-mail communication
Edited by Steven Strogatz, Cornell University, Ithaca, NY, and accepted by the Editorial Board September 3, 2008
Abstract
Patterns of deliberate human activity and behavior are of utmost importance in areas as diverse as disease spread, resource allocation, and emergency response. Because of its widespread availability and use, e-mail correspondence provides an attractive proxy for studying human activity. Recently, it was reported that the probability density for the inter-event time τ between consecutively sent e-mails decays asymptotically as τ−α, with α ≈ 1. The slower-than-exponential decay of the inter-event time distribution suggests that deliberate human activity is inherently non-Poissonian. Here, we demonstrate that the approximate power-law scaling of the inter-event time distribution is a consequence of circadian and weekly cycles of human activity. We propose a cascading nonhomogeneous Poisson process that explicitly integrates these periodic patterns in activity with an individual's tendency to continue participating in an activity. Using standard statistical techniques, we show that our model is consistent with the empirical data. Our findings may also provide insight into the origins of heavy-tailed distributions in other complex systems.
Sign up for PNAS alerts.
Get alerts for new articles, or get an alert when an article is cited.
The analysis of social and economic data has a long and illustrious history (1–3). Despite their idiosyncratic complexity, a number of striking statistical regularities are known to describe individual and societal human behavior (4–7). These regularities are of enormous practical importance because they provide insight into how individual behaviors influence social and economic outcomes. Indeed, much of the current research on complex systems aims to quantify the impact of individual agents on the organization and dynamics of the system as a whole (8, 9). Before we can predict how individuals affect, for example, the organization of systems, it is paramount to understand the behavior of the individual agents.
The current availability of digital records has made it much easier for researchers to quantitatively investigate various aspects of human behavior (10–21). In particular, e-mail communication records are attracting much attention as a proxy for quantifying deliberate human behavior because of the omnipresence of e-mail communication and availability of e-mail records (13, 14, 16, 18). The data, however, do not provide a detailed record of all of the activities in which each individual participates; we do not know, for instance, when an individual is sleeping, eating, walking, or even browsing the web. The resulting uncertainty in deliberate human activity thus poses a fundamental challenge to quantifying and modeling of human behavior.
Researchers commonly account for uncertainty or lack of information through stochastic models. One of the simplest stochastic models for human activity is a point process in which independent events occur at a constant rate ρ. Such processes are referred to as homogeneous Poisson processes, and they are used to describe a large class of phenomena, including some aspects of human activity (22). Homogeneous Poisson processes have two well-known statistical properties: the time between consecutive events, the inter-event time τ, follows an exponential distribution, p(τ) = ρe−ρτ, and the number of events NT during a time interval of duration T time units follows a Poisson distribution with mean ρT.
Several recent studies of deliberate human activity, including e-mail correspondence, have focused on the former property. These studies have reported that the empirical distribution of inter-event times decays asymptotically as a power law, p(τ) ∝ τ-α, with exponent α ≈ 1 (13, 14, 18, 23). Other studies have identified a similar power-law scaling in the inter-event time distribution of many other facets of human behavior, such as file downloads (10–12), letter correspondence (15, 17, 18), library usage (17), broker trades (17), web browsing (17, 19), human locomotor activity (20), and telephone communication (21). These observations are in stark contrast to the predictions of a homogeneous Poisson process, suggesting that a more suitable null model with which to compare mechanistic models of human activity is a truncated power-law model with scaling exponent α = 1.*
The heavy-tailed nature of the distribution of inter-event times prompts us to search for the mechanisms responsible for its emergence. Two main classes of mechanisms can be considered: (i) human behavior is primarily driven by rational decision making, which introduces correlations in activity, thereby giving rise to heavy tails;† (ii) human behavior is primarily driven by external factors such as circadian and weekly cycles, which introduces a set of distinct characteristic time scales, thereby giving rise to heavy tails.‡ Whereas the former interpretation has been shown to give rise to a truncated power-law distribution of inter-event times, the latter has been rejected by some authors (17, 18). Indeed, even though Hidalgo (24) investigated a model with seasonal changes in activity rates that is able to generate data with an approximate power-law decay in the distribution of inter-event times with exponents α ≈ 2 or α ≈ 1, the α ≈ 1 case requires a specific relationship between the rates of activity ρi and the corresponding duration of the seasons Ti over which each rate holds. It has therefore been argued that seasonality alone can only robustly give rise to heavy-tailed inter-event time distributions with exponent α ≈ 2 (17).
Here, we demonstrate that the distribution of inter-event times in e-mail correspondence patterns display systematic deviations from the truncated power-law null model because of circadian and weekly patterns of activity. We subsequently propose a mechanistic model that incorporates these observed cycles, and a simulated annealing procedure to nonparametrically estimate its parameters. We then use Monte Carlo hypothesis testing to demonstrate that the predictions of our model are consistent with the observed heavy-tailed inter-event time distribution. Finally, we discuss the implications of our findings for modeling human activity patterns and, more generally, complex systems.
Empirical Patterns
We study a database of e-mail records for 3,188 e-mail accounts at a European university over an 83-day period (23). Each record comprises a sender identifier, a recipient identifier, the size of the e-mail, and a time stamp with a precision of 1 s. We preprocess the dataset and identify a set of 394 accounts that provide enough data to quantify human activity and that are likely neither spammers nor listservs [see Preprocessing of the Data in supporting information (SI) Text and Fig. S1].
To gain some intuition about e-mail activity patterns, let us consider a fictitious student, Katie.§ Katie arrives at the university 20 min before her Thursday morning class. During this time, she decides to check her e-mail and sends 3 e-mails. Katie checks her e-mail after lunch and sends a brief e-mail to a friend before her next class. Later that evening, Katie sends 4 more e-mails once she has finished her homework. Katie does not check her e-mail again until the following day when she sends e-mails intermittently between attending classes, completing homework assignments, and meeting social engagements. Katie spends the weekend without e-mail access and doesn't send another e-mail until Monday. Katie's e-mail activity, which is similar to many e-mail users, is both periodic and cascading. That is, there are periodic changes in her activity rate, which account for her sleep and work patterns, and there are cascades of activity—active intervals—of varying length when Katie primarily focuses on e-mail correspondence (Fig. 1).
Fig. 1.
If our intuition about deliberate human activity is correct, then the periodic pattern of activity should manifest itself in the inter-event time statistics, particularly when compared with the predictions of the truncated power-law null model that does not account for temporal periodicities (see Null Model in SI Text). Specifically, we anticipate that e-mail users typically send e-mails during the same 8-hour periods of the day. We therefore expect the data to have significantly more inter-event times between 24 ± 8 h—the time required to send e-mails on consecutive workdays—than the truncated power-law model predictions. We therefore expect that the null model underestimates the number of inter-event times between 16 and 32 h. Because of the normalization of the probability density, the truncated power-law model will overestimate other inter-event times. These predictions are all confirmed by the data, suggesting that periodicity is a fundamental aspect of human activity (Fig. 2).
Fig. 2.
Model
We propose a model of e-mail usage that incorporates the hypothesized periodic and cascading features of human activity. We account for periodic activity with a primary process, which we model as a nonhomogeneous Poisson process. Whereas a homogeneous Poisson process has a constant rate ρ, a nonhomogeneous Poisson process has a rate ρ(t) that depends on time. In our model, the rate ρ(t) depends on time in a periodic manner; that is, ρ(t) = ρ(t + W), where W is the period of the process. Consistent with our observations (Fig. 3), we relate the rate of the nonhomogeneous Poisson process to the daily and weekly distributions of active interval initiation, pd(t) and pw(t):where the period W is 1 week, and the proportionality constant Nw is the average number of active intervals per week.¶
Fig. 3.
We further assume that each event generated from the primary process initiates a secondary process, which we model as a homogeneous Poisson process with rate ρa (see Additional Evidence for a Homogeneous Poisson Cascade in SI Text). We refer to these “cascades of activity” as active intervals, during which Na additional events occur where Na is drawn from some distribution p(Na). Once the Na events have occurred in the active interval, the activity of the individual is again governed by the primary process defined by Eq. 1. Our model thus mimics how individuals like Katie use e-mail: Katie sends e-mails sporadically throughout the day, but once she starts checking her e-mail, it is relatively easy to send additional e-mails in rapid succession. We refer to the resulting model as a cascading nonhomogeneous Poisson process.‖
Results
To compare our model with the empirical data, we first need to estimate the parameters of our model from the data. Ideally, the data would specify which events belong to the same active intervals—the active interval configuration C—so that we could estimate the distributions pd(t), pw(t), and p(Na). The data we analyze, however, do not specify the actual active interval configuration Co, so it is not evident whether, for example, p(Na) should be described by a normal or exponential distribution.
Because we do not know a priori the functional form of the activity pattern in the cascading process, we cannot use the formalism implemented by, for example, Scott and coworkers (28, 29). Instead, we introduce a new method that enables us to nonparametrically infer the empirical distributions pd(t), pw(t), and p(Na) from the data.
Given a particular active interval configuration C, we can easily calculate all of our model's parameters and compare its predictions with the empirical data: Nw is the average number of active intervals per week; pd(t) and pw(t) are the probabilities of starting an active interval at a particular time of day and week, respectively; the active interval rate ρa is the inverse of the average inter-event time in active intervals; and the probability of Na additional events occurring during an active interval p(Na) is estimated directly from the active interval configuration (Fig. 3). We then manipulate the active interval configuration C to find the active interval configuration Ĉ that gives a best estimate of the observed inter-event time distribution (see Methods). This method allows us to infer the best-estimate distributions p̂d(t), p̂w(t), and p̂(Na), given the data and our proposed model, without making any assumptions on their functional forms.
We next compare the predictions of the cascading nonhomogeneous Poisson process with the empirical cumulative distribution of inter-event times P(τ) for all 394 users under consideration in the present study (see SI Appendix). Because we are using the empirical data to estimate the parameters for our model—that is, the estimated parameters depend on the data—we must use Monte Carlo hypothesis testing (30, 31) to assess the significance of the agreement between the predictions of our model and the empirical data (see Monte Carlo Hypothesis Testing in SI Text). The visual agreement of our model's predictions are confirmed by P values clearly above our 5% rejection threshold (Fig. 4).
Fig. 4.
In fact, the cascading nonhomogeneous Poisson process can only be rejected at the 5% significance level for 1 user, indicating that our model cannot be rejected as a model of human dynamics. By comparison, the truncated power-law null model is rejected at the 5% significance level for 344 users. Indeed, the null model is always rejected for many more users than the cascading nonhomogeneous Poisson process regardless of the rejection threshold selected, and our model does not display the large systematic deviations from the data that are observed for the truncated power-law null model (Fig. 5)
Fig. 5.
Discussion
Our results clearly demonstrate that circadian and weekly cycles, when coupled to cascading activity, can accurately describe the heavy tails observed in e-mail communication patterns. The question then is, would rational decision making, together with circadian and weekly cycles, be equally able to describe the statistical patterns observed for e-mail communication? Even if the answer to this question is affirmative, parsimony suggests that rational decision making is not a necessary component of human activity patterns, given our simpler explanation.
In addition to providing a good description of e-mail communication patterns, we surmise that our model is readily applicable to many other conscious human activities. For instance, most people make telephone calls sporadically throughout the day. After a telephone call has been made, it is effortless to make another telephone call. Similarly, individuals run errands throughout the month. Once an individual runs one errand, it is easier to run another errand during the same trip than it is to run errands again the following day. Both of these anecdotes are illustrative of the way humans tend to optimize their time and effort to accomplish the tasks in their daily routines, a process that is captured by the periodic and cascading mechanisms in our model.
The particular periodic and cascading features that are incorporated into our model depend on the activity under consideration. For instance, sexual activity is influenced by menstrual cycles (32), and airline travel is influenced by seasonality (33). Furthermore, our model can also be generalized to cases in which the parameters are not stationary. This may be important, for instance, in the case of Darwin and Einstein's letter correspondence in which the number of letters sent per year increased 100-fold over 40 years (15, 18).
Although our model is only designed to account for a single activity (e-mail correspondence), it can easily be extended to incorporate the multitude of activities in which any individual participates. To facilitate the inclusion of additional activities, it is useful to interpret our model as a nonstationary hidden Markov point process (28, 34). Within this framework, an individual switches between any two activities i and j with some probability defined by a nonstationary Markov transition matrix Tij(t) that depends on time t. For instance, our model can be redefined as a nonstationary hidden Markov point process that switches between two states: a state in which an individual is not composing e-mails and a state in which an individual is composing e-mails. Predictions of models that incorporate more than one activity can then be verified against data that records several activities for a single individual.
Our model further suggests an experiment (35) that not only records when an individual has sent an e-mail, but also when that individual is using a computer or actively using an e-mail client. This additional data would provide direct empirical evidence for describing active intervals. In the absence of such data, we have developed a simulated annealing procedure that allows us to nonparametrically infer the hidden Markov structure of our model, providing insight into how to compare our model with other cascading point processes (25, 26).
Although our model provides an accurate description of when an e-mail is sent, a question left unaddressed is to determine who the probable recipient of that e-mail is going to be. For instance, one might speculate that e-mails are sent randomly with some Poissonian rate to acquaintances or individuals who share common interests. Alternatively, it is plausible that e-mails are sent based on a perceived priority of important tasks, perhaps in response to previous correspondence (14). When combined with our model that statistically describes when individuals send e-mails, quantifying the likely recipient of an e-mail will provide an important step toward describing how the structure of e-mail and social networks evolve.
Our study also provides a clear demonstration of how hypothesis testing (30, 36) can objectively assess the validity of a proposed model—a procedure we vehemently advocate. Using this methodology, we demonstrate that although both models reproduce the asymptotic scaling of the observed inter-event time distribution, our model is consistent with the entire inter-event time distribution, whereas the truncated power-law null model is not.
The consequences of our findings are clear; demonstrating that a model reproduces the asymptotic power-law scaling of a distribution does not necessarily provide evidence that the model is an accurate mechanistic description of the underlying process. Indeed, there is mounting evidence that some purported power-law distributions in complex systems may not be power laws at all (37–39). There may be a common explanation for these apparent power laws: Complex systems are inherently hierarchical, but the distinct levels in the hierarchy are difficult to distinguish (40). In the case of e-mail correspondence, for example, the active intervals are not recorded in the data, thereby concealing the various scales of e-mail activity. This demonstrates how the mixture of scales of activity can give rise to scale-free activity patterns. We suspect that similar mixture-of-scales explanations (41–45) may provide a basis for the reported universality of heavy-tailed distributions in complex systems.
Methods
Area Test Statistic.
We quantify the agreement between a model ℳ(θ) with parameters θ and dataset 𝒟 by measuring the area A between the empirical cumulative distribution function P𝒟(u) and the model cumulative distribution function Pℳ(u∣θ):We specify u = ln τ, which is roughly uniformly distributed, to improve the numerical efficiency of our simulated annealing procedure. The area test statistic is advantageous because it is easy to interpret, and it retains more information about the distribution than many other test statistics (see Area Test Statistic in SI Text).
Identifying Active Intervals.
If we knew the actual active interval configuration Co, it would be straightforward to compute the parameters θo = {Nw,pd(t),pw(t),ρa,p(Na)} of the cascading nonhomogeneous Poisson process. The data, however, do not identify the actual active interval configuration Co, so we must use heuristic methods (see Simulated Annealing Procedure in SI Text) to determine the best-estimate active interval configuration Ĉ, from which we can compute the best-estimate parameters θ̂. We use simulated annealing to minimize the area test statistic A (Eq. 2) for the inter-event time distribution. Thus, identifying active intervals that are consistent with our expectations for our model reduces to finding the best-estimate active interval configuration Ĉ, which minimizes the area A between the empirical data and the predictions of the cascading nonhomogeneous Poisson process.
Our simulated annealing procedure is as follows. Starting from a random active interval configuration C in which adjacent events are randomly assigned to the same active interval, we compute the parameters θ of the cascading nonhomogeneous Poisson process, then we numerically estimate the cumulative distribution Pℳ(u∣θ), and, finally, we measure the area test statistic A(C) of the active interval configuration C. The active interval configuration is modified to a new configuration C′ by either merging two adjacent active intervals or by splitting an active interval. If the new configuration C′ reduces the area test statistic, then the new configuration is unconditionally accepted. Otherwise, the configuration is conditionally accepted with probability exp(−(A(C′) − A(C))/T), where T is the effective “temperature” measured in units of the area test statistic A. After attempting 2N configurations at each temperature so that each pair of N consecutive events might be merged and split, we reduce the temperature T by 5% until the active interval configuration settles at the best-estimate Ĉ without moving for 5 consecutive cooling stages.
Throughout the simulated annealing procedure, we track the lowest area test statistic configuration. If the system has settled in a configuration that is not the lowest area test statistic configuration, the system is placed in the lowest area test statistic configuration, and the system is cooled further. We have verified that our simulated annealing procedure accurately identifies active intervals and estimates parameters θ in synthetically generated cascading nonhomogeneous Poisson process datasets (see Simulated Annealing Procedure in SI Text and Fig. S5).
Acknowledgments.
We thank R. Guimerà, M. Sales-Pardo, M. J. Stringer, E. N. Sawardecker, S. M. Seaver, and P. McMullen for insightful comments and suggestions. R.D.M. and D.B.S. thank the National Science Foundation (NSF)–Integrative Graduate Education and Research Traineeship Program (DGE-9987577) for partial funding during this project. A.E.M. is supported by NSF Grant DMS-0709212. L.A.N.A. gratefully acknowledges the support of NSF Award SBE 0624318 and of the W. M. Keck Foundation.
Supporting Information
Supporting Appendix (PDF)
Supporting Information
- Download
- 5.42 MB
Supporting Information (PDF)
Supporting Information
- Download
- 4.49 MB
References
1
A Smith An Inquiry into the Nature and Causes of the Wealth of Nations (Methuen, London, 1786).
2
V Pareto Manuale di Economia Politica (Societa Editrice, Milan, 1906).
3
GK Zipf Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology (Addison–Wesley, Cambridge, MA, 1949).
4
MHR Stanley, et al., Scaling behaviour in the growth of companies. Nature 379, 804–806 (1996).
5
BA Huberman, PLT Pirolli, JE Pitkow, RM Lukose, Strong regularities in world wide web surfing. Science 280, 95–97 (1998).
6
V Plerou, LAN Amaral, P Gopikrishnan, M Meyer, HE Stanley, Similarities between the growth dynamics of university research and of competitive economic activities. Nature 400, 433–437 (1999).
7
LAN Amaral, et al., Behavioral-independent features of complex heartbeat dynamics. Phys Rev Lett 86, 6026–6029 (2001).
8
MEJ Newman, The structure and function of complex networks. Soc Indus Appl Math Rev 45, 167–256 (2003).
9
C Castellano, S Fortunato, V Loreto, Statistical physics of social dynamics., arXiv:0710.3256. (2007).
10
A Johansen, D Sornette, Download relation dynamics on the WWW following newspaper publication of URL. Physica A 276, 338–345 (2000).
11
A Johansen, Response times of internauts. Physica A 296, 539–546 (2001).
12
AG Chessa, JM Murre, A memory model for internet hits after media exposure. Physica A 333, 541–552 (2004).
13
A Johansen, Probing human response times. Physica A 338, 286–291 (2004).
14
AL Barabási, The origin of bursts and heavy tails in human dynamics. Nature 435, 207–211 (2005).
15
JG Oliveira, AL Barabási, Darwin and Einstein correspondence patterns. Nature 437, 1251 (2005).
16
DB Stouffer, RD Malmgren, LAN Amaral, Log-normal statistics in e-mail communication patterns., arXiv:physics/060527. (2006).
17
A Vázquez, JG Oliveira, Z Dezsõ, KI Goh, I Kondor, AL Barabási, Modeling bursts and heavy tails in human dynamics. Phys Rev E 73, 036127. (2006).
18
A Vázquez, Impact of memory on human dynamics. Physica A 373, 747–752 (2006).
19
Z Dezsõ, E Almaas, A Lukács, B Rácz, I Szakadát, AL Barabási, Dynamics of information access on the web. Phys Rev E 73, 066132. (2006).
20
T Nakamura, et al., Universal scaling law in human behavioral organization. Phys Rev Lett 99, 138103 (2007).
21
J Candia, et al., Uncovering individual and collective human dynamics from mobile phone records. J Phys A 41, 224015 (2007).
22
DJ Daley, D Vere-Jones An Introduction to the Theory of Point Processes (Springer, Berlin, 1988).
23
JP Eckmann, E Moses, D Sergi, Entropy of dialogues creates coherent structure in e-mail traffic. Proc Natl Acad Sci USA 101, 14333–14337 (2004).
24
C Hidalgo, Conditions for the emergence of scaling in the interevent time of uncorrelated and seasonal systems. Physica A 369, 877–883 (2006).
25
J Neyman, EL Scott, A statistical approach to problems of cosmology. J R Stat Soc B 20, 1–43 (1958).
26
SB Lowen, MC Teich Fractal-Based Point Processes (Wiley, New York, 2005).
27
AG Hawkes, Spectra of some self-exciting and mutually exciting point processes. Biometrika 58, 83–90 (1971).
28
SL Scott, Bayesian analysis of a two-state Markov modulated Poisson process. J Comput Graph Stat 8, 662–670 (1999).
29
SL Scott, P Smyth Bayesian Statistics (Oxford Univ Press, Oxford) 7 (2003).
30
RB D'Agostino, MA Stephens Goodness-of-Fit Techniques (Marcel Kekker, New York, 1986).
31
WH Press, SA Teukolsky, WT Vetterling, BP Flannery Numerical Recipes in C: The Art of Scientific Computing (Cambridge Univ Press, 2nd Ed, New York, 2002).
32
J Udry, NM Morris, Distribution of coitus in the menstrual cycle. Nature 220, 593–596 (1968).
33
N Kulendran, ML King, Forecasting international quarterly tourist flows using error-correction and time-series models. J Int Forecasting 13, 319–327 (1997).
34
RJ Elliott, L Aggoun, JB Moore Hidden Markov Models: Estimation and Control (Springer, Berlin, 1995).
35
DJ Watts, A twenty-first century science. Nature 445, 489 (2007).
36
DS Sivia, J Skilling Data Analysis: A Bayesian Tutorial (Oxford Science Publications, Oxford, 2006).
37
R Perline, Strong, weak and false inverse power laws. Stat Sci 20, 68–88 (2005).
38
AM Edwards, et al., Revisiting Lévy flight search patterns of wandering albatrosses, bumblebees and deer. Nature 449, 1044–1048 (2007).
39
A Clauset, CR Shalizi, MEJ Newman, Power-law distributions in empirical data arXiv:0706.1062., 2007).
40
M Sales-Pardo, R Guimerà, AA Moreira, LAN Amaral, Extracting the hierarchical organization of complex systems. Proc Natl Acad Sci USA 104, 15224–15229 (2007).
41
H Silcock, The phenomenon of labour turnover. J R Stat Soc A 117, 429–440 (1954).
42
CM Harris, The Pareto distribution as a queue service discipline. Oper Res 16, 307–313 (1968).
43
JM Hausdorff, C Peng, Multiscaled randomness: A possible source of 1/f noise in biology. Phys Rev E 54, 2154 (1996).
44
W Willinger, R Govindan, S Jamin, V Paxson, S Shenker, Scaling phenomena in the Internet: critically examining criticality. Proc Natl Acad Sci USA 99, 2573–2580 (2002).
45
AE Motter, APS de Moura, C Grebogi, H Kantz, Effective dynamics in Hamiltonian systems with mixed phase space. Phys Rev E 71, 036215. (2005).
Information & Authors
Information
Published in
Classifications
Copyright
© 2008 by The National Academy of Sciences of the USA.
Submission history
Received: January 11, 2008
Published online: November 25, 2008
Published in issue: November 25, 2008
Keywords
Acknowledgments
We thank R. Guimerà, M. Sales-Pardo, M. J. Stringer, E. N. Sawardecker, S. M. Seaver, and P. McMullen for insightful comments and suggestions. R.D.M. and D.B.S. thank the National Science Foundation (NSF)–Integrative Graduate Education and Research Traineeship Program (DGE-9987577) for partial funding during this project. A.E.M. is supported by NSF Grant DMS-0709212. L.A.N.A. gratefully acknowledges the support of NSF Award SBE 0624318 and of the W. M. Keck Foundation.
Notes
This article is a PNAS Direct Submission. S.S. is a guest editor invited by the Editorial Board.
*
For simplicity, we use a truncated power law with an exponent of α = 1 as our null model. Similar conclusions are reached when the power-law scaling exponent is fit to the data or when other heavy-tailed null models [e.g., log-normal or log-uniform distributions (16)] are considered.
†
If humans make decisions based on their own previous memories, then we might expect that humans are heavily influenced by recent events. That is, the probability ρdt that an event will happen in a time interval dt is not constant but is, instead, a decreasing function of the time elapsed since the last event (18).
‡
This interpretation does not rely on highly-competent human behavior and allows for the possibility that human activity, and hence the time dependence of ρ, is modulated by instinct, the environment, or social stimuli.
This article contains supporting information online at www.pnas.org/cgi/content/full/0800332105/DCSupplemental.
§
We suspect that most users had access to their e-mail only at the university because the data are obtained from a European university prior to 2004 (J. P. Eckmann, personal communication).
¶
In specifying Nw as the average number of active intervals per week, we are implicitly assuming that the fraction of time spent in active intervals is very small. We have verified that this is the case for all users under consideration. Also, it is important to choose the time step Δt in the binning of the empirical pd(t) to be sufficiently small such that the probability of an event occurring at time t is ρ(t)Δt ≪ 1. We choose Δt = 1/Nw hours, which meets this criterion while still maintaining computational feasibility.
Authors
Competing Interests
The authors declare no conflict of interest.
Metrics & Citations
Metrics
Citation statements
Altmetrics
Citations
Cite this article
105 (47) 18153-18158,
Export the article citation data by selecting a format from the list below and clicking Export.
Cited by
Loading...
View Options
View options
PDF format
Download this article as a PDF file
DOWNLOAD PDFLogin options
Check if you have access through your login credentials or your institution to get full access on this article.
Personal login Institutional LoginRecommend to a librarian
Recommend PNAS to a LibrarianPurchase options
Purchase this article to access the full text.