Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2014 Oct 6;111(43):15316–15321. doi: 10.1073/pnas.1323111111

Reputation and impact in academic careers

Alexander Michael Petersen a,1, Santo Fortunato b,1, Raj K Pan b, Kimmo Kaski b, Orion Penner c, Armando Rungi a, Massimo Riccaboni c,d, H Eugene Stanley e,1, Fabio Pammolli a,e
PMCID: PMC4217436  PMID: 25288774

Significance

Over a scientist’s career, a reputation is developed, a standing within a research community, based largely upon the quantity and quality of his/her publications. Here, we develop a framework for quantifying the influence author reputation has on a publication’s future impact. We find author reputation plays a key role in driving a paper’s citation count early in its citation life cycle, before a tipping point, after which reputation has much less influence relative to the paper’s citation count. In science, perceived quality, and decisions made based on those perceptions, is increasingly linked to citation counts. Shedding light on the complex mechanisms driving these quantitative measures facilitates not only better evaluation of scientific outputs but also a more transparent evaluation of the scientists producing them.

Keywords: computational sociology, science of science, networks of networks, Matthew effect, sociophysics

Abstract

Reputation is an important social construct in science, which enables informed quality assessments of both publications and careers of scientists in the absence of complete systemic information. However, the relation between reputation and career growth of an individual remains poorly understood, despite recent proliferation of quantitative research evaluation methods. Here, we develop an original framework for measuring how a publication’s citation rate Δc depends on the reputation of its central author i, in addition to its net citation count c. To estimate the strength of the reputation effect, we perform a longitudinal analysis on the careers of 450 highly cited scientists, using the total citations Ci of each scientist as his/her reputation measure. We find a citation crossover c×, which distinguishes the strength of the reputation effect. For publications with c < c×, the author’s reputation is found to dominate the annual citation rate. Hence, a new publication may gain a significant early advantage corresponding to roughly a 66% increase in the citation rate for each tenfold increase in Ci. However, the reputation effect becomes negligible for highly cited publications meaning that, for c ≥ c×, the citation rate measures scientific impact more transparently. In addition, we have developed a stochastic reputation model, which is found to reproduce numerous statistical observations for real careers, thus providing insight into the microscopic mechanisms underlying cumulative advantage in science.


Citation counts are widely used to judge the impact of both scientists and their publications (14). Although it is recognized that many factors outside the pure merit of the research or the authors influence such counts, little effort has been devoted to identifying and quantifying the role of the author-specific factors. Recent investigations have begun to study the impact the individual scientists have through collaboration and reputation spillovers (5, 6), two integrative features of scientific careers that contribute to cumulative advantage (79). However, the majority of citation models avoid author-specific effects, mainly due to the difficulty in acquiring comprehensive disambiguated career data (1013). As the measures are becoming increasingly common in evaluation scenarios throughout science, it is crucial to better understand what the citation measures actually represent in the context of scientists’ careers. Moreover, how does reputation affect a scientist’s access to key resources, the incentives to publish quality over quantity, and other key decisions along the career path (1418)? In addition, what role does reputation play in the mentor-matching process within academic institutions, in the effectiveness of single/double blinding in peer review, and in the reward system of science (14, 15, 19)?

It is against this background that we have developed a quantitative framework with the goal of isolating the effect of author reputation upon citation dynamics. Specifically, by controlling for time- and author-specific factors, we quantify the role of author reputation on the citation life cycle of individual publications at the micro level. We use a longitudinal career dataset from Thomson Reuters Web of Science comprising 450 highly cited scientists, 83,693 articles, and 7,577,084 citations tracked over 387,103 publication years. Dataset [A] refers to 100 top-cited physicists, [B] to another set of 100 highly prolific physicists, [C] to 100 assistant professors in physics, [D] to 100 top-cited cell biologists, and [E] to 50 top-cited pure mathematicians (for further data elaboration, see SI Appendix). For each central scientist i, we analyze the scientific production measured by the number ni(t) of publications published in year t, the cumulative number of citations ci,p(t) received by publication p, and our quantitative reputation measure defined here as the net citations aggregated across all publications Ci(t)=pci,p(t).

We begin with a description of our reputation model, followed by an empirical analysis of career trajectories, establishing Ci as a good quantitative measure of reputation. We then establish quantitative benchmarks from the citation distribution within individual publication portfolios and also quantify features of the citation life cycle, both of which are crucial components of our reputation effect model. Combining several empirical features of our analysis, we then investigate the role of the reputation effect, showing that author reputation accounts for a significant fraction of the citation rate of young publications, thus providing a testable mechanism underlying cumulative advantage in science (79). We conclude by developing a stochastic Monte Carlo (MC) reputation model, one which matches the microscopic and macroscopic citation dynamics.

Results

Reputation Signaling.

Academic career growth is a complex process emerging from the institutional, social, and cognitive aspects of science. Conceptually, each career i is embedded in two fundamental networks that are interconnected: the nodes in the first network represent scientists and in the second network represent publications. The links within the first network represent collaborations between scientists, and within the second network they represent citations between publications; the cross-links represent the associations between individuals and their publications.

Because these networks are dynamic, it is difficult to fully understand for any given individual, let alone the entire system, the complex information contained by all associations. As a result, scientific reputation has emerged as a key signaling mechanism to address the dilemma of excessive information that arises, for example, in the task of evaluating, comparing, and ranking publication profiles in academic competitions. Reputation signals can flow between scientists, jk; between publications, pq; and between a publication and a scientist, pi. The focus of our analysis is on this latter dependency, ip, whereby author reputation can impact the citation rate of his/her publications, generating subsequent reputation feedback, pi.

Reputation plays an important role as a signal of trustworthiness and quality, a role that addresses directly the “agency problem” characterizing the reward system in science (14). Moreover, reputation signaling in scientific networks is used to overcome information asymmetries between scientists and other academic agents; in this role, it will become increasingly important as the rate of science publication grows and scientists have less time to absorb relevant advancements (14, 1921). With little time to read every paper on a given topic, this trustworthiness signal is anecdotally consistent with the common practice of perusing the author names when preliminarily evaluating the relevance of a newly found publication. In the past, an author’s identity and associated reputation was mainly linked to reference lists and personal interactions. Nowadays, an author’s reputation is becoming increasingly visible through searchable publication databases, laboratory websites, press, and other media, in addition to citations.

We measure the author reputation by Ci(t), which measures not only the number of times his/her Ni(t) publications have been referenced (an indication of overall scientific impact) but also the number of appearances of his/her name in the literature, thereby providing a name association visibility. What Ci does not account for is intrinsic research quality, e.g., the quality ratio Ci/Ni is broadly distributed across scientists. Because quantitative proxies for quality are limited to citation counts, it is presently difficult to distill the role played by quality in assessing overall scientific impact.

By analyzing the top scientists, we reduce the compound reputation effect occurring when two or more scientists of comparable reputation are coauthors on a publication, a scenario where it may be difficult to estimate the differential impact of these scientists on the citation rate. Due to data limitations requiring author name disambiguation and career data for all coauthors j, we assume that a majority of the reputation signal is attributable to the central scientist i by the following approximation: C(t)jCj(t)Ci(t). Also, by analyzing top-cited cohorts, we can establish an upper bound to the strength of the reputation effect. We note that Ci possibly discounts the role of mentor reputation effects early in the career (22). Nevertheless, by analyzing top scientists, the signaling advantage received early in their careers by associating with prestigious mentors/coauthors should be negligible over the long run (20).

To measure the role of author reputation vis-à-vis publication impact, we use a regression model that correlates the increase in the number of citations Δci,p(t+1) for a given paper p in year t+1 using three explanatory variables: (i) the role played by the net number of citations cp(t) accrued up to paper age τp quantified by the power-law regression parameter π, (ii) the role of publication age and the obsolescence of knowledge quantified by the exponential regression parameter τ¯; and (iii) the role of author reputation Ci(t) quantified by the power-law regression parameter ρ.

Together, these three features are (i) the publication citation effect Πp(t)[cp(t)]π, (ii) the life cycle effect Ap(τ)exp[τp/τ¯], and (iii) the author reputation effect Ri(t)[Ci(t)]ρ. We perform a multivariate regression to estimate the π, τ¯, and ρ values that parameterize the citation model,

Δci,p(t+1)η×Πp(t)×Ap(τ)×Ri(t), [1]

with the multiplicative log-normal noise term η. In SI Appendix, we perform an additional fixed-effects regression using year as well as author variables to better control for the overall growth in scientific output across time. To fully justify our reputation effect model, in what follows we first account for two key features: measures for cumulative career reputation and obsolescence features of the citation life cycle.

Patterns of Growth for Longitudinal Reputation Measures.

In this section, we investigate the patterns of cumulative publication and citation growth across the career. A striking statistical patterns observed for top scientists is the faster than linear growth in time, both in cumulative publication number Ni(t)t=1tni(t) and in cumulative citation count Ci(t)p=1Ni(t)ci,p(t) for a large part of a scientist’s “growth phase,” which we find to be 30 y after their first publication. Fig. 1 A and B shows the characteristic growth trajectories N(t)tα¯ and C(t)tζ¯, calculated by an appropriate average over individual Ni(t) and Ci(t), respectively. To facilitate visual comparison, we use arbitrary normalized ordinate units so that each curve starts from the same point, N(1)=C(1)1. The growth trajectories are characterized by superlinear algebraic growth, with α¯1 and ζ¯>α¯ (values shown in Fig. 1). Individual exponents αi and ζi are also calculated for the Ni(t) and Ci(t) of each author (in addition to multiple other quantitative measures; SI Appendix, Tables S1–S9). We averaged both αi and ζi within each dataset, confirming that αiα¯ and ζiζ¯, confirming that the aggregate patterns hold for the individual scale. In SI Appendix, we control for the exponential growth in scientific publication rates, which can contribute to the longitudinal growth in Ci(t). We define “deflated” citation counts Δci,pD(t)Δci,p(t)/D(t), which are normalized by the number of publications D(t) within a given discipline (because a new publication can cite an old publication only once). For each discipline, we observe a 5% exponential growth in D(t) over the last half century. After deflating each Ci(t), the net affect is only to reduce the estimated ζi values by roughly 15%, meaning that the growth exponents ζi2 reflect significant growth above the underlying baseline growth trend in science.

Fig. 1.

Fig. 1.

Quantifying cumulative reputation measures and citation dynamics. (A and B) Growth trajectories of the cumulative publications N(t) and citations C(t), appropriately rescaled to start from unity in each ordinate. The characteristic α¯ and ζ¯ exponents shown in each legend are calculated over the growth phase of the career. The mathematicians [E] have distinct career trajectories, with α¯1 because collaboration spillovers via division of labor likely play a smaller role in publication rate growth. See SI Appendix, Tables S1–S9, for αi and ζi values calculated for individual careers. (C) Relation between τ1/2 and cumulative citations cp. (D) PA dynamics with π1 break down for c<c×. The reputation effect provides a citation boost above the baseline PA citation rate attributable to cp(t) only.

Hence, we use Ci(t) as a quantitative measure of reputation owing to the fact that the time dependence is readily quantified by a single parameter ζi. We also use the power-law scaling of Ci(t) as a benchmark for the stochastic career model we develop in the final section. Fig. 2 shows two additional empirical benchmarks: (i) the microscopic citation dynamics of individual publications comprising the publication portfolio, and (ii) the rank-citation profile, which is the Zipf distribution of the publications ranked in decreasing order of rank r, ci(1)ci(2)ci(Ni). We confirm that the individual curves ci(r) belong to the class of the discrete generalized β distributions (DGBDs), which in the general form reads c(r)rβ(N+1r)γ (4). We validate the DGBD fits using the χ2 test (SI Appendix).

Fig. 2.

Fig. 2.

Quantitative patterns in the growth and size-distribution of the publication portfolio for scientists from 3 disciplines. (Left) ci,p(t) for each author’s most cited papers (colored according to net citations in 2010) along with Ci(t)tζi (dashed black curve). (Right) The evolution of each author’s rank-citation profile using snapshots taken at 5-y intervals. The darkest blue data points represent the most recent ci(r,t), and the subset of red data points indicate the logarithmically spaced data values used to fit the empirical data to our benchmark DGBD rank-citation distribution model (4) (solid black curve; SI Appendix). The intersection of ci(r,t) with the dashed black line corresponds to the author’s h-index hi(t).

Variability in the Citation Life Cycle.

Important scientific discoveries can cause paradigm shifts and significantly boost the reputation of scientists associated with the discovery (23). To measure the reputation effect, one must also account for obsolescence features of scientific knowledge. It is also important to account for the variations in scientific impact, because most publications report results that are not seminal breakthroughs, but, rather, report incremental advances that are likely to have relatively short-term relevance.

In this section, we analyze the dynamics of the citation trajectory Δcp(τ), the number of new citations received in publication year τ, where τ is the number of years since the publication was first cited. We analyze Δcp(τ) at two levels of aggregation: (i) for each discipline, we calculate an averaged Δcp(τ) by collecting publications with similar total citation counts cp. To achieve a scaled trajectory that is better suited for averaging, we normalize each individual Δcp(τ) by its peak citation value, Δcp(τ)Δcp(τ)/Max[Δcp(τ)]. In Fig. 3, the panels on the Left show the characteristic citation trajectory of publications belonging to each of the top 5 quintiles of each disciplinary citation distribution. Each curve represents the average trajectory Δc(τ)Nq1pΔcp(τ) calculated from the Nq publications in quintile q. (ii) For each career i, we calculate Δci(τ) by averaging over groups of ranked citation sets within his/her publication portfolio. The panels on the Right of Fig. 3 show that, even within prestigious careers, there is significant variation in the citation life cycle.

Fig. 3.

Fig. 3.

The citation life cycle reflects both the intrinsic pace of discovery and the obsolescence rate of new knowledge, two features that are discipline dependent. (Left) For each of three disciplines, the averaged citation trajectory Δc(τ) is calculated for papers in the nth quintile with the corresponding citation range indicated in each legend. For example, for physicists in dataset [A], the top 20% of papers have between 74 and 17,032 citations, and the papers in percentile 21–40 have between 31 and 73 citations. (Right) Δc(τ) calculated for rank-ordered groups of papers (listed in each legend) for three authors chosen from each discipline.

At both levels of aggregation, the impact life cycle typically peaks before publication age τ5 y. Counterexamples likely correspond to publications that receive a delayed secondary attention, e.g., receiving subsequent experimental validation of a previous theoretical prediction, and vice versa. We define the half-life τ1/2 as the time to reach half the peak citation rate, Δc(τ1/2)=1/2 in the decay phase. Papers in the theoretical domains of mathematics and physics can exhibit τ1/2>40 y. Remarkably, some mathematics publications even have τ1/2 that span nearly the entire data sample duration of 100 y, reflecting the indisputable and foundational nature of progress by proof. This is in contrast to top-cited cell biology publications, whereby for even the top 20% of most cited works, the value τ1/210 y. This relatively short decay timescale likely arises from the large scale of research output in biomedical fields, which leads to a significantly higher discovery rate, and likewise, a relatively faster obsolescence rate.

The relation between the decay timescale τ¯ and cp provides insight into the knowledge diffusion rate. Fig. 1C shows an approximate scaling relation τ1/2cpΩ¯ when grouping publications into logarithmically spaced cp bins. Physics and biology differ mainly for the highly cited publications with cp40, whereas mathematics shows larger variation in τ1/2 per citation. The Ω¯ value provides an approximate relation between citations and time. In mathematics, τ1/2cp, indicating that the impact is distributed roughly uniformly across time. However, for biology publications, the sublinear relation with Ω¯0.30 indicates that for two publications, one with twice the citation impact as the other, the more cited publication gained twice the number of citations over a time period τ1/2 that was less than twice as large as the τ1/2 of the less-cited publication. The differences in Ω¯ are possibly related to discipline-dependent bursts in technological advancement, funding initiatives (15), and other social aspects of science that are related to nonlinearities in scientific advancement.

Baseline Citation Model.

To provide an initial test for basic mechanistic differences between the citation dynamics of highly cited publications and less-cited publications, in this subsection we analyze the relation between Δcp(t+1) and cp(t) representing the standard baseline preferential attachment (PA) model (corresponding to the limit τ¯ and ρ=0). Grouping together papers by cp(t) (using logarithmic bins), we calculate for each group the mean number of new citations in the following year, Δcp(t+1). Fig. 1D shows the empirical relation for physicists in datasets [A/B], indicating that publications with citations above a gradual but substantial citation crossover value c× obey a distinct scaling law that matches approximately linear (π1) PA dynamics (see SI Appendix, Fig. S8, for other disciplines). However, below c×, the citation rates are in excess of the citation rate expected from linear PA alone, reflecting the citation premium that can be achieved via reputation.

Quantifying the Role of the Reputation Effect.

By analyzing the publications of highly cited scientists, we have shown that the basic citation dynamics above and below the citation crossover value c× vary considerably. In this subsection, we investigate the role played by the reputation effect for publications with cp(τ)c× compared with publications with cp(τ)<c×. Based upon the assessment of the growth dynamics (SI Appendix, Figs. S8 and S9), we choose the crossover values c×40 [A/B], c×10 [C], c×100 [D], and c×20 [E]. Our results are not strongly dependent on reasonable variations around our choice of c×. Table 1 shows the πi, τi¯, and ρi estimates, above and below c×, for the individual careers highlighted in Figs. 1 and 3.

Table 1.

Best-fit parameters for each effect (±SEs), both for individuals and the average values (±SD) within each dataset

c(t1)<c× c(t1)c×
Name πi (paper) τ¯i (life cycle) ρi (reputation) πi (paper) τ¯i (life cycle) ρi (reputation)
Gossard, A. C. 0.34 ± 0.027 4.92 ± 0.261 0.25 ± 0.008 0.80 ± 0.048 4.73 ± 0.184 0.09 ± 0.024
Barabási, A. L. 0.42 ± 0.036 3.00 ± 0.155 0.29 ± 0.010 1.06 ± 0.016 3.65 ± 0.111 0.01 ± 0.011
Average ± SD [A] 0.43 ± 0.14 5.67 ± 2.52 0.22 ± 0.06 0.96 ± 0.19 8.93 ± 4.09 −0.07 ± 0.11
Baltimore, D. 0.32 ± 0.018 4.64 ± 0.148 0.28 ± 0.006 0.62 ± 0.047 5.92 ± 0.250 0.15 ± 0.026
Laemmli, U. K. 0.54 ± 0.036 5.09 ± 0.297 0.21 ± 0.014 1.09 ± 0.025 6.40 ± 0.255 −0.12 ± 0.019
Average ± SD [D] 0.40 ± 0.14 6.64 ± 6.24 0.26 ± 0.05 0.99 ± 0.22 9.55 ± 26.30 −0.06 ± 0.14
Serre, J. P. 0.33 ± 0.095 15.90 ± 3.724 0.14 ± 0.026 0.66 ± 0.065 20.50 ± 3.862 −0.03 ± 0.039
Wiles, A. 0.56 ± 0.208 5.23 ± 1.187 0.24 ± 0.052 0.70 ± 0.059 9.04 ± 0.633 0.10 ± 0.042
Average ± SD [E] 0.27 ± 0.17 30.60 ± 56.80 0.14 ± 0.07 0.54 ± 0.25 21.40 ± 54.30 0.01 ± 0.11

For statistical significances, see SI Appendix, Tables S10–S22.

Our main result is a robust pattern of role switching by author- and publication-specific effects, specifically,

ρ(c<c×)>ρ(cc×)andπ(c<c×)<π(cc×). [2]

For example, for the aggregate dataset [A/B] representing prolific physicists, we estimate the values ρ(c<40)0.2, ρ(c40)0, π(c<40)0.4, and π(c40)1. To emphasize the role of reputation on new publications, consider two scientists separated by a factor of 10 in their cumulative citations, C1(t)=10C2(t). All other things being equal, the citation premium attributable to reputation alone for publications in the reputation regime (c<c×) is Δc1(t)/Δc2(t)=10ρ1.66 (using the value ρ=0.22 for dataset [A]). Hence, there is a 66% increase in the citation rate for each 10-fold increase in Ci(t), which integrated over a career can provide significant positive feedback. A pattern that emerges independent of discipline is ρ(cc×)0, meaning that reputation only plays a significant role for c<c×. In SI Appendix, section S6, we test the robustness of this result by implementing a fixed-effects regression, the result of which reaffirms the distinct roles of π and ρ above and below c×. Hence, these two inequalities in Eq. 2 indicate that publications are initially boosted by author reputation to a citation “tipping point” ci,pc×, above which the citation rate is sustained in large by publication reputation. These findings show how microscopic reputation mechanisms contribute to cumulative “rich-get-richer” processes in science (7, 9).

Simulating Synthetic MC Careers with the Reputation Model.

Here, we discuss three variants of a MC career growth model that simulates the dynamics of Δci,p(t+1) for each publication p in each time period t of the career of synthetic author i. With each variant, we introduce progressively a new feature of publication citation trajectories. (i) We begin with a basic linear PA model whereby Δci,p(t+1)ci,p(t). Model ii (PA-LC) includes a life-cycle (LC) obsolescence effect, Ap(τ). Fig. 4 compares models i and ii, which do not incorporate author specific factors, with the reputation model iii given by Eq. 1. The PA model fails to reproduce the characteristic trajectories of real publications, because there is a clear first-mover advantage (24) for the first publications published in the career, as well as non–power-law growth of Ci(t).

Fig. 4.

Fig. 4.

Comparison of three MC career models against empirical benchmarks demonstrated in Figs. 13 and SI Appendix, Figs. S1–S3. For each model, we show Δc(τ) for the top four groups of ranked papers, the evolution of ci,p(τ) and Ci(t) (dashed black curve), and the evolution of the rank-citation profile ci(r) at 5-period intervals. The best-fit DGBD β and γ parameters are also useful as quantitative benchmarks. For each model, we evolve the system over T40 periods, each period representative of a year. See SI Appendix for further elaboration of the model parameters used in the MC simulation.

We use quantitative patterns demonstrated for real careers in Figs. 2 and 3 as empirical benchmarks to distinguish models ii and iii. We confirm that the reputation model iii satisfies the empirical benchmark characteristics in all three graphical categories (Fig. 4). We also confirm for the model iii, but not for the model ii, that there is a distinction between Δc(τp) for different rank sets. Furthermore, for model iii, we quantitatively confirm that C(t)tζ with 2ζ3. For sufficiently large t, we also confirm that c(r,t) belongs to the class of DGBD distributions, with β values within the range of values observed empirically. In SI Appendix, we further demonstrate how the model can be used to estimate properties of “average” careers for a given MC parameter set. For example, SI Appendix, Fig. S11, shows excellent agreement between the reputation model’s prediction and empirical data when estimating the fraction fcx of publications with cpc× for a given career age t. Empirically, we observe saturation fcx 0.20–0.30 for large t.

Discussion

Social networks in science are characterized by heterogeneous structure (25) that provides opportunities for intellectual and social capital investment (26) and influences scientists’ research strategies (21). Identifying patterns of career growth is becoming increasingly important, largely due to the widespread emergence of quantitative evaluation processes and recent efforts to develop quantitative models of career development. However, difficulties in accounting for complex social mechanisms, in addition to nonlinearities and nonstationarities in the career growth process, highlight the case for caution in the development of predictive career models (16, 17). Without a better understanding of the institutional features and scientific norms that affect scientific careers, along the variable path from apprentice to group leader and mentor, there is a possibility to misuse quantitative career metrics in the career evaluation process.

Toward the goal of better understanding career growth, with potential policy implications for the quantitative career evaluation process, we have analyzed the effect of reputation on the microlevel processes underlying the dynamics of a scientist’s research impact. We used a regression model for the citation rate Δci,p, which accounts for the role of publication impact (π), the role of knowledge obsolescence (τ¯), and the role of author reputation (ρ). Interestingly, we find that the reputation parameter ρ(cc×)0, meaning that in the long run the reputation effect makes a negligible contribution to the citation rate of papers with large cp. However, we identify caveats concerning the way publications can become highly cited. By analyzing the variation of ρ and π for publications above and below a citation threshold c×, we identify the advantageous role that author reputation plays in the citation dynamics of new publications, finding that future publications can gain roughly a 66% increase in Δc for each 10-fold increase in reputation Ci. We note that it is also likely that both institutional affiliation and journal reputation also play a role in the citation dynamics; however, disentangling the interaction between the multiple reputation sources will likely be challenging and remains an open avenue for investigation.

In the process of analyzing the effect of reputation on career growth, it was necessary to also quantify two essential features of our model, namely patterns of cumulative productivity and impact across the career, and patterns of obsolescence in the citation life cycle of individual publications. For prolific scientists, we have identified a robust pattern of growth for two cumulative reputation measures, Ni(t) and Ci(t), each of which are quantifiable by a single scaling parameter, αi and ζi, respectively. These regularities suggest that underlying social processes sustain career growth via reinforcing coevolution of scientific collaboration and publication (6, 2729). We also introduced a citation deflator index to control for the increased supply of citations arising from the exponential 5% growth (per year) in the total publication output. Analyzing the growth of “deflated” citation trajectories, CiD(t), we observed ζi2 values which confirms that the observed career growth is significantly above the baseline inflation rate of science. We note that in using nondecreasing cumulative reputation measure Ci(t), we have overlooked the possibility that reputation can significantly decrease, as occurs when a scientist is associated with invalidated and/or fraudulent science. Indeed, recent evidence indicates that the retraction of a publication can have a negative impact on the potential growth of Ci (30). As a robustness check, we also used the annual citation rate ΔCi(t) as an additional (noncumulative) reputation measure, one that is more amenable to controlling for secular growth trends. We applied a multivariate fixed effects regression using ΔCi(t) as the reputation measure (SI Appendix, section S6), which reconfirms the role of reputation in citation dynamics.

Our analysis tracks the evolution of each scientist’s publication portfolio across the career, suitably illustrated by the rank-citation profile ci(r), which highlights the skewed distribution of ci,p, even within a career. Arising from the power-law features of ci(r) (4), we emphasize the disproportionate fraction of a scientist’s total citations Ci owed to the ci(r=1) citations coming from his/her highest-cited publication. For example, the average and SD of the ratio ci(1)/Ci is 0.15±0.13 for the physicists, 0.09±0.08 for the biologists, and 0.16±0.08 for the mathematicians we analyzed, which emphasizes the potentially large reputation boost that can follow from just a single high-impact publication. With rapidly increasing numbers of journals accompanied by the opportunity for rapid publication, the reputation effect provides an incentive to aim for quality over quantity in the publication process, reinforcing a research strategy that is beneficial for science and scientists.

It is also important to consider the role of reputation in light of the increasing orientation of science around team endeavors characterized by multiple levels of hierarchy and division of labor (31). Because it is difficult to evaluate and assign credit to individual contributions in a team setting, there may be an increase in the role and strength of the reputation in overcoming the problem associated with asymmetric and incomplete information. In addition to the collaboration network, reputation also plays a key role in numerous other scientific inputs (money, labor, knowledge, etc.) that inevitably affect the overall quantity and quality of scientific outputs. It will become increasingly important to understand the relation between these inputs and outputs to efficiently allocate scientific resources (6, 15, 18).

In light of individual careers, an institutional setting based on quantitative appraisal that neglects these complex relations may inadvertently go against the goal of sustaining the careers of talented and diligent young academics (6). For example, our finding of a crossover behavior around c× shows how young scientists lacking reputation can be negatively affected by social stratification in science. The appealing competitive advantage gained by working with a prestigious mentor may be countered by the possibility that it may not be the ideal mentor–advisee match. Despite having analyzed cohorts of highly cited scientists, our results have broad implications across the scientific population when one considers the numerous careers that interact with top scientists via collaboration or mentorship.

In excess, the reputation effect may also negatively affect science, especially considering how online visibility has become a relatively new reputation platform in an increasingly competitive environment. As such, strategies of self-promotion may emerge as scientists try to “game” with reputation systems. In such scenarios, it may be hard to disentangle fair from foul play. For example, it may be difficult to distinguish self-citation strategies aimed at boosting Ci from the natural tendency for scientists who are crossing disciplinary borders to self-cite with the intention to send credibility signals (32). Reputation will also become increasingly important in light of preferential treatment in search queries, e.g., Google Scholar, which provide query results ordered according to citation measures. These systemic search and retrieval features may further strengthen association of reputation between publications and authors. In all, our results should motivate future research to inspire institutional and funding body evaluation schemes to appropriately account for the roles that reputation and social context play in science. For example, our results can be used in support of the double-blind review system, which by reducing the role of reputation, is perceived to have advantages due to its objectivity and fairness (33). We conclude with a general note that the data deluge brought forth during the past decade is fueling extensive efforts in the computational social sciences (34) to identify and study the so-called “social atom” (35). Because our methodology is general, we speculate that other social networks characterized by trust and partial/asymmetric information are also based on similar reputation mechanisms. Indeed, it is likely that agent-based reputation mechanisms will play an increasing role in society due to the omnipresence of online recommender systems governed by reputation dynamics operating as a general diffusive contagion phenomena (36).

Supplementary Material

Supplementary File

Acknowledgments

We thank M. C. Buchanan, A. Scharnhorst, J. N. Tenenbaum, S. V. Buldyrev, and S. Succi for helpful discussions. A.M.P., S.F., K.K., M.R., and F.P. thank the European Union Framework Programme Project “Multiplex” for financial support and National Research Project “Crisis Lab” Project at Institutions, Markets, Technologies Lucca Institute for Advanced Studies. O.P. acknowledges funding from the Canadian Social Sciences and Humanities Research Council. H.E.S. thanks the National Science Foundation for Grant CMMI 1125290.

Footnotes

The authors declare no conflict of interest.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1323111111/-/DCSupplemental.

References

  • 1.Radicchi F, Fortunato S, Castellano C. Universality of citation distributions: Toward an objective measure of scientific impact. Proc Natl Acad Sci USA. 2008;105(45):17268–17272. doi: 10.1073/pnas.0806977105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Radicchi F, Fortunato S, Markines B, Vespignani A. Diffusion of scientific credits and the ranking of scientists. Phys Rev E Stat Nonlin Soft Matter Phys. 2009;80(5 Pt 2):056103. doi: 10.1103/PhysRevE.80.056103. [DOI] [PubMed] [Google Scholar]
  • 3.Petersen AM, Wang F, Stanley HE. Methods for measuring the citations and productivity of scientists across time and discipline. Phys Rev E Stat Nonlin Soft Matter Phys. 2010;81(3 Pt 2):036114. doi: 10.1103/PhysRevE.81.036114. [DOI] [PubMed] [Google Scholar]
  • 4.Petersen AM, Stanley HE, Succi S. Statistical regularities in the rank-citation profile of scientists. Sci Rep. 2011;1:181. doi: 10.1038/srep00181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Azoulay P, Zivin JSG, Wang J. Superstar extinction. Q J Econ. 2010;125(2):549–589. [Google Scholar]
  • 6.Petersen AM, Riccaboni M, Stanley HE, Pammolli F. Persistence and uncertainty in the academic career. Proc Natl Acad Sci USA. 2012;109(14):5213–5218. doi: 10.1073/pnas.1121429109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Merton RK. The Matthew effect in science. Science. 1968;159(3810):56–63. [PubMed] [Google Scholar]
  • 8.De Solla Price D. A general theory of bibliometric and other cumulative advantage processes. J Am Soc Inf Sci. 1976;27(8):292–306. [Google Scholar]
  • 9.Petersen AM, Jung WS, Yang JS, Stanley HE. Quantitative and empirical demonstration of the Matthew effect in a study of career longevity. Proc Natl Acad Sci USA. 2011;108(1):18–23. doi: 10.1073/pnas.1016733108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Peterson GJ, Pressé S, Dill KA. Nonuniversal power law scaling in the probability distribution of scientific citations. Proc Natl Acad Sci USA. 2010;107(37):16023–16027. doi: 10.1073/pnas.1010757107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Eom Y-H, Fortunato S. Characterizing and modeling citation dynamics. PLoS One. 2011;6(9):e24926. doi: 10.1371/journal.pone.0024926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Medo M, Cimini G, Gualdi S. Temporal effects in the growth of networks. Phys Rev Lett. 2011;107(23):238701. doi: 10.1103/PhysRevLett.107.238701. [DOI] [PubMed] [Google Scholar]
  • 13.Golosovsky M, Solomon S. Stochastic dynamical model of a growing citation network based on a self-exciting point process. Phys Rev Lett. 2012;109(9):098701. doi: 10.1103/PhysRevLett.109.098701. [DOI] [PubMed] [Google Scholar]
  • 14.Stephan PE. The economics of science. J Econ Lit. 1996;34(3):1199–1235. [Google Scholar]
  • 15.Stephan PE. How Economics Shapes Science. Harvard Univ Press; Cambridge, MA: 2012. [Google Scholar]
  • 16.Penner O, Petersen AM, Pan RK, Fortunato S. The case for caution in predicting scientists’ future impact. Phys Today. 2013;66(4):8–9. [Google Scholar]
  • 17.Penner O, Pan RK, Petersen AM, Kaski K, Fortunato S. On the predictability of future impact in science. Sci Rep. 2013;3:3052. doi: 10.1038/srep03052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Duch J, et al. The possible role of resource requirements and academic career-choice risk on gender differences in publication rate and impact. PLoS One. 2012;7(12):e51332. doi: 10.1371/journal.pone.0051332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.David PA. 2008. The historical origins of “Open Science”: An essay on patronage, reputation and common agency contracting in the scientific revolution. Capitalism and Society 3(2):Article 5.
  • 20.Ductor L, Fafchamps M, Goyal S, van der Leij MJ. Social networks and research output. Rev Econ Stat. doi: 10.1162/REST_a_00430. [DOI] [Google Scholar]
  • 21.Shane S, Cable S. Network ties, reputation, and the financing of new ventures. Manage Sci. 2002;48(3):364–381. [Google Scholar]
  • 22.Malmgren RD, Ottino JM, Nunes Amaral LA. The role of mentorship in protégé performance. Nature. 2010;465(7298):622–626. doi: 10.1038/nature09040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Mazloumian A, Eom Y-H, Helbing D, Lozano S, Fortunato S. How citation boosts promote scientific paradigm shifts and nobel prizes. PLoS One. 2011;6(5):e18975. doi: 10.1371/journal.pone.0018975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Newman MEJ. The first-mover advantage in scientific publication. Europhys Lett. 2009;86(6):68001. [Google Scholar]
  • 25.Burt RS. Structural Holes. Harvard Univ Press; Cambridge, MA: 1992. [Google Scholar]
  • 26.Nahapiet J, Ghoshal S. Social capital, intellectual capital, and the organizational advantage. Acad Manage Rev. 1998;23(2):242–266. [Google Scholar]
  • 27.Börner K, Maru JT, Goldstone RL. The simultaneous evolution of author and paper networks. Proc Natl Acad Sci USA. 2004;101(Suppl 1):5266–5273. doi: 10.1073/pnas.0307625100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Guimerà R, Uzzi B, Spiro J, Amaral LAN. Team assembly mechanisms determine collaboration network structure and team performance. Science. 2005;308(5722):697–702. doi: 10.1126/science.1106340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Palla G, Barabási AL, Vicsek T. Quantifying social group evolution. Nature. 2007;446(7136):664–667. doi: 10.1038/nature05670. [DOI] [PubMed] [Google Scholar]
  • 30.Lu SF, Jin GZ, Uzzi B, Jones B. The retraction penalty: Evidence from the Web of Science. Sci Rep. 2013;3:3146. doi: 10.1038/srep03146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wutchy S, Jones BF, Uzzi B. Multi-university research teams: Shifting impact, geography, and stratification in science. Science. 2008;322(5905):1259–1262. doi: 10.1126/science.1158357. [DOI] [PubMed] [Google Scholar]
  • 32.Hellsten I, et al. Self-citations, co-authorships and keywords: A new approach to scientists’ field mobility? Scientometrics. 2007;72:469–486. [Google Scholar]
  • 33.Ware M. 2008. Peer Review: Benefits, Perceptions and Alternatives, PRC Summary Papers 4 (Publishing Research Consortium, London), December, pp 1–20.
  • 34.Lazer D, et al. Life in the network: The coming age of computational social science. Science. 2009;323(5915):721–723. doi: 10.1126/science.1167742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Buchanan M. The Social Atom. Bloomsbury Publishing; New York: 2007. [Google Scholar]
  • 36.Vespignani A. Modelling dynamical processes in complex socio-technical systems. Nat Phys. 2012;8(1):32–39. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES