The hb-index, a modified h-index designed to more fairly assess author achievement

Olen R Brown

doi:10.1179/1351000212Y.0000000016

. 2013 Jul 19;17(4):176–178. doi: 10.1179/1351000212Y.0000000016

The h_b-index, a modified h-index designed to more fairly assess author achievement

Olen R Brown ^1,^✉

PMCID: PMC6837726

Abstract

The h-index characterizes the publication achievement/impact of authors and is defined by the originator Jorge Hirsch as: ‘the number of papers with citation number ≥h’. The h-index has the inherent characteristic that authors with very different total citations can have the same h-index. In fact, no contributions to the h-index are made either by papers cited fewer times than h, or citations of an individual paper above h. Such citations are ‘excess’ citations not credited by the h-index. To address these deficiencies, we propose a simple, straightforward modification, the h_b-index: Inline graphic , where h is the Hirsch h-index and e is the sum of all citations minus h². Therefore, e is the excess citations not credited by the h-index.

Keywords: Publication impact, Author impact, Hirsch index, Modified Hirsch index, Bibliographic impact

Introduction

The h-index was first described by Hirsch¹ in an article entitled: ‘An index to quantify an individual's scientific research output’. The h-index was originally intended to be a measure of an individual's achievement although it commonly now is used as a bibliographic measure of impact. Hirsch¹ defined the h-index as ‘the number of papers with citation number ≥h’. The h-index has received acceptance and support,^2–4 criticism,⁵ corrections, complements, and a single-number alternative has been devised.⁶. An extensive evaluation of the h index and its variants has been published.⁷ The h-index and the related area of journal impact has recently been the subject of articles published in journals of higher education.^8,9 The h-index has been used recently to assess impact of disease¹⁰ and it has been scaled for different scientific fields that were shown to have significantly different average citations per paper.¹¹ The h-index can be calculated using databases such as Thompson Scientific Web of Science¹² or Google Scholar Citations.¹³ A book and associated software¹⁴ is useful generally for the subject of publication impact, specifically for analyzing academic citations, and for calculating the h-index.

Inherently, the h-index has the undesirable characteristic that individuals with very different numbers of citations can have the same h-index. Our goal in this report was to modify the h-index to express the differences resulting from increased total citations while maintaining the other attributes of the index that have been found useful in evaluating the publication impact of an individual including for promotion, tenure, asset allocation, and related administrative purposes.

Results

Every h-index can be generated by the minimum of h² citations when exactly h papers receive exactly h citations. The effect of h² citations justifies Inline graphic term in the h_b-index. Also, any h-index theoretically could be generated by any total number of citations ≥h². The greater than h² citations are ‘excess’ citations and it appears reasonable and desirable to modify the h-index to credit these citations. This is especially important for young scientists who may have only a few papers and a comparatively low h-index. By analogy, it is desirable that a single-number index reveals the ‘ripening fruit’ whereas the h-index reveals only the ‘fully ripened’ fruit in one event as it creates a unitary increase in h on the special occasion that a citation occurs for a ‘nearly ripe’ paper.

Fig. 1 clarifies the h-index and reveals its limitation of not accounting for ‘excess’ citations. A specific publication record is shown for a hypothetical author who has published 90 papers cited 820 times. Citations are distributed as follows: 1 paper with 100; 9 papers each with 30; and groups of 10 papers each with 20, 10, 5, 4, 3, 2, 1, and 0 citations. By inspection, the h-index is 20 (number of papers with ≥h citations). However, the number of papers and their citation distributions are only one of many with the h-index of 20. Indeed, every h-index can be created from multiple distributions (except for the trivial case with one paper). Fig. 1 also illustrates that no citations of a paper that exceed 20 make any contribution to the h-index of 20. Likewise, no citation of lowly ranked papers (those below the twentieth paper in this case) makes any contribution to the h-index (by definition, exactly 20 papers cited 20 or more times, creates the h-index). The other citations are ‘excess’ because they do not contribute to the h-index.

Figure 1. — Graphic depiction for a hypothetical author with an h-index of 20 (see text for specifics).

There are other undesirable consequences of the h-index. For example, the first citation of any published paper of an individual creates an h-index of 1. However, the publication of any number of additional papers (however large the number) will not change the h-index until two papers become cited twice and (minimally) n papers must be cited n times for the h-index to become n. Thus, additional citations of ≤h for any number of papers cited h times will not raise the h-index. Also, a single paper, no matter how highly cited, cannot lead to an h-index that is larger than 1, and a body of papers, no matter how highly cited, cannot have an h-index larger than the number of papers in that body. Also, if a scientist publishes only one paper and that paper becomes cited any number of times, the h-index remains at 1. It is undesirable that an index behave in this manner.

Table 1 compares the h-index and the h_b-index. Publication records were created with different citation distributions to clarify the concept of ‘excess citations’ and to make the math simple enough to evaluate by inspection. However, other citation distributions contain the same defects. Table 1 distributions A–F compare different authors with very different total citations but the same number of papers to isolate the effect of number of citations. For clarity, the excess citations for distributions A–H occur in papers with citations greater than h. It is obvious that publications with citations fewer than h have no effect on h.

Table 1.

Corrective effects of the h_b-index

Citation distributions*	Total citations	Excess citations (e)^†	Publications	h-index	h_b-index
A	100	0	10	10	10
B	190	90	10	10	19.48
C	200	100	10	10	20.00
D	230	130	10	10	21.40
E	370	270	10	10	26.43
F	550	450	10	10	31.21
G	1050	650	20	20	45.50
H^‡	7150	5550	260	40	114.50

Open in a new tab

*Distribution A has the minimum number of publications and citations that yield the h-index of 10. The other citation distributions were made specific for clarity and devised to give a wide range of total citations, but are not unique to our argument. Distribution B is assigned 190 citations with 1 paper cited 100 times and 9 papers cited 10 times each. Distribution C has 1 paper cited 50 times, 2 papers cited 40 times each, and 7 papers cited 10 times each. Distribution D has 1 paper cited 50 times, 3 papers each cited 30 times, 3 papers each cited 20 times, and 3 papers each cited 10 times. Distribution E has 1 paper cited 100 times, 3 papers each cited 50 times, 3 papers each cited 30 times, and 3 papers each cited 10 times. Distribution F has 1 paper with each of the following number of citations: 100, 90, 80, 70, 60, 50, 40, 30, 20, and 10. Distribution G has 1 paper cited 200 times, 4 papers each cited 100 times, 5 papers each cited 50 times and 10 papers each cited 20 times. Distribution H is 1 paper cited 500 times, 1 paper cited 300 times, 3 papers each cited 150 times, 10 papers each cited 100 times, 25 papers each cited 40 times, 60 papers each cited 30 times, 70 papers each cited 20 times, 50 papers each cited 10 times, and 40 papers each cited 5 times.

^†The sum of those citations of individual papers that are greater than or less than the value of h. Excess citations have the value of e as defined by Inline graphic . Therefore, e is the excess citations not credited by the h-index.

^‡The citations are distributed to show the effects on an author with a high number of publications and citations that include some publications with citations less than h.

Discussion

It is apparent (Table 1) that many citation distributions can have the same h-index and number of publications but have unique h_b-indices. The first row in Table 1 has the fewest citations (h²) for an h-index of 10 and it has zero ‘excess’ (e) citations. All citations contribute and the h-index and the h_b-index are identical (the h_b-index is Inline graphic ). The second row in Table 1 also has an h-index of 10. However, the h_b-index is . Thus, the h_b-index is increased, relative to h, by the contribution of the 90 citations that otherwise do not contribute. Table 1 also shows that six authors, each with identical h-indices of 10 and exactly 10 papers but unique distributions of citations (A–F), have h_b-indices that range from 10 to 31.21. Fundamentally, the h-index can never be larger than the total number of publications, but it can range downward to zero. The h-index precisely equals the total number of papers that are cited ≥h times. More specifically, as shown by citation distributions A–G (Table 1), once each paper has been cited a minimum of 10 times, the h-index becomes 10. Regardless of additional citations, the h-index remains at 10 until another paper is published and each of the now eleven papers is cited a least 11 times. These additional ‘excess’ citations are revealed and credited by the h_b-index as the function Inline graphic where e is the value of the ‘excess’ citations. Incorporating provides a smooth increase in the h_b-index.

To further compare the h-index with the h_b-index, consider an additional case (not in Table 1) with citations distributed identically to distribution B (Table 1) except that all 90 excess citations are distributed among papers cited fewer times than the h index (10 in this case). This necessarily raises the total number of papers beyond 10 but these citations contribute nothing to the h-index. To evaluate the effects on the h_b-index for this case, citations must be specifically assigned. Assume 10 papers with 10 citations each (to generate h = 10) and one paper each with the following number of citations: 9, 8, 7, 6, 5, 4, 3, 2, and 1. Now there are 19 papers with 145 total citations. The h-index remains 10 but the h_b-index is now Inline graphic which equals 16.71 and credits the 9 papers that received a total of 45 citations. As a further example, if there had been 10 papers each cited 10 times and one paper cited an ‘excess’ 9 times, the h-index would, of course, be 10 but the h_b-index would be . Likewise, for 1 ‘excess’ citation, the h_b-index would be Inline graphic . The consequences for other cases are easily calculated.

It can be argued that the failure of the h-index to credit papers with citations below h is of low concern. However, researchers early in their careers are more likely to have such excess citations from papers not yet highly cited. Indeed, the h_b-index recognizes this and gives appropriate credit based on the comparatively low (but significant for discriminating among authors) contribution of Inline graphic by lowly cited papers. However, when there are papers that receive many citations in excess of h (Table 1 distributions B–H; see footnote for specifics), these papers are appropriately highly credited by the h_b-index which rises to comparatively large values because is comparatively large.

Conclusion

The h-index is improved by calculating the h_b-index which is the h-index plus a value Inline graphic with e defined as the excess citations (the citations in excess of the minimum total number (h²) that can define the h-index). The h_b-index credits citations that otherwise have no weight, differentiates among authors who have the same h-index, and is especially useful for distinguishing among publication records of young researchers with modest h-indices but significant papers that are ‘ripening’ but not yet highly cited.

References

1.Hirsch J. An index to quantify an individual's scientific research output. Proc Natl Acad Sci USA 2005;102(46):16569–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Spearman C, Quigley MJ, Quigley MR, Wilberger J. Survey of the h-index for all of academic neurosurgery: another power-law phenomenon? J Neurosurg 2010;113(5):929–33. [DOI] [PubMed] [Google Scholar]
3.Holden C. Random samples: data point-impact factor. Science 2005;309(8):1181c. [Google Scholar]
4.Ball P. Index aims for fair ranking of scientists. Nature 2004;365:900–1. [DOI] [PubMed] [Google Scholar]
5.Kelly C, Jennings M. The h index and career assessment by numbers. Trends Ecol Evol 2006;21(4):167–70. Available from: http://dx.doi.org/1016/j.tree.2006.01.005 [accessed 14 April 2012]. [DOI] [PubMed] [Google Scholar]
6.Bornmann L, Daniel H-D. What do we know about the h index? J Am Soc Infor Sci Tech 2007;58(7):1381–85. [Google Scholar]
7.Bornmann L, Mutz R, Daniel H-D. Are there better indices for evaluation purposes than the h index? A comparison of nine different variants of the h index using data from biomedicine. J Am Soc Infor Sci Tech 2008;59(5):830–7. [Google Scholar]
8.Budd J, Magnuson L. Higher education literature revisited: citation patterns examined. Res Higher Educ 2010;51(3):294–304. [Google Scholar]
9.Bray N, Major C. Status of journals in the field of higher education. J Higher Educ 2011;82(4):479–503. [Google Scholar]
10.McIntyre K, Hawkes I, Waret-Szkuta A, Morand S, Baylis M. The h-index as a quantitative indicator of the relative impact of human diseases. PLoS One [Electronic Resource] 2011;6(5):e19558. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Iglesias J, Pecharroman C. Scaling the h-index for different ISI fields. Scientimetrics 2007;73(2):303–20. [Google Scholar]
12.Thompson Scientific Web of Science Available from: http://isiknowledge.com [accessed 14 April 2012].
13.Google Scholar Citations [Internet] Available from: http://scholar.google.com/citations?user=WOv4-EEAAAAJ&hl=en [accessed 14 April 2012].
14.Hartzing A. [Internet] The Publish or Perish Book. Melbourne, Australia: Tarman Software Research Pty Ltd; 2010, Available from: http://www.harzing.com/pop.htm [accessed 14 April 2012]. [Google Scholar]

[CIT0001] 1.Hirsch J. An index to quantify an individual's scientific research output. Proc Natl Acad Sci USA 2005;102(46):16569–72. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0002] 2.Spearman C, Quigley MJ, Quigley MR, Wilberger J. Survey of the h-index for all of academic neurosurgery: another power-law phenomenon? J Neurosurg 2010;113(5):929–33. [DOI] [PubMed] [Google Scholar]

[CIT0003] 3.Holden C. Random samples: data point-impact factor. Science 2005;309(8):1181c. [Google Scholar]

[CIT0004] 4.Ball P. Index aims for fair ranking of scientists. Nature 2004;365:900–1. [DOI] [PubMed] [Google Scholar]

[CIT0005] 5.Kelly C, Jennings M. The h index and career assessment by numbers. Trends Ecol Evol 2006;21(4):167–70. Available from: http://dx.doi.org/1016/j.tree.2006.01.005 [accessed 14 April 2012]. [DOI] [PubMed] [Google Scholar]

[CIT0006] 6.Bornmann L, Daniel H-D. What do we know about the h index? J Am Soc Infor Sci Tech 2007;58(7):1381–85. [Google Scholar]

[CIT0007] 7.Bornmann L, Mutz R, Daniel H-D. Are there better indices for evaluation purposes than the h index? A comparison of nine different variants of the h index using data from biomedicine. J Am Soc Infor Sci Tech 2008;59(5):830–7. [Google Scholar]

[CIT0008] 8.Budd J, Magnuson L. Higher education literature revisited: citation patterns examined. Res Higher Educ 2010;51(3):294–304. [Google Scholar]

[CIT0009] 9.Bray N, Major C. Status of journals in the field of higher education. J Higher Educ 2011;82(4):479–503. [Google Scholar]

[CIT0010] 10.McIntyre K, Hawkes I, Waret-Szkuta A, Morand S, Baylis M. The h-index as a quantitative indicator of the relative impact of human diseases. PLoS One [Electronic Resource] 2011;6(5):e19558. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0011] 11.Iglesias J, Pecharroman C. Scaling the h-index for different ISI fields. Scientimetrics 2007;73(2):303–20. [Google Scholar]

[CIT0012] 12.Thompson Scientific Web of Science Available from: http://isiknowledge.com [accessed 14 April 2012].

[CIT0013] 13.Google Scholar Citations [Internet] Available from: http://scholar.google.com/citations?user=WOv4-EEAAAAJ&hl=en [accessed 14 April 2012].

[CIT0014] 14.Hartzing A. [Internet] The Publish or Perish Book. Melbourne, Australia: Tarman Software Research Pty Ltd; 2010, Available from: http://www.harzing.com/pop.htm [accessed 14 April 2012]. [Google Scholar]

PERMALINK

The h_b-index, a modified h-index designed to more fairly assess author achievement

Olen R Brown

Abstract

Introduction

Results

Figure 1.

Table 1.

Discussion

Conclusion

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

The hb-index, a modified h-index designed to more fairly assess author achievement

Olen R Brown

Abstract

Introduction

Results

Figure 1.

Table 1.

Discussion

Conclusion

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

The h_b-index, a modified h-index designed to more fairly assess author achievement