Abstract
In August 2011, Thomson Reuters launched version 5 of the Science and Social Science Citation Index in the Web of Science (WoS). Among other things, the 222 ISI Subject Categories (SCs) for these two databases in version 4 of WoS were renamed and extended to 225 WoS Categories (WCs). A new set of 151 Subject Areas was added, but at a higher level of aggregation. Perhaps confusingly, these Subject Areas are now abbreviated “SC” in the download, whereas “WC” is used for WoS Categories. Since we previously used the ISI SCs as the baseline for a global map in Pajek (Pajek is freely available at http://vlado.fmf.uni-lj.si/pub/networks/pajek/) (Rafols et al., Journal of the American Society for Information Science and Technology 61:1871–1887, 2010) and brought this facility online (at http://www.leydesdorff.net/overlaytoolkit), we recalibrated this map for the new WC categories using the Journal Citation Reports 2010. In the new installation, the base maps can also be made using VOSviewer (VOSviewer is freely available at http://www.VOSviewer.com/) (Van Eck and Waltman, Scientometrics 84:523–538, 2010).
Keywords: Map, Overlay, Web of Science, Subject categories, Interdisciplinarity
The WC categories are sometimes different from the old SCs. For example, the European Journal of Pharmaceutics and Biopharmaceutics, which was previously classified to the ISI SC “Engineering, Chemical” was now assigned to “Pharmacology and Pharmacy.” In an Annex, Rafols et al. (2010) showed that with aggregated sets the map is relatively robust against error in the classifications, but the differences can be significant at lower levels of aggregation (Rafols and Leydesdorff 2009). As noted, the semantics may be confusing: WoS Subject Categories (WC) replace the old ISI Subject Categories (SC), but the old label SC for Subject Categories is now used in the download for the 151 newly created “Subject Areas” as they are called at the WoS interface. Both in the old and the new situation there are additionally 25 Subject Categories which are used exclusively in the Arts & Humanities Citation Index (A&HCI), and not included in our analysis because A&HCI does not have (hitherto) a Journal Citation Report (Leydesdorff et al. 2011, at pp. 2417 ff.). Table 1 summarizes the changes.
Table 1.
WoS version 4 | WoS version 5 | |
---|---|---|
Subject categories (SCI + SoSCI) | 222 ISI subject categoriesa | 225 WoS subject categories |
Abbreviation in the download | SC | WC |
A&HCI | 25 ISI subject categories | 26 WoS subject categories |
Subject areas | <not defined> | 151 subject areas; abbreviated SC in the download |
One of the 225 WC is no longer in use; this is: “Biology, Miscellaneous”. Only a single journal (Growth Development and Aging) is attributed to this category. Although this journal was cited 520 times in 2010 and is listed with an IF-2010 = 3.000, it is not processed in the Journal Citation Reports (JCR) from the citing side. In summary, we use the matrix of 224 WCs
aThree additional categories were no longer in use (Rafols and Leydesdorff 2009)
Whereas previously 18 factors were found most appropriate for explaining the structure of the aggregated citation matrix, an organization into 19 factors explaining 54.3 % of the variance is most apt for showing the disciplinary structure of the new matrix. The changes are mainly in the organization of mathematics. Rafols et al. (2010, at p. 1876) already noted that “the position of mathematics (all math subject categories) in the map remains open to debate. Since different strands of mathematics are linked to different major fields (medicine, engineering, social sciences), these may show as diverse entities in distant positions, rather than as a unitary corpus, depending on metrics, classifications, and clustering algorithms used.”
In the new overlays, the previous factor of “Computer Science” is divided into two groups and designated (by us) as “Computer Science” and “Mathematical Methods.” Table 2 provides the subject categories organized within these two compartments. The other 17 factors could remain similar to the previous classification (Rafols et al. 2010).
Table 2.
12 WC attributed to “computer science” | 6 WCs attributed to “mathematical methods” |
---|---|
Computer science, hardware and architecture | Computer science, interdisciplinary applications |
Engineering, electrical and electronic | Operations research and management science |
Computer science, artificial intelligence | Mathematics, applied |
Computer science, theory and methods | Statistics and probability |
Computer science, information systems | Engineering, industrial |
Telecommunications | Mathematics |
Automation and control systems | |
Computer science, cybernetics | |
Computer science, software engineering | |
Robotics | |
Imaging science and photographic technology | |
Transportation science and technology |
As with our previous installation, the base map with the relevant initialization routine for Pajek (v. 2.05) is available on the internet. The new base map can also be used as input to VOSviewer (Van Eck and Waltman 2010), but this did not improve the visualization in the case of 19 clusters. (See for more discussion about the differences between the two visualization techniques Leydesdorff and Rafols (2012).) However, the clustering algorithm in VOSviewer distinguished four, in our opinion highly meaningful, groups (Waltman et al. 2010). The organization into these four groups (biomedical, physical, environmental, and social sciences) is now provided as another (optional) partition within the Pajek (.paj) file containing the base map, and is also made available as input to VOSviewer.
The routine allows the user to make an overlay choosing one of the following two routes:
Within the WoS (v. 5.5), one can “Analyze Results” by clicking at the right top of the page with search results; choose “Web of Science Categories” among the ranking options, and export the data into a file “analyze.txt”. This file can be read by the program wc10.exe, and the resulting file “wc10.vec” can be imported into the base map as a vector.1 The visualization (Draw-Partition-Vector within Pajek) then shows the overlay (e.g., Fig. 1).
The routine wc10.exe also generates three so-called “map”-files for VOSviewer: vos4.csv, vos6.csv, and vos19.csv. (The csv-extension makes these text files also readable using Excel.) These files generate maps when read into VOSviewer with four, six, and 19 clusters, respectively (using different colors). The manual of VOSviewer can be consulted for further options (Fig. 2).
A PowerPoint file is provided at http://www.leydesdorff.net/overlaytoolkit/basemaps.pptx which allows the user to place the legends on top of the maps and which provides a few examples; these legends can also be edited. An additional routine enables the user to measure Rao-Stirling diversity as an index of the interdisciplinarity in the samples under study (cf. Carley and Porter 2012; Leydesdorff and Rafols 2011; Porter and Rafols 2009; Rafols et al. 2012; Stirling 2007).
By thus adjusting to the new situation, we hope to have provided the community with a means to map inter- and multidisciplinary sets of documents in future research using the new version of the Web of Science (cf. Leydesdorff and Rafols 2009; Melkers and Hicks 2012; Porter and Youtie 2009; Soós and Kampis 2011). A detailed manual for the mapping was provided (at the website) by Ken Riopelle. Additionally, a macro to transform the file “analyze.txt” into the Gephi format was provided by Clement Levallois.
Acknowledgments
We are grateful to Alan Porter for comments on a previous draft. We acknowledge Thomson Reuters for the use of the data. We acknowledge support from the ESRC grant RES-360-25-0076.
Open Access
This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
Footnotes
The same file “wc10.vec” is automatically generated when using the routine ISI.exe for organizing the saved output of the WoS, as specified at http://www.leydesdorff.net/software/isi.
Contributor Information
Loet Leydesdorff, Email: loet@leydesdorff.net, http://www.leydesdorff.net.
Stephen Carley, Email: stephen.carley@gmail.com.
Ismael Rafols, Email: i.rafols@sussex.ac.uk.
References
- Carley S, Porter AL. A forward diversity index. Scientometrics. 2012;90(2):407–427. doi: 10.1007/s11192-011-0528-1. [DOI] [Google Scholar]
- Leydesdorff L, Hammarfelt B, Salah AAA. The structure of the Arts & Humanities Citation Index: A mapping on the basis of aggregated citations among 1,157 journals. Journal of the American Society for Information Science and Technology. 2011;62(12):2414–2426. doi: 10.1002/asi.21636. [DOI] [Google Scholar]
- Leydesdorff L, Rafols I. A global map of science based on the isi subject categories. Journal of the American Society for Information Science and Technology. 2009;60(2):348–362. doi: 10.1002/asi.20967. [DOI] [Google Scholar]
- Leydesdorff L, Rafols I. How do emerging technologies conquer the world? An exploration of patterns of diffusion and network formation. Journal of the American Society for Information Science and Technology. 2011;62(5):846–860. doi: 10.1002/asi.21509. [DOI] [Google Scholar]
- Leydesdorff L, Rafols I. Interactive overlays: A new method for generating global journal maps from web-of-science data. Journal of Informetrics. 2012;6(3):318–332. doi: 10.1016/j.joi.2011.11.003. [DOI] [Google Scholar]
- Melkers J, Hicks D. Bibliometrics as a tool for research evaluation. In: Link A, Vornatas N, editors. Handbook on the Theory and Practice of Program Evaluation. Cheltenham: Edward Elgar; 2012. [Google Scholar]
- Porter AL, Rafols I. Is science becoming more interdisciplinary? Measuring and mapping six research fields over time. Scientometrics. 2009;81(3):719–745. doi: 10.1007/s11192-008-2197-2. [DOI] [Google Scholar]
- Porter AL, Youtie J. How interdisciplinary is nanotechnology? Journal of Nanoparticle Research. 2009;11(5):1023–1041. doi: 10.1007/s11051-009-9607-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rafols I, Leydesdorff L. Content-based and algorithmic classifications of journals: Perspectives on the dynamics of scientific communication and indexer effects. Journal of the American Society for Information Science and Technology. 2009;60(9):1823–1835. doi: 10.1002/asi.21086. [DOI] [Google Scholar]
- Rafols I, Leydesdorff L, O’Hare A, Nightingale P, Stirling A. How journal rankings can suppress interdisciplinary research: A comparison between innovation studies and business & management. Research Policy. 2012;41(7):1262–1282. doi: 10.1016/j.respol.2012.03.015. [DOI] [Google Scholar]
- Rafols I, Porter A, Leydesdorff L. Science overlay maps: A new tool for research policy and library management. Journal of the American Society for Information Science and Technology. 2010;61(9):1871–1887. doi: 10.1002/asi.21368. [DOI] [Google Scholar]
- Soós S, Kampis G. Towards a typology of research performance diversity: The case of top Hungarian players. Scientometrics. 2011;87:357–371. doi: 10.1007/s11192-011-0351-8. [DOI] [Google Scholar]
- Stirling A. A general framework for analysing diversity in science, technology and society. Journal of the Royal Society, Interface. 2007;4(15):707–719. doi: 10.1098/rsif.2007.0213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. 2010;84(2):523–538. doi: 10.1007/s11192-009-0146-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waltman L, van Eck NJ, Noyons E. A unified approach to mapping and clustering of bibliometric networks. Journal of Informetrics. 2010;4(4):629–635. doi: 10.1016/j.joi.2010.07.002. [DOI] [Google Scholar]