2024 Jul 24;11:816. doi: 10.1038/s41597-024-03655-9

Table 2.

Description of folder structure and files in the produced and publicly accessible dataset [20].

File name | Description
--- | ---
ReadMe.md | Description file in Markdown format.
scopus_202*_V1_scholarlymigration_country_enriched.csv | Country-level yearly dataset on international emigration, immigration, net migration rates and other variables based on Scopus.
scopus_202*_V1_scholarlymigration_countryflows_enriched.csv | Country-level yearly “flow” dataset on international emigration, immigration, net migration rates and other variables based on Scopus.
openalex_202*_V1_scholarlymigration_country_enriched.csv | Country-level yearly dataset on international emigration, immigration, net migration rates and other variables based on OpenAlex.
openalex_202*_V1_scholarlymigration_countryflows_enriched.csv | Country-level yearly “flow” dataset on international emigration, immigration, net migration rates and other variables based on OpenAlex.
01_prepare_enrich_data.py | Source code (Python >=3.9) for downloading World Bank data and merging it with Scopus and OpenAlex data.
02_merge_openalex_and_scopus.py | Source code (Python >=3.9) for merging Scopus and OpenAlex data.
03_plotting.py | Source code (Python >=3.9) for plotting Figures 2–6.
04_compare_share_of_mobile_researchers.py | Source code (Python >=3.9) for comparing Scopus, ORCID, and OpenAlex data. Note that raw data at the individual author level are not shared because of Scopus license limitations. ORCID and OpenAlex data are publicly available.
FIGURES\ | Folder with plotted figures in PDF format. Produced by script 03.
data_input\ | Folder with input data (aggregated migration events of scholars).
data_processed\ | Folder with processed/enriched data. Produced by scripts 01 and 02.

For convenience, we have included the dataset in both CSV and Parquet formats while maintaining the same file names.
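As an illustration of how the naming pattern in Table 2 might be used, the following minimal Python sketch locates one of the four data files and loads it. It is not part of the published scripts: the helper names `find_dataset_file` and `load_dataset`, the `prefer_parquet` option, and the use of pandas are assumptions made for this example, and the folder that holds the files must be supplied by the user.

```python
from pathlib import Path


def find_dataset_file(folder, source, kind, prefer_parquet=True):
    """Return the path of one data file matching the naming pattern in Table 2.

    source: "scopus" or "openalex"
    kind:   "country" or "countryflows"
    (Hypothetical helper, not part of the published scripts.)
    """
    # The "202*" wildcard is kept exactly as it appears in the file names of Table 2.
    stem = f"{source}_202*_V1_scholarlymigration_{kind}_enriched"
    extensions = [".parquet", ".csv"] if prefer_parquet else [".csv", ".parquet"]
    for ext in extensions:
        matches = sorted(Path(folder).glob(stem + ext))
        if matches:
            # If several files match, take the last one in sort order.
            return matches[-1]
    raise FileNotFoundError(f"No file matching {stem}(.parquet|.csv) in {folder}")


def load_dataset(folder, source, kind):
    """Load the matching file into a pandas DataFrame (Parquet if present, else CSV)."""
    # Imported here so that path resolution above works without pandas installed.
    import pandas as pd

    path = find_dataset_file(folder, source, kind)
    if path.suffix == ".parquet":
        # Reading Parquet requires an engine such as pyarrow to be installed.
        return pd.read_parquet(path)
    return pd.read_csv(path)
```

For example, `load_dataset("path/to/dataset", "openalex", "countryflows")` would return the OpenAlex flow table, where `path/to/dataset` is a placeholder for wherever the files were downloaded. Because the CSV and Parquet copies share the same file names, only the extension decides which reader is used.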