Skip to main content
Data in Brief logoLink to Data in Brief
. 2018 Dec 14;25:103590. doi: 10.1016/j.dib.2018.12.029

Data on security requirements in open-source software projects

Wentao Wang a, Kavya Reddy Mahakala a, Arushi Gupta a, Nesrin Hussein a, Yinglin Wang b
PMCID: PMC6609732  PMID: 31312694

Abstract

The datasets presented in this article are related the research paper entitled “A Linear Classifier Approach for Identifying Security Requirements in Open Source Software Development” Wang et al. (2018) [1]. This article describes requirements collected from three open-source software (OSS) projects and labels of security requirements. The datasets are made available to support automated security requirements analyzing tools development as well as tools’ evaluation.

Keywords: Security requirements, Open-source software


Specifications Table

Subject area Computer: Software
More specific subject area Requirements Engineering
Type of data Figures, spreadsheets, and text file
How data was acquired Extracted by using REST API and manually analysis
Data format Filtered and analyzed
Experimental factors Two researchers collected requirements from three open-source software projects, systematically analyzing all requirements, and manually labeled all security requirements.
Experimental features The labels of security requirements were provided by manually analysis
Data source location University of Cincinnati, Cincinnati, OH, USA
Data accessibility Data is with this article.
The original data are stored in projects’ issue tracking systems, following are URLs of those issue tracking systems:
AXIS2:https://issues.apache.org/jira/projects/AXIS2/
Drools:https://issues.jboss.org/projects/DROOLS/
GeoServer:https://osgeo-org.atlassian.net/projects/GEOS/
Related research article W. Wang, K. R. Mahakala, A. Gupta, and Y. Wang, “A Linear Classifier Approach for Identifying Security Requirements in Open Source Software Development,” Journal of Industrial Information Integration, Accepted: Nov. 2018.

Value of the data

  • The goal of publishing our datasets is to support automated security assessment tools building as well as evaluation. Our datasets can be used in following research topics.

  • Security requirements identification/classification tool building: Our datasets record security requirements from three OSS projects as well as non-security requirements. They can be used to help conduct textual information based automated tools for security requirements identification.

  • Security requirements tracing/identification approach evaluation: We label security requirements in three OSS projects. This labeled data can be used to evaluate different security requirements tracing and identification approaches.

1. Data

There are three datasets provided with this article: Axis2, Drools, and GeoServer. In each project, we have a spreadsheet called “Requirements.xlsx” which records all requirements in this project. For each requirement, we record its id, title, descriptions, and an URL which link to its original data saved in issue tracking systems. We also saved all security requirements of this dataset in a text file named as “SecReq.txt”. Whether a requirement is a security requirement or not is judged by two security experts. We briefly describe the data collecting and judging process in the next section.

See also Fig. 1, Fig. 2.

Fig. 1.

Fig. 1

Subset of requirements from AXIS2.

Fig. 2.

Fig. 2

Security requirements’ ID from AXIS2.

2. Experimental design, materials and methods

Original requirements of three OSS projects are saved in projects’ JIRA issue trace systems. JIRA REST API [2] is used to download requirements to the local database. Fig. 1 presents a subset of requirements we collected from AXIS2. For each requirement, four types of information are recorded in our datasets. The first column “JIRA_KEY” records requirements’ IDs which can be used to search and trace security requirements in the issue track system. Columns “Title” and “Description” parts record textual information of each requirement. We also provide URL of each requirement in which researchers can collect more information about this requirement for their own studies.

Two researchers with strong security background (i.e., they have at least one year working experience on the security requirements), then, systematically judge whether a specific requirement is a security requirement. If both of them have the same judgment (i.e., this requirement is a security requirement), attribute “JIRA_KEY” of this requirement is saved in the result TXT file. Otherwise, a joint meeting will be conducted to make final decisions. According to Cohen׳s Kappa analysis, two analysts achieved almost perfect agreement. Fig. 2 presents a sub-list of security requirements in AXIS2. The full lists of security requirements in three OSS projects can be found in Supplementary material files.

Funding source

The work is funded in part by the U.S. National Science Foundation (Award CCF 1350487) and the National Natural Science Foundation of China (Fund No. 61375053).

Footnotes

Transparency document

Transparency data associated with this article can be found in the online version at https://doi.org/10.1016/j.dib.2018.12.029.

Appendix A

Supplementary data associated with this article can be found in the online version at https://doi.org/10.1016/j.dib.2018.12.029.

Contributor Information

Wentao Wang, Email: wang2wt@mail.uc.edu.

Kavya Reddy Mahakala, Email: mahakaky@mail.uc.edu.

Arushi Gupta, Email: gupta2ai@mail.uc.edu.

Nesrin Hussein, Email: husseinm@mail.uc.edu.

Yinglin Wang, Email: wang.yinglin@shufe.edu.cn.

Transparency document. Supplementary material

Supplementary material

mmc1.pdf (1.2MB, pdf)

.

Appendix A. Supplementary material

Supplementary material

mmc2.xlsx (873.5KB, xlsx)

.

Supplementary material

mmc3.zip (453B, zip)

.

Supplementary material

mmc4.xlsx (78.6KB, xlsx)

.

Supplementary material

mmc5.zip (263B, zip)

.

Supplementary material

mmc6.xlsx (1.2MB, xlsx)

.

Supplementary material

mmc7.zip (1.7KB, zip)

.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.pdf (1.2MB, pdf)

Supplementary material

mmc2.xlsx (873.5KB, xlsx)

Supplementary material

mmc3.zip (453B, zip)

Supplementary material

mmc4.xlsx (78.6KB, xlsx)

Supplementary material

mmc5.zip (263B, zip)

Supplementary material

mmc6.xlsx (1.2MB, xlsx)

Supplementary material

mmc7.zip (1.7KB, zip)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES