Skip to main content
Data in Brief logoLink to Data in Brief
. 2019 Dec 20;28:105035. doi: 10.1016/j.dib.2019.105035

Code and data on the processing of the pulsed-field gel electrophoresis images: A matlab script

Ahmad Mohammadbeigi a,b, Mohammad Rezaei c,d,, Naser Zohourian Sani a,f, Nemat Soltani a, Parviz Mohajeri e
PMCID: PMC6940661  PMID: 31909128

Abstract

Here a matlab script was presented for lane tracking and band detection on the pulsed field gel electrophoresis (PFGE) images. It can also be used as a software tool for automatic analysis of PFGE images. The data consist of several MATLAB codes which collectively have the task of lane tracking, band detecting and pattern recognition on the PFGE images. The lane tracking stage is semi-automatic and the band detection stage is fully automatic. Finally, the pattern of lanes that includes number of, location, width and light intensity level of bands was obtained.

Keywords: Band detection, Pattern recognition, Pulsed-field gel electrophoresis, Image processing, Matlab script


Specifications Table

Subject Biomedical Engineering
Specific subject area image processing in microbiology and biotechnology
Type of data MATLAB code, image, video
How data were acquired All source codes written in Matlab software.
Data format MATLAB code, JPEG, Mp4
Parameters for data collection All the codes were implemented in MATLAB-R2009a on a system with Intel Core - i5 2430M, quad-core processor overclocked at 3.2 GHz with 8GB of RAM clocked at 1600 MHz. A trial version of GelCompar II version 6.6.11 was used to evaluate and optimize the codes.
Description of data collection The images were captured using PFGE BIORAD at the Microbiology Laboratory of Kermanshah University of Medical Sciences. The images were provided by two types of bacteria, including Acineto-AF, Staphylococcus aureus.
Data source location Institution: Department of Biomedical Engineering in Kermanshah University of Medical Science
City/Town/Region: Kermanshah
Country: Iran
Latitude and longitude: 34°23′27.9″N 47°06′07.7″E
Data accessibility
  • -

    with the article

  • -

    The dataset is freely available at [1] for any academic, educational, and research purposes.

Repository name: Mendeley Data
Data identification number: https://doi.org/10.17632/mcnfncf25t.1
Direct URL to data: https://data.mendeley.com/datasets/mcnfncf25t/1
Related research article Author's name: Mohammad Rezaei, Mahmood Amiri, Parviz Mohajeri, Mansour Rezaei
Title: A new algorithm for lane detection and tracking on pulsed field gel electrophoresis images
Journal: Chemometrics and Intelligent Laboratory Systems
DOI: 10.1016/j.chemolab.2016.05.018
Value of the Data
  • The provided codes can be used to pulsed-field gel electrophoresis image analysis.

  • The Matlab script will allow microbiologist to molecular subtyping.

  • This approach can be used to automatic lane tracking, band detection and pattern recognization on PFGE images.

1. Data

The data consist of several MATLAB codes, which collectively have the task of lane tracking, band detecting and pattern recognition on the pulsed-field gel electrophoresis images. The PFGE is a laboratory technique used by researchers and scientists to produce a DNA fingerprint for a bacterial isolate as a group of the same type of bacteria [[2], [3], [4], [5]]. The images were provided by two types of bacteria, including Acinetobacter [6], Staphylococcus aureus [7] which were attached with the article. All the codes and data needed for this purpose available in the Mendeley data source [1]. In addition, there is a video showing the performance of the data and Matlab script. The flowchart of implemented algorithm as Matlab script was shown in Fig. 1.

Fig. 1.

Fig. 1

The flowchart of the proposed algorithm. It composed of three phases:1- Lane detection 2- Noise reduction and band extraction 3- Pattern recognition.

2. Experimental design, materials, and methods

The material used in this dataset includes images and codes. The images were collected using PFGE BIORAD at the Microbiology Laboratory of Kermanshah University of Medical Sciences in “tiff” format. To analyse the images, MATLAB-R2009a [8] on a system with Intel Core - i5 2430M, quad-core processor overclocked at 3.2 GHz with 8GB of RAM clocked at 1600 MHz was used. To evaluate and optimize the codes, a trial version of GelCompar II software was also used.

2.1. Algorithm for the lane tracking

A program, “lane_tracking.m”, is responsible for lane tracking. First, image was converted to gray-scale from RGB format. The desired area of the image including lanes was cropped then was resized to 500 × 500 pixels. Next, image segmented to sub-images to calculate vertical projection profile (VPP). To detect local of each lane in the sub-images, local maxima from signal of VPP was detected. Using the matlab codes, “widthfind.m” and “Remov_locmax.m”, false-positive local maxima in the background was removed. After detecting the center of lanes in each of sub-images, the centers associated with each lane are given to the function “createFit.m” to fit the patch of lane. In detail, the proposed algorithm is described in Ref. [9]. Fig. 2 depicts the lane tracking.

Fig. 2.

Fig. 2

A PFGE image with lanes which tracked using the matlab script “lane_tracking.m”: The red lines demonstrate lanes and the green stars are center of the lanes.

2.2. Algorithm for the band detection

The band detection as one of the stages consists of two phases. First, a matlab code, “bandremovebackgroundm.m”, is responsible for removing background noise from lanes image. To this end, the lane image was smoothed by an adaptive median filter on lane images. Then, as a completion step, lower envelope of VPP of lane was subtracted. The source code for this function is given bellow.

Image 1

In addition, the program “band.m” is responsible for band detection. To this end, after resizing it to 100 × 500 pixels and calculating the VPP of lane, the location of the bands was detected using the function “peakfind.m”. The source code for this step is as follows. Finally, the pattern of lanes was determined by calculating the four parameters, including number of location, width and light intensity level of bands.

Image 2

Acknowledgments

The authors gratefully acknowledge the Research Council of Kermanshah University of Medical Sciences.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.dib.2019.105035.

Conflict of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A. Supplementary data

The following are the Supplementary data to this article:

Multimedia component 1
mmc1.zip (2.5MB, zip)
Multimedia component 2
mmc2.zip (2.3MB, zip)
Multimedia component 3
mmc3.xml (1KB, xml)

References

  • 1.Rezaei M. Mendeley Data; 2019. Matlab Scripts for Lane Tracking and Band Detection on the Pulsed-Field Gel Electrophoresis Images; p. v1. [Google Scholar]
  • 2.Jeon S., Lim N., Park S., Park M., Kim S. Comparison of PFGE, IS6110-RFLP, and 24-Locus MIRU-VNTR for molecular epidemiologic typing of Mycobacterium tuberculosis isolates with known epidemic connections. J. Microbiol. Biotechnol. 2018;28:338–346. doi: 10.4014/jmb.1704.04042. [DOI] [PubMed] [Google Scholar]
  • 3.Alkharsah K.R., Rehman S., Alnimr A., Diab A., Hawwari A., Tokajian S. Molecular typing of MRSA isolates by spa and PFGE. J. King Saud Univ. Sci. 2018;31:999–1004. [Google Scholar]
  • 4.Sharma-Kuinkel B.K., Rude T.H., Fowler V.G. Pulse field gel electrophoresis. Methods Mol. Biol. 2016;1373:117–130. doi: 10.1007/7651_2014_191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lopez-Canovas L., Martinez Benitez M.B., Herrera Isidron J.A., Flores Soto E. Pulsed field gel electrophoresis: past, present, and future. Anal. Biochem. 2019;573:17–29. doi: 10.1016/j.ab.2019.02.020. [DOI] [PubMed] [Google Scholar]
  • 6.Munoz-Price L.S., Weinstein R.A. Acinetobacter infection. N. Engl. J. Med. 2008;358:1271–1281. doi: 10.1056/NEJMra070741. [DOI] [PubMed] [Google Scholar]
  • 7.Stijn Blot R.N., Vandewoude K., Colardyn F. Staphylococcus aureus infections. N. Engl. J. Med. 1998;339:520–532. doi: 10.1056/NEJM199812313392716. [DOI] [PubMed] [Google Scholar]
  • 8.The Mathworks Inc MATLAB - MathWorks, 2016-11-26. 2016. Www.Mathworks.Com/Products/Matlab
  • 9.Rezaei M., Amiri M., Mohajeri P., Rezaei M. A new algorithm for lane detection and tracking on pulsed field gel electrophoresis images. Chemometr. Intell. Lab. Syst. 2016;157:1–6. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.zip (2.5MB, zip)
Multimedia component 2
mmc2.zip (2.3MB, zip)
Multimedia component 3
mmc3.xml (1KB, xml)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES