Skip to main content
Dentomaxillofacial Radiology logoLink to Dentomaxillofacial Radiology
. 2013 Sep 19;42(9):20130121. doi: 10.1259/dmfr.20130121

Could standardizing “commercial off-the-shelf” (COTS) monitors to the DICOM part 14: GSDF improve the presentation of dental images? A visual grading characteristics analysis

D J McIlgorm 1,*, C Lawinski 2, S Ng 3, J P McNulty 4
PMCID: PMC3828017  PMID: 23990526

Abstract

Objectives:

To investigate whether standardizing commercial off-the-shelf (COTS) display devices to the digital imaging and communications in medicine part 14: greyscale standard display function (DICOM part 14: GSDF) would affect the presentation of dental images.

Methods:

Two COTS display devices from the radiology department of a dental teaching hospital and a laptop computer monitor for reference were calibrated to conform to DICOM part 14: GSDF. Four dental surgeons and two final-year students undertook a relative visual grading analysis of the two devices before and after calibration, under control of the viewing environment.

Results:

Calibrating COTS display devices to conform to the DICOM part 14: GSDF and viewing under reduced ambient light result in a consistent, perceived visual sensation for the presented radiological image. The area under the visual grading characteristics curve (AUCVGC) before calibration is 0.62 CI (0.56, 0.68) and AUCVGC after calibration is 0.51 CI (0.45, 0.57).

Conclusions:

Standardizing COTS display devices to the DICOM part 14: GSDF can improve image presentation.

Keywords: calibration, greyscale, luminance

Introduction

The digital imaging and communications in medicine (DICOM) part 14: greyscale standard display function (GSDF) is a standard of greyscale uniformity that has been specified by the American College of Radiologists and the National Electrical Manufacturers Association for display systems.1 The luminance requirements that meet this standard for radiological applications are based on complex psychovisual experiments that centre on luminance measurements and their relationship to perceived brightness.2 The adaptive phenomenon of the human visual system means that one’s sensitivity is increased to small brightness variations when the area of interest is surrounded by bright elements. Barten2 investigated these brightness variations and the DICOM part 14: GSDF was defined, with the distance between two luminances that the human eye could just about detect called just noticeable differences (JNDs).

For most display devices when the pixel values contained within the processed image file are presented as digital driving levels (DDLs) and then subsequently converted to luminance levels by the display controller, the displayed image can take on a “uniqueness” for that particular display device. If the digital driving levels are transformed by a function defined by the DICOM part 14: GSDF, then each unit change in image value, regardless of its magnitude, would correspond to a change in the luminance value linearly related to the perceptual contrast threshold of the human eye. When this is achieved the display device is said to be “perceptually linearised”.3

The American Association of Physicists in Medicine Task Group 18 (AAPM TG18)4 issued standard guidelines that addresses the issue of performance evaluation for medical display devices that are used for diagnostic and clinical reading purposes, and this includes a recommendation that such display devices should be standardized to the DICOM part 14: GSDF.1 AAPM TG184 also recommends assessing and scoring anatomical images for the visibility and quality of detail presented on the display device.

This investigation uses a visual grading characteristics (VGC) analysis to test the validity of this recommendation.5 VGC analysis draws upon the methods associated with preference studies, image criteria studies, visual grading analysis (VGA) and receiver operating characteristic (ROC) analyses.6,7 VGC analysis is characterized by its simplicity and discriminating power and allows subjective opinions to be quantified and analysed. Comparing the images produced by two test modalities, observers grade the visibility of the structure on the test image relative to that of a reference image using an arbitrary step scale (much worse to much better), with the middle value of the scale meaning a visibility equal to the reference image.5,6 The VGA results can then be used to characterize the difference between the two modalities in the same way as the difference in the response of the observers to the signal and noise distributions are used to characterize the observers in an ROC study.5 It is an observer performance method that is quick and straightforward to conduct, and relevant radiographic anatomy or pathology may meritoriously be used for the image analysis.5 The objective of this investigation was to use a VGC analysis to determine whether standardizing the luminance response for a display device would have an effect on the presentation of clinical images.

Materials and methods

Ethical approval (BDM/11/12-16), as well as informed consent from the participating observers, was obtained prior to this investigation.

Selection of monitors

Six commercial off-the-shelf (COTS) liquid crystal display (LCD) monitors in the radiology department of the Dublin Dental University Hospital, Dublin, Ireland, with 8-bit capability display controllers and set to a resolution of 1024 × 768 pixels had their characteristic luminance response curves transformed by Verilum® v. 5.02 (Image Smiths Inc., Bethesda, MD) software to conform to the DICOM part 14: GSDF luminance response.8 It was then possible to switch between the characteristic luminance response and the DICOM part 14: GSDF luminance response for each monitor. The software is also used as a quality assurance tool and provides information on how well a monitor conformed to the DICOM part 14: GSDF luminance response transformation.8 The monitor that had undergone the best transformation to the DICOM part 14: GSDF luminance response was selected, IBM ThinkVision L171 (IBM, Armonk, NY), as well as the monitor that had undergone the poorest transformation, NEC Multisync 1550 V (NEC, Tokyo, Japan). Table 1 presents technical information and conformance information for all six monitors. The null hypothesis was then tested for the following four situations.

  • There is no difference in the overall perceived visual sensation between the characteristic luminance response of the NEC monitor and when it is transformed to the DICOM part 14: GSDF luminance response.

  • There is no difference in the overall perceived visual sensation between the characteristic luminance response of the IBM monitor and when it is transformed to the DICOM part 14: GSDF luminance response.

  • There is no difference in the overall perceived visual sensation between the IBM monitor and the NEC monitor when both are using their own characteristic luminance responses.

  • There is no difference in the overall perceived visual sensation between the IBM monitor and the NEC monitor when both are transformed to the DICOM part 14: GSDF luminance response.

Table 1.

Technical specifications and digital imaging and communications in medicine part 14: greyscale standard display function (DICOM part 14: GSDF) conformance data provided by the Verilum8 software application (Image Smiths, Inc., Bethesda, MD) for the six commercial off-the-shelf (COTS) liquid crystal display (LCD) monitors in the radiology department and for the laptop monitor used to display the reference image

Monitor Product type Year of manufacture Max luminance (cd/m2) Screen size (inches) JND per luminance intervala Minimuma Maximuma Root mean square errorb
IBM Thinkvision L171 (Armonk, NY) LCD 2005 67.85 17 1.27 1.18 1.36 0.051
HP 1740 (Palo Alto, CA) LCD 2006 74.35 17 1.44 1.33 1.55 0.062
Philips Brilliance 150P2 (Amsterdam, Netherlands) LCD 2001 127.11 15 1.64 1.51 1.82 0.077
HP 1740 (Palo Alto, CA) LCD 2006 51.91 17 1.55 1.39 1.72 0.081
Philips Brilliance 150P2 (Amsterdam, Netherlands) LCD 2001 128.10 15 1.66 1.47 1.87 0.119
NEC Multisync 1550 V (Minato, Tokyo) LCD 2002 63.16 15 1.04 0.68 1.20 0.124
ACER emachines E525 series. (New Taipei City, Taiwan) (laptop monitor for reference image) LCD 2009 192.20 15 1.79 1.65 1.90 0.063
a

These measurements are an indicator of how well the just noticeable differences (JNDs) of the DICOM part 14: GSDF have been linearly mapped to the digital driving levels of the monitor. Large differences between the maximum and minimum values would suggest inconsistency.

b

The smaller the root mean square error the more closely the monitor approximates the DICOM part 14: GSDF and would give a more consistent perceptual response. It shows how well the monitor has been perceptually linearized.

To perform the tests, the method described by Geijer et al9 was used, whereby the visibility of a target structure in a presented image on each of the two selected monitors was compared with the corresponding structure presented on a monitor displaying a reference image, and the observer would give a score for each of the selected monitors. The laptop monitor that was used to present the reference image was an ACER emachines e525s-902G25Mi (ACER Inc., New Taipei City, Taiwan), which was set to a resolution of 1024 × 768 pixels and also had its characteristic luminance response curve transformed to the DICOM part 14: GSDF by the Verilum software application. This monitor had a maximum luminance that would be classed as primary (reporting or diagnostic) by AAPM TG 18.4,10 Table 1 also presents the technical information and the conformance information for this laptop monitor.

Evaluation of images

4 clinical supervisors (registered dentists with a mean of 22 years of experience interpreting dental radiographs) and 2 final year dental students (each with 3 years' experience interpreting dental radiographs) carried out the VGA.

At the first period of VGA, each monitor including the laptop monitor to display the reference image was using its own characteristic luminance response (before calibration), and at the second period of VGA, after an interval of 6 weeks, all three monitors were using the DICOM part 14: GSDF luminance response (after calibration). 15 anonymous dental images for teaching dental radiology were used for the VGA. The 8-bit JPEG images with horizontal and vertical resolution of 96 dots per inch were displayed on a Windows® (Microsoft Corporation, Redmond, WA) viewing application in slide show mode. The image files were displayed as they were saved, having been optimized to facilitate their interpretation during the teaching module. They consisted of nine intraoral images, four panoramic images, one upper standard occlusal image and one lateral cephalometric image. The participants were asked not to alter the display of the images, which were scaled to fit the screen of the monitors. They were asked to view the images at 90° to the screen at a distance of between 30 cm (close inspection) and 60 cm (normal viewing).11 The ambient lighting was set to between 25 lux and 40 lux and was measured with a calibrated Luxi photometer (Unfors Instruments, Billdal, Sweden) set to illuminance mode.12 The participants were given no information about any of the parameters of the monitors, except that the three devices were going to be technically adjusted after the first period of VGA. At each period of VGA, the participants were given information on two pre-defined criteria relating to target structures within each image, which were designed by the investigators.

The participants were instructed to score the display quality of the pre-defined criteria of the target structure on the test image relative to the corresponding landmark on the reference image using the following scoring system.

  1. Test image is clearly superior to the reference image.

  2. Test image is somewhat superior to the reference image.

  3. Test image is equal to the reference image.

  4. Test image is somewhat inferior to the reference image.

  5. Test image is clearly inferior to the reference image.

Figures 1 and 2 are examples of images that were used.

Figure 1.

Figure 1

One of the intraoral images used in the visual grading analysis

Figure 2.

Figure 2

One of the panoramic images used in the visual grading analysis

Criteria for Figure 1. It is a periapical radiograph of the 36.

  1. A low-density radio-opaque area is visible between the roots of 36 above the clearly defined radiolucent area.

  2. A low-density radio-opaque band (soft tissue) is well demarcated from the higher density radio-opacity (mandibular bone) in the 37 region.

Criteria for Figure 2. It is a panoramic radiograph of an 8-year-old child.

  1. The position of the radiolucency of the right mental foramen can be accurately located.

  2. The complete radio-opaque outline of both 38 and 48 tooth crypts are visible.

Analysis

The VGA scores were analysed in a manner similar to that used in ROC analysis.5 A constructed VGC curve described the relationship between the proportions of fulfilled image criteria for the two compared monitors, taking into consideration all the possible thresholds for a fulfilled criterion. The area under the VGC curve (AUCVGC) was used as a single measure of the difference in image quality between the two compared monitors. An AUCVGC value greater than 0.5 would indicate that one of the compared monitors offered a preferred image quality over the other and an AUCVGC value of 0.5 would indicate that the image quality offered by the two monitors was similar in preference.

Results

The collected data consisted of 180 observations for each of the two monitors (IBM and NEC) when using their own characteristic luminance response (before calibration), and 180 observations for each of the two monitors after they had been transformed to the DICOM part 14: GSDF luminance response (after calibration). Based on the specific criteria and observers used in this investigation, the NEC monitor offered a preferred perceived visual sensation for the displayed image after it was calibrated to the DICOM part 14: GSDF (Figure 3), AUCVGC = 0.73 confidence interval (CI) (0.67, 0.79). The IBM monitor also offered a preferred perceived visual sensation for the displayed image after it was calibrated to the DICOM part 14: GSDF (Figure 4), AUCVGC= 0.63 CI (0.57, 0.69).

Figure 3.

Figure 3

The 95% confidence interval of the VGC curve (AUCVGC = 0.73) does not cover 0.5. The NEC monitor (NEC Multisync® 1550 V; NEC, Tokyo, Japan) presents a preferred image quality after calibration. AUCVGC, area under the VGC curve; CI, confidence interval; Std Dev, standard deviation; VGA, visual grading analysis; VGC, visual grading characteristics

Figure 4.

Figure 4

The 95% confidence interval of the VGC curve (AUCVGC = 0.63) does not cover 0.5. The IBM monitor (IBM ThinkVision L171; IBM, Armonk, NY) presents a preferred image quality after calibration. AUCVGC, area under the VGC curve; CI, confidence interval; Std Dev, standard deviation; VGA, visual grading analysis; VGC, visual grading characteristics

The IBM monitor offered a preferred perceived visual sensation for the displayed image over that of the NEC monitor before either of them was calibrated to the DICOM part 14: GSDF (Figure 5), AUCVGC = 0.62 CI (0.56, 0.68). One possible explanation may be that the display controller system for the IBM monitor is more flexible at using its DDLs or repeating DDL values to give a characteristic luminance response that is preferred by the observer, and it seems this flexibility might be further improved by the calibration process (Figure 4). Another explanation may simply be the age of the NEC monitor compared to the IBM monitor.

Figure 5.

Figure 5

The 95% confidence interval of the VGC curve (AUCVGC = 0.62) does not cover 0.5. The IBM monitor (IBM ThinkVision L171; IBM, Armonk, NY) presents a preferred image quality. AUCVGC, area under the VGC curve; CI, confidence interval; Std Dev, standard deviation; VGA, visual grading analysis; VGC, visual grading characteristics

However, both the NEC monitor and the IBM monitor showed a similar preferred perceived visual sensation for the displayed image after both had been calibrated to the DICOM part 14: GSDF (Figure 6), AUCVGC = 0.51 CI (0.45, 0.57). These results suggest that the modified luminance values after calibration for both these COTS monitors have resulted in an overall preferred perceived visual sensation for the observers.

Figure 6.

Figure 6

The 95% confidence interval of the VGC curve (AUCVGC = 0.51) does cover 0.5. The IBM monitor (IBM ThinkVision L171; IBM, Armonk, NY) and the NEC monitor (NEC Multisync® 1550 V; NEC, Tokyo, Japan) present an equally preferred image quality after both monitors have been calibrated to the DICOM part 14: GSDF. AUCVGC, area under the VGC curve; CI, confidence interval; Std Dev, standard deviation; VGA, visual grading analysis; VGC, visual grading characteristics

Discussion

For a COTS monitor using its own characteristic luminance response, there is a tendency that the shades are emphasized in the low luminance range and compressed in the middle and high luminance ranges. With the DICOM part 14: GSDF calibration process, a DDL is selected to produce a luminance value nearest to the desired luminance on the DICOM part 14: GSDF curve, so that for a DICOM part 14: GSDF transformed monitor, differences between any two shades should be perceived as the same degree of change in luminance in the greyscale displayed.1 The concept of “perceptual linearization” attributed to the DICOM part 14: GSDF calibration process makes sure that the mapping from the image data space of the grey-scale processed image to the human observers visual sensory space accurately transmits changes in intensities in the image to the human observer.3 The practical significance is that distortions in the transferred clinical information between the presented processed image data and the perceived visual sensation are minimized. This investigation has shown that an image presented on a DICOM part 14: GSDF calibrated monitor is preferred by the clinical observer. Whether calibrating a COTS monitor to the DICOM part 14: GSDF has any affect on clinical diagnosis, particularly with the availability of image processing tools that can enhance the presentation of the image for diagnostic purposes, may require further investigation, and a recent dental study found that DICOM part 14: GSDF calibrated display devices made no significant difference to the accuracy of approximal caries lesion diagnosis.13,14 This investigation has also shown that it is possible to obtain a similar image presentation on different COTS monitors by adopting a standardization protocol. “Perceptually linearizing” monitors used for dental radiological applications, in conjunction with maintaining a reduced ambient light level,15,16 can result in a consistent perceived visual sensation for the displayed image that is preferred by the observer.

The DICOM part 14: GSDF standardization process should therefore allow for a consistency in image presentation across various display devices, although this consistency may not be fully guaranteed.17 As more imaging modalities within dental radiology become increasingly computerized along with a resulting increase in soft-copy display, there will be a requirement for standardized display devices. Given that there has been some speculation in the dental literature of possible litigation consequences arising from not adhering to a standardization protocol, dental practitioners may require education in the area of optimization and calibration of dental display devices.18,19 It is only in Germany where a legal requirement exists in relation to acceptance testing and quality assurance for display devices, whereas in other jurisdictions such a specific mandatory requirement would not be evident, even though standards and guidelines are available.4,20 In this investigation, the best possible displayed luminance values after calibration and consequently the maximum number of JNDs that matched the contrast sensitivity of the human visual system were preferred by the observers for the tested monitors, even if the maximum luminance levels for these 8 bit input to 8 bit output COTS monitors would be considered low. This investigation has also shown that perceptually linearizing a COTS monitor using the DICOM part 14: GSDF standardization process allows a quantitative characterization of the display system to be obtained.3,8 The quantitative information from the calibration process provided a better description of the display system than just the luminance range of the monitor, and this information could assist practitioners in choosing display system designs for clinical applications. One may ask the question, how many JNDs are enough within the luminance range of the monitors that are being used for dental radiology, and could they affect clinical diagnosis?

To address such a question, it may be necessary to investigate image quality from those display devices supporting 10 and 12 bit output from 8 bit input of video memory, or from display controllers that can provide 10 bit input and 10 bit output, since the outcome for clinical applications would also be dependent on the bit depth of the display controller, which provides the discrete samples in the DDL range, as well as the luminance range.21

Although it is widely accepted that ROC studies using a “gold standard” are generally considered the most reliable method for evaluating the diagnostic value of medical imaging modalities and techniques, visual grading studies are relatively straightforward to conduct and clinical images are often readily available and could provide a useful additional image quality evaluation method in the dental radiology assessment toolkit. The concept behind a VGA is that if the practitioner knows the anatomical landmarks or appearance of a pathological lesion, and is presented with a high-quality image, then he/she would be confident to make a diagnosis. The benefit of visual grading is that it is not limited to a specific diagnosis, while the disadvantage is that it is a subjective method of evaluating the quality of an image, and not a measure of the ability to make the correct diagnosis. VGC analysis is used as a measure of the difference in image quality between modalities and is not an absolute value of subjective image quality. In addition, the measurement of other factors that could potentially influence the grading may not be possible with this type of assessment. In the absence of any well-defined criteria for evaluating dental image quality, it would be necessary to design one’s own criteria for these type of studies based on diagnostically relevant anatomical structures and features, and one should not underestimate the ability of any clinician to recognize anatomical features in order that a diagnosis could be reached.22

In conclusion, display systems can influence the contrast resolution requirements for clinical applications, standardizing COTS display devices to the DICOM part 14: GSDF can improve the presentation of dental radiological images.

References

  • 1.National Electrical Manufacturers Association (NEMA) Digital imaging and communications in medicine (DICOM) part 14: greyscale standard display function. Rosslyn, VA: NEMA Publishing; 2006.
  • 2.Barten PGJ. Contrast sensitivity of the human eye and its effects on image quality. Bellingham, WA: SPIE Optical Engineering Press; 1999 [Google Scholar]
  • 3.Bradley M, Hemminger R, Johnston RE, Rolland JP, Muller KE. Introduction to perceptual linearization of video display systems for medical image presentation. J Digit Imaging 1995; 8: 21–34 [DOI] [PubMed] [Google Scholar]
  • 4.Samei E, Badano A, Chakraborty D, Compton K, Cornelius C, Corrigan K, et al. Assessment of display performance for medical imaging systems, report of the American Association of Physicists in Medicine (AAPM) Task Group 18. AAPM online report no. 03. Madison, WI: Medical Physics Publishing; 2005 [DOI] [PubMed] [Google Scholar]
  • 5.Båth M, Månsson LG. Visual grading characteristics (VGC) analysis: a non-parametric rank-invariant statistical method for image quality evaluation. Br J Radiol 2007; 80: 169–176 10.1259/bjr/35012658 [DOI] [PubMed] [Google Scholar]
  • 6.Månsson LG. Methods for the evaluation of image quality: a review. Radiat Prot Dos 2000; 90: 89–99 [Google Scholar]
  • 7.van Erkel AR, Pattynama PM. Reciever operating characteristic (ROC) analysis: basic principles and applications in radiology. Eur J Radiol 1998; 27: 88–94 [DOI] [PubMed] [Google Scholar]
  • 8.Verilum. 5.02.006 Users guide. Bethesda, MD: Image Smiths Inc, 1999. [Google Scholar]
  • 9.Geijer H, Geijer M, Forsberg L, Kheddache S, Sund P. Comparison of color LCD and medical-grade monochrome LCD displays in diagnostic radiology. J Digit Imaging 2007; 20: 114–121 10.1007/s10278-007-9028-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Brettle DS. Display considerations for hospital-wide viewing of soft copy images. Br J Radiol 2007; 80: 503–507 [DOI] [PubMed] [Google Scholar]
  • 11.Samei E. AAPM/RSNA physics tutorial for residents: technological and psychophysical considerations for digital mammographic displays. Radiographics 2005; 25: 491–501 10.1148/rg.252045185 [DOI] [PubMed] [Google Scholar]
  • 12.Brennan P, McEntee M, Evanoff M, Philips P, O’Connor WT, Manning DJ. Ambient lighting: effect of illumination on soft-copy viewing of radiographs of the wrist. AJR 2007; 188: W177–W180 10.2214/AJR.05.2048 [DOI] [PubMed] [Google Scholar]
  • 13.Haak R, Wicht MJ, Nowak G, Hellmich M. Influence of displayed image size on radiographic detection of approximal caries. Dentomaxillofac Radiol 2003; 32: 242–246 [DOI] [PubMed] [Google Scholar]
  • 14.Hellén-Halme K, Nilsson M, Petersson A. Effect of monitors on approximal caries detection in digital radiographs—standard versus precalibrated DICOM part 14 displays: an in vitro study. Oral Surg Oral Med Oral Pathol Oral Radiol Endod 2009; 107: 716–720 10.1016/j.tripleo.2008.12.011 [DOI] [PubMed] [Google Scholar]
  • 15.Haak R, Wicht MJ, Hellmich M, Nowak G, Noack MJ. Influence of room lighting on grey-scale perception with a CRT-and a TFT monitor display. Dentomaxillofac Radiol 2002; 31: 193–197 10.1038/sj/dmfr/4600668 [DOI] [PubMed] [Google Scholar]
  • 16.Robson KJ, Kotre CJ. Pilot study into optimisation of viewing conditions for electronically displayed images. Radiat Prot Dosimetry 2005; 117: 298–303 10.1093/rpd/nci744 [DOI] [PubMed] [Google Scholar]
  • 17.Lowe JM, Brennan PC, Evanoff MG, McEntee MF. Variations in performance of LCDs are still evident after DICOM gray-scale standard display calibration. AJR Am J Roentgenol 2010; 195: 181–187 10.2214/AJR.09.2851 [DOI] [PubMed] [Google Scholar]
  • 18.MacDonald-Jankowski DS, Orpe EC. Some current legal issues that may affect oral and maxillofacial radiology. Part 2: digital monitors and cone-beam computed tomography. J Can Dent Assoc 2007; 73: 507–511 [PubMed] [Google Scholar]
  • 19.Hellén-Halme K, Petersson A, Warfvinge G, Nilsson M. Effect of ambient light and monitor brightness and contrast settings on the detection of approximal caries in digital radiographs: an in vitro study. DentomaxillofacRadiol 2008; 37: 380–384 10.1259/dmfr/26038913 [DOI] [PubMed] [Google Scholar]
  • 20.Institute of Physics and Engineering in Medicine Recommended standards for the routine performance testing of diagnostic X-ray imaging equipment. Report 91 York, UK: IPEM; 2006 [Google Scholar]
  • 21.Hemminger BM, Blume HR. Are medical image display systems perceptually optimal? Measurements before and after perceptual linearization. Proceedings of medical imaging 1996: image capture, formatting and display. Vol. 2707–58. Bellingham, WA: SPIE Press; 1996 [Google Scholar]
  • 22.EUR 16260 European guidelines on quality criteria for diagnostic radiographic images. Luxembourg, Belgium: Office for official publications of the European Communities; 1996 [Google Scholar]

Articles from Dentomaxillofacial Radiology are provided here courtesy of Oxford University Press

RESOURCES