Skip to main content
Journal of General Internal Medicine logoLink to Journal of General Internal Medicine
. 2010 Mar 27;25(8):865–872. doi: 10.1007/s11606-010-1327-8

Narrative Review: Should Teaching of the Respiratory Physical Examination Be Restricted Only to Signs with Proven Reliability and Validity?

Jochanan Benbassat 1,, Reuben Baumal 2
PMCID: PMC2896600  PMID: 20349154

Abstract

OBJECTIVE

To review the reported reliability (reproducibility, inter-examiner agreement) and validity (sensitivity, specificity and likelihood ratios) of respiratory physical examination (PE) signs, and suggest an approach to teaching these signs to medical students.

METHODS

Review of the literature. We searched Paper Chase between 1966 and June 2009 to identify and evaluate published studies on the diagnostic accuracy of respiratory PE signs.

RESULTS

Most studies have reported low to fair reliability and sensitivity values. However, some studies have found high specificites for selected PE signs. None of the studies that we reviewed adhered to all of the STARD criteria for reporting diagnostic accuracy.

CONCLUSIONS

Possible flaws in study designs may have led to underestimates of the observed diagnostic accuracy of respiratory PE signs. The reported poor reliabilities may have been due to differences in the PE skills of the participating examiners, while the sensitivities may have been confounded by variations in the severity of the diseases of the participating patients.

IMPLICATION FOR PRACTICE AND MEDICAL EDUCATION

Pending the results of properly controlled studies, the reported poor reliability and sensitivity of most respiratory PE signs do not necessarily detract from their clinical utility. Therefore, we believe that a meticulously performed respiratory PE, which aims to explore a diagnostic hypothesis, as opposed to a PE that aims to detect a disease in an asymptomatic person, remains a cornerstone of clinical practice. We propose teaching the respiratory PE signs according to their importance, beginning with signs of life-threatening conditions and those that have been reported to have a high specificity, and ending with signs that are "nice to know," but are no longer employed because of the availability of more easily performed tests.

KEY WORDS: respiratory physical examination, diagnostic accuracy, medical students

INTRODUCTION

Despite the evidence that supports the role of the physical examination (PE) in the assessment of a patient's disease1,2, there is considerable controversy as to whether the PE has outlived its usefulness. Some suggest that imaging may provide a more direct view into the body and prevent errors3. Others argue that the value of ancillary testing is overrated4; that it cannot replace a physician's ability to recognize familiar patterns of disease5,6; and that the failure to perceive the importance of the PE is due to poor teaching and learning of basic clinical skills58.

This later criticism has led to major changes in the way the skills of patient interviewing9,10 and diagnostic reasoning11,12 are taught. However, teaching of the PE has remained unchanged. From the 1950s13 to the present time12, textbooks of the PE continue to offer a comprehensive and unselective compilation of signs, which include those no longer considered to be useful14, while often excluding important PE signs15. Some medical schools do encourage students to form diagnostic hypotheses early on while listening to the patient’s narrative and conduct a targeted PE aimed at confirming or refuting these hypotheses8. However, we suspect that even in these schools, this educational approach is not reinforced during the subsequent clinical undergraduate training. During their clerkship and internship, students are expected to perform a complete PE along a predetermined sequence, which begins with the patient’s appearance and vital signs, and moves on to an examination from head to toes. Consequently, students often perform hasty PEs with frequent shortcuts that are unlikely to detect physical findings. This may explain the observed decline of students' breast examination skills over the course of their training16 and students’ reliance on “hard” laboratory and imaging data rather than on the clinical assessment of their patients4.

We believe that, in order to enhance students' appreciation of the PE, its teaching should focus on selected and important PE skills8 rather than overwhelm the learners with an all-inclusive list of signs. One possible way to discriminate between important and less important components of the PE would be to select for signs with proven diagnostic accuracy. Indeed, the call by Sackett and Rennie17 for a rational approach to the clinical examination triggered a series of publications (e.g.,18), reviews (e.g.,14) and a textbook19 dealing with the evidence base of the PE. Most of them emphasized the need for additional studies of the reliability and validity of PE signs. However, we know of no suggestions for reconciling the teaching of the PE with the paucity of evidence on the diagnostic accuracy of most PE signs.

The objective of this paper is to update the presently available evidence, or lack thereof, for the reliability and validity of PE signs, and suggest a modified approach to their teaching to undergraduate medical students. We chose the PE of the respiratory system because its importance has been debated ever since the advent of chest radiography20.

METHODS

We used Paper Chase21 to search Medline and Old Medline between 1966 and June 2009. An effort was made to identify all original published studies of the diagnostic accuracy of specific respiratory PE signs. We excluded reviews and studies of the diagnostic value of combinations of PE signs. First, we used the terms ['physical examination'] and ['respiratory' or 'pulmonary diseases' or 'lung diseases']. Of the 392 hits, only 9 were original studies of the reliability and validity of respiratory PE signs. Second, we used the terms ['physical examination' or 'auscultation' or 'percussion'] and ['pneumonia' or 'pleural effusion' or 'airway obstruction' or 'pneumothorax' or 'asthma' or 'pulmonary embolism'] and obtained 701 hits, which included 13 additional studies. Finally, we searched the reference sections of all relevant studies and identified another 38, some of which were published before 1966. After excluding 17 studies of the validity and 3 studies of the reliability of respiratory signs in children and infants, we were left with a total of 40 studies: 13 of the reliability2234, 20 of the validity3554 and 7 of both5561.

Other search strategies, such as those using the terms ['physical examination'] and ['diagnostic accuracy'], or ['physical examination'] and ['sensitivity' or 'specificity' or 'reliability'] and [respiratory], did not identify additional studies. Our failure to capture most studies of the diagnostic accuracy of PE signs using conventional literature searches may have been due to less than optimal indexing.

Studies of diagnostic accuracy are subject to various sources of bias that may result from their design, selection of patients, performance of the test and analysis of data. The recognition that the quality of reporting of these studies is often deficient led to the development of a 25-item list of Standards for Reporting of Diagnostic Accuracy (STARD)62. Similar to the experts who developed this list, we realize that the methodology for designing and conducting studies of diagnostic accuracy requires further development. However, at present, these criteria are the best available measure of the value of such studies, including those of the diagnostic accuracy of the PE63. Therefore, we chose to evaluate the studies included in the present review by the degree of their adherence to the STARD check list.

We reviewed all 40 publications and tabulated (1) the degree of their adherence to the STARD criteria62, (2) the reliability of the various respiratory PE signs and (3) their validity for detecting defined disorders. We use the term "reliability" interchangeably with the reproducibility of the findings that are obtained when the same PE is repeated on the same patient by the same or different examiners. Most commonly, reliability was reported as kappa statistics on a -1 (complete disagreement) to 0 (chance agreement) to +1 (perfect agreement) scale. Values between 0 and 0.4 are commonly accepted as indicating low agreement, those between 0.4 and 0.6, fair agreement; between 0.6 and 0.8, good agreement; between 0.8 and 1, perfect agreement64. Less frequently, reliability was presented as agreement rates between examiners (e.g.,22), coefficients of correlation (e.g.,29) or the "standard deviation agreement index" (e.g.,25), which is, similar to the kappa statistics, a measure of inter-examiner agreement beyond the one expected by chance. "Validity" refers to the ability of a PE sign to discriminate between patients with and without the disease under consideration, and it is expressed as sensitivity and specificity relative to an agreed upon gold standard. The phrase "diagnostic accuracy," as used here, refers to the contribution of a given PE sign to establishing a diagnosis and to its usefulness in clinical practice.

RESULTS

  1. Adherence to the STARD criteria for diagnostic accuracy

Most of the reviewed studies were published before the development of the STARD criteria in 2003, and none of them adhered to all of these criteria. Most studies complied with the following criteria: they were prospective (33/40), provided an explicit or implicit definition of the study objectives (37/40) and data on the study populations (39/40), methods of patient recruitment (27/40) and methods of presentation of results (38/40). All validity studies presented their findings relative to a gold standard of diagnosis. In most studies, the test results were dichotomous (PE signs present or absent); in some studies, such as those of respiratory rates, the results were presented as above or below cutoff values. The examiners in 16 of the 20 reliability studies were blinded to the findings of other examiners, and those in 20 of the 27 validity studies were blinded to the results of the gold standard (data not shown in table format).

However, only few of the reviewed studies adhered to the following criteria: only 2 of the 20 reliability studies, and 4 of the 27 validity studies reported data on the severity of disease of the participating patients; only 10 reliability studies and 14 validity studies reported on attempts to use standardized PE procedures in order to enhance the consistency of the examiners' PE techniques; only 1 reliability study and 4 validity studies presented data on the number of qualifying patients who did not participate in the study; only 7 validity studies reported the reliability of the various PE signs (data not shown in table format).

  1. Reliability (reproducibility) of respiratory PE signs (Tables 1, 2)

Table 1.

Reliability of Respiratory Physical Examination Signs Elicited by Inspection, Palpation and Percussion

Sign Inter-examiner reliabilitya Reference
Respiratory distress (use of accessory muscle/intercostal retractions) "31% disagreement" 22
"26% disagreement" 26
b 25,28
Cyanosis "18% disagreement" 26
0.36 33
b 25
Deformities of thorax/barrel chest and spinal column/kyphosis "29% disagreement" 22
"16% disagreement" 24
"24% disagreement" 26
b 25
Respiratory rate 0.25 33
b 25
Chest movements/expansion (decreased/asymetric) "22% disagreement" 24
"19% disagreement" 26
0.14 31
0.38 33
0.85 61
b 25
Clubbing "Considerable disagreement" 23
"8% disagreement" 24
"Almost total agreement" 25
0.45 33
Crico-sternal distance 0.28 33
Position of trachea 0.01 33
b 25
Vocal fremitus (reduced) 0.24 33
0.86 61
b 25
Dullness on percussion 0.34 31
0.16 to 0.46c 32
"3% disagreement" 56
0.52 33
0.84 61
b 25
Hyperresonance on percussion "33% disagreement" 22
"23% disagreement" 26
0.26 31
0.25 to 0.43c 32
0.50 33
b 25,28
Diaphragmatic expansion "20% disagreement" 26
"60% agreement" 30
0.05 59
Cardiac dullness (impaired/absent) "32% disagreement" 22
"24% disagreement" 26
0.47 59
Auscultatory percussion (hyperresonance) 0.37 59
Auscultatory percussion (reduced) "10% disagreement" 56
-0.04 to 0.45c 60
0.76 61

aUnless stated otherwise, the inter-examiner reliability is expressed as kappa

bInter-examiner reliability was expressed as "standard deviation agreement index," and it was about midway between perfect agreement rates and those expected by chance alone

cRange of reproducibility between multiple pairs of examiners

Table 2.

Reliability of Respiratory Physical Examination Signs Elicited by Auscultation

Sign Inter-examiner reliabilitya Reference
Bronchial breathing 0.19 31
0.32 35
-0.14 to 0.14c 60
Whispering pectoriloquy 0.11 33
Whispering pectoriloquy 0.16 to 0.25c 60
Bronchophony -0.10 to 0.18c 60
Egophony b 25
Breath sound intensity "Differences among examiners–not statistically significant" 55
"29% disagreement" 22,26
Coefficient of correlation: 0.58 29
Coefficient of correlation: 0.97 29
0.16 31
0.23 to 0.43c 32
0.43 33
0.38 59
0.42 58
0.89 61
b 25,28
Rales/crackles/crepitations 0.36 31
0.32 58
0.30 to 0.63c 32
0.41 33
0.24 to 0.65c 60
0.67 61
b 25,28
Rhonchi 0.55 31
-0.06 to 0.18c 60
b 25
Vocal resonance (diminished) 0.78 61
b 25
Vocal resonance (increased) b 25
Wheezes 0.54 to 0.70c 32
0.51 33
0.50 59
0.43 58
0.85 34
-0.05 to 0.65c 60
b 28
Wheezing at maximal forced exhalation "90% agreement" 57
Expiratory phase (prolonged) 0.53 31
0.35 33
Pleural friction rub -0.02 31
0.51 33
-0.02 61

aUnless stated otherwise, the inter-examiner reliability is expressed as kappa

bInter-examiner reliability expressed as "standard deviation agreement index," and it was about midway between perfect agreement rates and those expected by chance alone

cRange of reproducibility between multiple pairs of examiners

The only study of the intra-examiner reproducibility that we know of found that the examiners disagreed with themselves in 11–26% of the cases and that pulmonary specialists were significantly less self-consistent than medical students32. Of the 20 reliability studies, only 4 studies32,34,60,61 reported inter-examiner agreement rates above k = 0.6 for one or more PE signs. Another five studies24,25,29,56,57 reported more than 90% or "almost total" inter-examiner agreement for at least one PE sign.

The following PE signs were reported by some, but not most studies to have reliabilities of kappa = 0.6–1.0 or disagreement rates of 10% or less: chest movements, clubbing, vocal fremitus, dullness on percussion and reduced auscultatory percussion (Table 1), breath sound intensity, crepitations, vocal resonance (diminished) and wheezes (Table 2). Low to fair inter-examiner agreement rates, i.e., kappa = 0.0–0.6, were consistently reported in one or more studies for the following PE signs: cyanosis, respiratory rate, crico-sternal distance, deformities of the thorax, respiratory distress, position of trachea, hyperresonance on percussion, diaphragmatic expansion and cardiac dullness (Table 1); bronchial breathing, pectoriloquy, bronchophony, egophony, rhonchi, vocal resonance (increased), prolonged expiratory phase and pleural friction rub (Table 2).

  1. Sensitivity, specificity and likelihood ratios of respiratory PE signs for defined diseases (Tables 3, 4, 5)

Table 3.

Sensitivity, Specificity and Likelihood Ratios of Respiratory PE Signs Elicited by Inspection, Palpation and Percussion

Sign Diagnosis Gold standard Sensitiviy Specificity LR positivea LR negativea Reference
Laryngeal height <4 cm OADb Spirometry 0.24 0.76 1.0 1.0 47
Clubbing Lung cancer Histology 0.10 - - - 45
Hoover's sign (inward motion of the lower lateral rib cage with inspiration) OADb - Mild FEV1 >0.8 normal 0.00 - - - 51
- Moderate 0.5–0.8 0.36
- Severe 0.3–0.5 0.43
- Very severe <0.3 0.76
Cyanosis Pneumonia Chest x-ray 0.15 0.97 5.0 0.9 54
Heart rate >100/min Pneumonia Chest x-ray 0.17–0.50 0.62–0.92 1.0–2.5 0.6–0.9 38,39,41,48,52,54
>120/min 0.18 0.96 4.5 0.8 53
Heart rate >100/min Pulmonary emboli Pulmonary angio 0.58 - - - 37
Respiratory rate >20/min Pneumonia Chest x-ray 0.45,0.90 0.31,0.79 1.3, 2.1 0.3, 0.7 52,54
>25 0.07,0.28 0.92, 0.99 3.5, 7.0 0.8,0.9 38,53
>30 0.29–0.41 0.72–0.88 1.5–2.6 0.8–0.9 39,41,48,54
>40 0.05 0.97 1.7 1.0 39
Respiratory rate >20/min Hemo-pneumothorax, after trauma- blunt Chest x-ray 0.43 1.00 0.6 49
- Penetrating 0.32 0.99 32.0 0.7
Respiratory rate >20/min Pulmonary emboli Pulmonary angio 0.85 - - - 37
Pulsus paradoxus Bronchial asthma Spirometry 0.31 1.00 0.7 36d
Asymetric expansion of the chest Pneumonia Chest x-ray 0.04 1.00 1.0 38
Asymetric expansion of the chest Pleural effusion Chest x-ray 0.74 0.91 8.2 0.3 61
Vocal fremitus (increased) Pneumonia Chest x-ray 0.08 0.99 8.0 0.9 38
Vocal fremitus (reduced) Pleural effusion Chest x-ray 0.76 0.88 6.3 0.3 61
Dullness on percussion Pneumonia Chest x-ray 0.04 0.99 4.0 1.0 38
0.14 0,96 3.5 0.9 43
0.26 0.93 3.7 0.8 41
0.13 0.94 2.2 0.9 39
Dullness on percussion Any chest x-ray lesion Chest x-ray 0.15 0.97 5.0 0.9 40
0.08–0.09 0.98–0.99 4.0–9.0 0.9–0.9 56c
Dullness on percussion Pleural effusion Chest x-ray 0.89 0.81 4.7 0.1 61
Diaphragm expansion OADb Spirometry 0.37 0.74 1.4 0.9 59
Cardiac dullness (absent) OADb Spirometry 0.26 0.96 6.5 0.8 59
Auscultatory percussion (reduced) Any chest x-ray lesion Chest x-ray 0.10–0.15 0.88–0.88 0.8–1.3 1.0–1.0 56c
0.19 0.85 1.3 1.0 40
Auscultatory percussion (reduced) Pneumonia Chest x-ray 0.00–0.50 0.88–0.97 2.0–12.5 0.5–1.0 60c
Auscultatory percussion (reduced) Pleural effusion Chest x-ray 0.96 1.0 0.04 44
0.58 0.85 3.9 0.5 61
Auscultatory percussion (hyperesonant) OADb Spirometry 0.42 0.86 3.0 0.7 59

aLikelihood ratio positive = sensitivity/(1-specificity); likelihood ratio negative = (1-sensitivity)/specificity

bOAD = obstructive airway disease

cRange of data obtained by more than one examiner

dDerived from data on a total of 25 patients presented in figure form

Table 4.

Sensitivity, Specificity and Likelihood Ratios of Respiratory PE Signs Elicited by Auscultation

Sign Diagnosis Gold standard Sensitivity Specificity LR positivea LR negativea Reference
Forced expiratory time >6 s OADb Spirometry 0.92 0.43 1.6 0.2 42
Bronchial breathing Pneumonia Chest x-ray 0.11–0.63 0.64–0.94 1.1–3.3 0.5–1.0 60c
0.14 0.96 3.5 0.9 41
Chest CT 0.08 1.0 0.9 50
Bronchophony Pneumonia Chest x-ray 0.11–0.69 0.61–0.96 1.2–12.5 0.5–1.0 60c
0.04 0.99 4.0 1.0 54
Egophony Pneumonia Chest x-ray 0.13–0.54 0.61–0.98 1.0–12.5 0.8–1.0 60c
0.16 0.97 5.3 0.9 41
0.08 0.96 2.0 1.0 39
0.04 1.0 1.0 38
Breath sounds (diminished) Pleural effusion Chest CT 0.42 0.90 4.2 0.6 50
Chest x-ray 0.88 0.83 5.2 0.1 61
Breath sounds (diminished) OADb Spirometry 0.57 0.88 4.8 0.5 59
0.29 0.85 1.9 0.8 58
d - - - 55
Breath sounds (diminished) Pneumonia Chest x-ray 0.15 0.95 3.0 0.9 43
0.49 0.80 2.5 0.6 41
0.34 0.85 2.3 0.8 39
Breath sounds (diminished) Hemo-pneumothorax, after trauma-blunt Chest x-ray 1.00 1.00 0.0 49
- penetrating 0.50 1.00 0.5
Rales/crackles/crepitations Pneumonia Chest x-ray 0.54–1.0 0.39–0.79 1.2–4.8 0.5–0.8 60c
0.59 0.70 2.0 0.6 54
0.50 0.81 2.6 0.6 41
0.36 0.77 1.6 0.8 39
0.19 0.93 2.7 0.9 38
0.67 0.44 1.2 0.8 48
0.26 - - - 35
Rales/crackles/crepitations OADb Spirometry 0.01 0.99 1.0 1.0 58
Rales/crackles/crepitations Ventilator-associated pneumonia Histological evidence 0.73 0.43 1.3 0.6 46
Rales/crackles/crepitations Pulmonary emboli Pulmonary angio 0.56 - - - 37
Rhonchi Pneumonia Chest x-ray 0.26 0.82 1.4 0.9 41
0.34 0.78 1.5 0.8 39
0.15 0.90 1.5 0.9 38
Reduced vocal resonance Pleural effusion Chest x-ray 0.76 0.88 6.3 0.3 61
Wheezes Pneumonia Chest x-ray 0.22 0.75 0.9 1.0 39
0.22 0.71 0.8 1.1 48
Wheezes OADb Spirometry 0.26 0.74 1.0 1.0 47
0.06 0.99 6.0 0.9 59
0.14 0.99 14.0 0.9 58
Wheezes at maximal forced exhalation Asthma Metacholine challenge 0.57 0.37 0.9 1.2 57
Pleural friction rub Pleural effusion Chest x-ray 0.05 0.99 5.0 0.7 61
Pleural friction rub Pneumonia Chest x-ray 0.17 0.96 4.3 0.9 39
0.10 0.98 5.0 0.9 43
0.04 0.99 4.0 1.0 38
Accentuated P-2 heart sound Pulmonary emboli Pulmonary angio 0.57 - - - 37

aLikelihood ratio positive = sensitivity/(1-specificity); likelihood ratio negative = (1-sensitivity)/specificity

bOAD = Obstructive airway disease

cRange of data obtained by more than one examiners

d"Significant correlation between breath sound intensity scores and spirometry"

Table 5.

Clinical Contexts and Respiratory Physical Examination Signs with Probable Diagnostic Value

Setting and context Physical examination sign LR positivea LR negativea Diagnosis Reference
cOPD: Patients with acute cough Asymetric expansion of the chest 1.0 Pneumonia 38
Vocal fremitus (increased) 8.0 0.9
Dullness on percussion 4.0 1.0
Egophony 1.0
Pleural friction rub 4.0 1.0
bER: Fever and acute respiratory symptoms Cyanosis 5.0 0.9 Pneumonia 54
Heart rate >120/min 4.5 0.8 53
Dullness on percussion 2.2–3.7 0.8–0.9 39,41,43
Bronchial breathing 1.1–3.5 0.5–1.0 41,60
Bronchophony 1.2–12.5 0.5–1.0 54,60
Egophony 1.0–12.5 0.8–1.0 41,60
Pleural friction rub 4.3,5.0 0.9 39,43
dICU: Acute respiratory distress syndrome Bronchial breathing 0.9 Pneumonia 50
cOPD: Patients with acute cough Asymetric expansion of the chest 8.2 0.3 Pleural effusion 38
Inpatients with respiratory symptoms Vocal fremitus (diminished) 6.3 0.3 Pleural effusion 61
Dullness on percussion 4.7 0.1
Breath sounds (diminished) 5.2 0.1
Pleural friction rub 5.0 0.7
Vocal resonance (diminished) 6.3 0.3
dICU: Acute respiratory distress syndrome Breath sounds (diminished) 4.2 0.6 Pleural effusion 50
cOPD: screening of plumbers Forced expiratory time >6 s 1.6 0.2 OADe 42
cOPD: self-reported history of smoking, asthma, chronic bronchitis Wheezes 6.0 0.9 OADe 59
Cardiac dullness (absent) 6.5 0.8
cOPD for preoperative evaluation Wheezes 14.0 0.9 OADe 58
bER: Chest trauma – blunt Respiratory rate >20/min 0.6 Hemo-pneumothorax 49
Breath sounds (diminished) 0.0
bER: Chest trauma - penetrating Respiratory rate >20/min 32.0 0.7 Hemo-pneumothorax 49
Breath sounds (diminished) 0.5

aLikelihood ratio positive = sensitivity/(1-specificity); likelihood ratio negative = (1-sensitivity)/specificity

bER = Emergency room

cOPD = outpatient department

dICU = Intensive care unit

eOAD= Obstructive airway disease

The vast majority of studies found sensitivity values of 0.5 or less (Tables 3, 4). Sensitivity values with likelihood ratios- negative (LR-) of 0.2 or less were reported only for dullness on percussion in diagnosing pleural effusion among inpatients with respiratory symptoms (Tables 3, 5), forced expiratory time of more than 6 s duration for obstructive airway disease (OAD) in asymptomatic plumbers screened for lung diseases and for diminished breath sounds in diagnosing hemo-pneumothorax in the context of trauma (Tables 4, 5).

On the other hand, high specificity values with LR+ of 4.0 or more have been reported for PE signs, such as those of pulmonary consolidation, in as many as 17 of the 27 validity studies (Tables 3, 4). Table 5 lists the settings and clinical contexts in which respiratory PE signs may be useful for increasing or reducing the post-test odds of a specific diagnosis.

DISCUSSION

Two main findings emerge from the present review. First, most studies have found low to fair reliability values for respiratory PE signs. This finding detracts from the credibility of the validity studies and is consistent with the view that "clinical skills textbooks fail evidence-based examination"65. Second, since none of the reviewed studies complied with all of the STARD criteria62, their design may have been flawed. Consequently, the reported reliability (Tables 1, 2) and sensitivity (Tables 3, 4) values should be interpreted with caution.

A finding of a poor reliability (i.e., high examiner variability) of a PE sign may indicate either that it has a poor diagnostic accuracy or that some of the examiners had deficient PE skills. This latter possibility is consistent with the reported lack of improvement, or even deterioration, of respiratory6,32, breast16 and cardiac66 PE skills with seniority and experience. The possibility that examiners differed in their PE skills is also suggested by the reported variability in the approach to the respiratory PE of 403 members of the British Thoracic Society7. A lack of adherence to appropriate PE technique also explains the low reliability found in studies of tachypnea25,33: examiners appear to rely on their subjective impression rather than on counting67. On the other hand, an adherence to technique may explain the high reliabilities of respiratory PE signs that have been reported by some authors (e.g.,61).

The reported low to moderate sensitivity of respiratory PE signs may have been similarly due to deficient examination skills: an examiner with poor skills is likely to miss a PE finding. Alternatively, the reported sensitivities may have been confounded by differences in the severity of the diseases of the examined patients. Indeed, it has been reported that the sensitivity of reduced breath sounds55, pulsus paradoxus36 and Hoover's sign (an inward motion of the lower lateral rib cage with inspiration)51 increases with the severity of airway obstruction.

These two main flaws in the design of most studies, namely their failure to control for disease severity and examiners' skills, have probably led to underestimates of the reported reliability, sensitivity and specificity values. Future studies may avoid these biases by adhering to the STARD requirements. However, pending the publication of properly controlled studies, the reliability and validity of most respiratory PE signs remain uncertain. In 1986, Mulrow et al.32 concluded that "despite their routine use, most physical examination techniques, including pulmonary auscultation and percussion, are poorly standardized and of uncertain [diagnostic] value." This conclusion is also pertinent today.

The uncertain sensitivity of the respiratory PE argues against its utility for screening of asymptomatic persons. Screening for disease requires that the test used be highly sensitive, well above 0.7, which is not the case for most respiratory PE signs (Tables 3, 4). However, their low or uncertain reliability and validity do not preclude their usefulness in patients with suspected respiratory diseases for two reasons. First, the observed reliabilities and sensitivities may have been confounded by flaws in the study design. Second, the possible bias produced by these flaws would be toward underestimating the diagnostic accuracy of the various respiratory PE signs. Therefore, the high specificities and high LRs+ reported for some of these signs appear to be credible and to indicate that they may be useful in specific clinical contexts. For example, assuming that the pretest probability of pneumonia in outpatients with acute cough is 10% (odds 1:9)38, a finding of asymetric expansion of the chest would increase the odds to 8:9, i.e., increase the post-test probability to 47%. Assuming a pretest probability of pneumonia of 12–30% (odds 1:9–3:7) among emergency room patients with fever and acute respiratory symptoms41, a finding of pleural friction rub would increase the odds of pneumonia to 5:9–15:7 or to a post-test probability of 36–68%. Future studies should explore whether the various respiratory PE signs provide independent information. In other words, it is at present uncertain whether a combination of PE signs (e.g., of pulmonary consolidation such as dullness on percussion, bronchial breathing and egophony) allows the multiplication of each of their LRs in order to assess the post-test probability of pneumonia.

Therefore, we believe that a meticulously performed respiratory PE, which aims to explore a diagnostic hypothesis, as opposed to a PE that aims to detect a disease in an asymptomatic person, remains a cornerstone of clinical practice. We propose that teaching of the PE should not discriminate among respiratory signs according to their presently uncertain reliabilities and sensitivities, but rather according to the importance of the disease under consideration and to their specificity.

The most important PE signs are, first, those of life-threatening conditions in any clinical context. For example, a patient, who presents with any degree of respiratory abnormality (tachypnea, bradypnea, apnea, labored breathing, stridor, accessory muscle recruitment or paradoxical breathing) is in respiratory distress. Its detection mandates immediate treatment with oxygen and a sustained effort to establish the cause by looking for stridor (croup, epiglottitis), wheezes (bronchial asthma, bronchitis), reduced breath sounds and changes in percussion note (pneumothorax or pleural effusion) and for signs suggesting pulmonary emboli. Second, we believe that teaching should emphasize the respiratory signs that have been reported to have high positive or low negative LRs (Table 5).

At the other end of the spectrum, the least important respiratory PE signs are those that are no longer employed in clinical practice because of the availability of more easily performed ancillary tests. For example, hand-held spirometry provides an easier and more precise assessment of obstructive airway disease than Hoover's sign and pulsus paradoxus. Spirometry may also alert physicians to the possibility of mild pulmonary disorders, and it may be used for monitoring patients with conditions such as asthma and cystic fibrosis. Similarly, pulse oxymetry may detect reduced blood oxygenation at earlier stages than central cyanosis68. Therefore, we join the calls to incorporate pulse oximetry and spirometry into the PE, and add hand-held oximeters and spirometers to the stethoscope, sphygmomanometer and reflex hammer that a doctor already uses during patient examination69.

Acknowledgments

Conflict of Interest None disclosed.

References

  • 1.Hampton JR, Harrison MJ, Mitchell JR, Prichard JS, Seymour C. Relative contributions of history-taking, physical examination, and laboratory investigation to diagnosis and management of medical outpatients. Br Med J. 1975;2:486–9. doi: 10.1136/bmj.2.5969.486. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Reilly BM. Physical examination in the care of medical inpatients: an observational study. Lancet. 2003;362:1100–05. doi: 10.1016/S0140-6736(03)14464-9. [DOI] [PubMed] [Google Scholar]
  • 3.Todd IK. A thorough pulmonary exam and other myths. Acad Med. 2000;75:50–1. doi: 10.1097/00001888-200001000-00011. [DOI] [PubMed] [Google Scholar]
  • 4.Halkin A, Reichman J, Schwaber M, Paltiel O, Brezis M. Likelihood ratios: getting diagnostic testing into perspective. QJM. 1998;91:247–58. doi: 10.1093/qjmed/91.4.247. [DOI] [PubMed] [Google Scholar]
  • 5.Feddock CA. The lost art of clinical skills. Am J Med. 2007;120:374–378. doi: 10.1016/j.amjmed.2007.01.023. [DOI] [PubMed] [Google Scholar]
  • 6.Bradding P, Cookson JB. The dos and don'ts of examining the respiratory system: a survey of British Thoracic Society members. J R Soc Med. 1999;92:632–4. doi: 10.1177/014107689909201207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Mangione S, Nieman LZ. Pulmonary auscultatory skills during training in internal medicine and family practice. Am J Respir Crit Care Med. 1999;159:1119–24. doi: 10.1164/ajrccm.159.4.9806083. [DOI] [PubMed] [Google Scholar]
  • 8.Benbassat J, Baumal R, Heyman SN, Brezis M. Viewpoint: suggestions for a shift in teaching clinical skills to medical students: the reflective clinical examination. Acad Med. 2005;80:1121–6. doi: 10.1097/00001888-200512000-00012. [DOI] [PubMed] [Google Scholar]
  • 9.Morgan WL, Engel GL, eds. The Clinical Approach to the Patient. Philadelphia: WB Saunders Co; 1969: 197–204.
  • 10.Hargie O, Dickson D, Boohan M, Hughes K. A survey of interviewing skills training in UK schools of medicine: present practices and prospective proposals. Med Educ. 1998;32:25–34. doi: 10.1046/j.1365-2923.1998.00154.x. [DOI] [PubMed] [Google Scholar]
  • 11.DeGowin EL. Bedside diagnostic examination. 1. New York: Macmillan Co; 1965. [Google Scholar]
  • 12.LeBlond RF, Brown DD, DeGowin RL. DeGowin’s diagnostic examination. 9. New York: McGraw-Hill; 2009. [Google Scholar]
  • 13.Kampmeier RH. Physical examination in health and disease. 2. Philadelphia: FA Davis Co; 1957. [Google Scholar]
  • 14.Joshua AM, Celermajer DS, Stockler MR. Beauty is in the eye of the examiner: reaching agreement about physical signs and their value. Intern Med J. 2005;35:178–87. doi: 10.1111/j.1445-5994.2004.00795.x. [DOI] [PubMed] [Google Scholar]
  • 15.Cook CJ, Smith GB. Do textbooks of clinical examination contain information regarding the assessment of critically ill patients? Resuscitation. 2004;60:129–36. doi: 10.1016/j.resuscitation.2003.09.009. [DOI] [PubMed] [Google Scholar]
  • 16.Lee KC, Dunlop D, Dolan NC. Do clinical breast examination skills improve during medical school? Acad Med. 1998;73:1013–9. doi: 10.1097/00001888-199809000-00025. [DOI] [PubMed] [Google Scholar]
  • 17.Sackett DL, Rennie D. The science of the art of the clinical examination. JAMA. 1992;267:2650–2. doi: 10.1001/jama.267.19.2650. [DOI] [PubMed] [Google Scholar]
  • 18.Metlay JP, Kapoor WN, Fine MJ. Does this patient have community-acquired pneumonia? Diagnosing pneumonia by history and physical examination. JAMA. 1997;278:1440–5. doi: 10.1001/jama.278.17.1440. [DOI] [PubMed] [Google Scholar]
  • 19.McGee S. Evidence based physical diagnosis. 2. Philadelphia: WB Saunders Co; 2007. [Google Scholar]
  • 20.Auld AG. The roentgen rays in the diagnosis of pulmonary disease. Lancet. 1903;162:341–2. doi: 10.1016/S0140-6736(01)34102-8. [DOI] [Google Scholar]
  • 21.Horowitz GL, Bleich HL. PaperChase: a computer program to search the medical literature. New Engl J Med. 1981;305:924–930. doi: 10.1056/NEJM198110153051605. [DOI] [PubMed] [Google Scholar]
  • 22.Fletcher CM. The clinical diagnosis of pulmonary emphysema. Proc R Soc Med. 1952;45:577–84. [PubMed] [Google Scholar]
  • 23.Pyke DA. Finger clubbing. Lancet. 1954;2:352–354. doi: 10.1016/S0140-6736(54)92662-8. [DOI] [PubMed] [Google Scholar]
  • 24.Schilling RS, Hughes JP, Dingwall-Fordyce I. Disagreement between observers in an epidemiological study of respiratory disease. Br Med J. 1955;1:65–8. doi: 10.1136/bmj.1.4905.65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Smyllie HC, Blendis lM, Armitage P. Observer disagreement in physical signs of the respiratory system. Lancet. 1965;2:412–3. doi: 10.1016/S0140-6736(65)90759-2. [DOI] [PubMed] [Google Scholar]
  • 26.Schneider IC, Anderson AE. Correlation of clinical signs with ventilatory function in obstructive lung disease. Ann Intern Med. 1965;62:477–85. doi: 10.7326/0003-4819-62-3-477. [DOI] [PubMed] [Google Scholar]
  • 27.Nairn JR, Turner-Warwick M. Breath sounds in Emphysema. Brit J Dis Chest. 1969;63:29–37. doi: 10.1016/S0007-0971(69)80041-0. [DOI] [PubMed] [Google Scholar]
  • 28.Godfrey S, Edwards RHT, Campbell EJM, Armitage P, Oppenheimer EA. Repeatability of physical signs in airway obstruction. Thorax. 1969;24:4–9. doi: 10.1136/thx.24.1.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Bohadana AB, Peslin R, Uffholtz H. Breath sounds in the clinical assessment of airflow obstruction. Thorax. 1978;33:345–51. doi: 10.1136/thx.33.3.345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Williams TJ, Ahmad D, Morgan WK. A clinical and roentgenographic correlation of diaphragmatic movement. Arch Intern Med. 1981;141:878–80. doi: 10.1001/archinte.141.7.878. [DOI] [PubMed] [Google Scholar]
  • 31.Gjorup T, Bugge PM, Jensen AM. Interobserver variation in assessment of respiratory signs. Physicians' guesses as to interobserver variation. Acta Med Scand. 1984;216:61–6. doi: 10.1111/j.0954-6820.1984.tb03772.x. [DOI] [PubMed] [Google Scholar]
  • 32.Mulrow CD, Dolmatch BL, Delong ER, Feussner JR, Benyunes MC, Dietz JL, Lucas SK, Pisano ED, Svetkey LP, Volpp BD, et al. Observer variability in the pulmonary examination. J Gen Intern Med. 1986;1:364–7. doi: 10.1007/BF02596418. [DOI] [PubMed] [Google Scholar]
  • 33.Spiteri MA, Cook DG, Clarke SW. Reliability of eliciting physical signs in examination of the chest. Lancet. 1988;1:873–5. doi: 10.1016/S0140-6736(88)91613-3. [DOI] [PubMed] [Google Scholar]
  • 34.Purohit A, Bohadana A, Kopferschmitt-Kubler MC, Mahr L, Linder J, Pauli G. Lung auscultation in airway challenge testing. Respir Med. 1997;91:151–7. doi: 10.1016/S0954-6111(97)90051-2. [DOI] [PubMed] [Google Scholar]
  • 35.Osmer JC, Cole BK. The stethoscope and roentgenogram in acute pneumonia. South Med J. 1966;59:75–7. doi: 10.1097/00007611-196601000-00014. [DOI] [PubMed] [Google Scholar]
  • 36.Bilgi C, Jones RL, Sproule BJ. Relation between pulsus paradoxus and pulmonary function in patients with chronic airways obstruction. Can Med Assoc J. 1977;117:1389–92. [PMC free article] [PubMed] [Google Scholar]
  • 37.Stein PD, Willis PW, 3rd, DeMets DL. History and physical examination in acute pulmonary embolism in patients without preexisting cardiac or pulmonary disease. Am J Cardiol. 1981;47:218–23. doi: 10.1016/0002-9149(81)90388-X. [DOI] [PubMed] [Google Scholar]
  • 38.Diehr P, Wood RW, Bushyhead J, Krueger L, Wolcott B, Tompkins RK. Prediction of pneumonia in outpatients with acute cough–a statistical approach. J Chronic Dis. 1984;37:215–25. doi: 10.1016/0021-9681(84)90149-8. [DOI] [PubMed] [Google Scholar]
  • 39.Gennis P, Gallagher J, Falvo C, Baker S, Than W. Clinical criteria for the detection of pneumonia in adults: guidelines for ordering chest roentgenograms in the emergency department. J Emerg Med. 1989;7:263–8. doi: 10.1016/0736-4679(89)90358-2. [DOI] [PubMed] [Google Scholar]
  • 40.Bourke S, Nunes D, Stafford F, Hurley G, Graham I. Percussion of the chest re-visited: a comparison of the diagnostic value of ausculatory and conventional chest percussion. Ir J Med Sci. 1989;158:82–4. doi: 10.1007/BF02942111. [DOI] [PubMed] [Google Scholar]
  • 41.Heckerling PS, Tape TG, Wigton RS, Hissong KK, Leikin JB, Ornato JP, Cameron JL, Racht EM. Clinical prediction rule for pulmonary infiltrates. Ann Intern Med. 1990;113:664–70. doi: 10.7326/0003-4819-113-9-664. [DOI] [PubMed] [Google Scholar]
  • 42.Kern DG, Patel SR. Auscultated forced expiratory time as a clinical and epidemiologic test of airway obstruction. Chest. 1991;100:636–9. doi: 10.1378/chest.100.3.636. [DOI] [PubMed] [Google Scholar]
  • 43.Melbye H, Straume B, Aasebo U, Dale K. Diagnosis of pneumonia in adults in general practice. Relative importance of typical symptoms and abnormal chest signs evaluated against a radiographic reference standard. Scand J Prim Health Care. 1992;10:226–33. doi: 10.3109/02813439209014066. [DOI] [PubMed] [Google Scholar]
  • 44.Guarino JR, Guarino JC. Auscultatory percussion: a simple method to detect pleural effusion. J Gen Intern Med. 1994;9:71–4. doi: 10.1007/BF02600204. [DOI] [PubMed] [Google Scholar]
  • 45.Baughman RP, Gunther KL, Buchsbaum JA, Lower EE. Prevalence of digital clubbing in bronchogenic carcinoma by a new digital index. Clin Exp Rheumatol. 1998;16:21–6. [PubMed] [Google Scholar]
  • 46.Petersen IS, Aru A, Skodt V, Behrendt N, Bols B, Kiss K, Simonsen K. Evaluation of pneumonia diagnosis in intensive care patients. Scand J Infect Dis. 1999;31:299–303. doi: 10.1080/00365549950163617. [DOI] [PubMed] [Google Scholar]
  • 47.Straus SE, McAlister FA, Sackett DL, Deeks JJ. The accuracy of patient history, wheezing, and laryngeal measurements in diagnosing obstructive airway disease. CARE-COAD1 Group. Clinical Assessment of the Reliability of the Examination-Chronic Obstructive Airways Disease. JAMA. 2000;283:1853–7. doi: 10.1001/jama.283.14.1853. [DOI] [PubMed] [Google Scholar]
  • 48.Mehr DR, Binder EF, Kruse RL, Zweig SC, Madsen RW, D'Agostino RB. Clinical findings associated with radiographic neumonia in nursing home residents. J Fam Pract. 2001;50:931–7. [PubMed] [Google Scholar]
  • 49.Bokhari F, Brakenridge S, Nagy K, Roberts R, Smith R, Joseph K, An G, Wiley D, Barrett J. Prospective evaluation of the sensitivity of physical examination in chest trauma. J Trauma. 2002;53:1135–8. doi: 10.1097/00005373-200212000-00017. [DOI] [PubMed] [Google Scholar]
  • 50.Lichtenstein D, Goldstein I, Mourgeon E, Cluzel P, Grenier P, Rouby JJ. Comparative diagnostic performances of auscultation, chest radiography, and lung ultrasonography in acute respiratory distress syndrome. Anesthesiology. 2004;100:9–15. doi: 10.1097/00000542-200401000-00006. [DOI] [PubMed] [Google Scholar]
  • 51.Garcia-Pachon E, Padilla-Navas I. Frequency of Hoover's sign in stable patients with chronic obstructive pulmonary disease. Int J Clin Pract. 2006;60:514–7. doi: 10.1111/j.1368-5031.2006.00850.x. [DOI] [PubMed] [Google Scholar]
  • 52.O'Brien WT, Rohweder DA, Lattin GE, Thornton JA, Dutton JP, Ebert-Long DL, Duncan MD. Clinical indicators of radiographic findings in patients with suspected community-acquired pneumonia: who needs a chest x-ray? J Am Coll Radiol. 2006;3:703–6. doi: 10.1016/j.jacr.2006.02.007. [DOI] [PubMed] [Google Scholar]
  • 53.Nolt BR, Gonzales R, Maselli J, Aagaard E, Camargo CA, Metlay JP. Vital-sign abnormalities as predictors of pneumonia in adults with acute cough illness. Am J Emerg Med. 2007;25:631–6. doi: 10.1016/j.ajem.2006.11.031. [DOI] [PubMed] [Google Scholar]
  • 54.Saldias PF, Cabrera TD, Solminihac LI, Hernandez AP, Gederlini GA, Diaz FA. Predictive value of history and physical examination for the diagnosis of community-acquired pneumonia in adults. Rev Med Chil. 2007;135:143–150. doi: 10.4067/s0034-98872007000200001. [DOI] [PubMed] [Google Scholar]
  • 55.Pardee EN, Martin CJ, Morgan EH. A test odf the practical value of estimating breath sound intensity. Chest. 1976;70:341–344. doi: 10.1378/chest.70.3.341. [DOI] [PubMed] [Google Scholar]
  • 56.Bohadana AB, Coimbra FT, Santiago JR. Detection of lung abnormalities by auscultatory percussion: a comparative study with conventional percussion. Respiration. 1986;50:218–25. doi: 10.1159/000194929. [DOI] [PubMed] [Google Scholar]
  • 57.King DK. Wheezing on maximal forced exhalation in the diagnosis of atypical asthma. Ann Intern Med. 1989;110:451–55. doi: 10.7326/0003-4819-110-6-451. [DOI] [PubMed] [Google Scholar]
  • 58.Holleman DR, Simel DL, Goldberg JS. Diagnosis of obstructive airways disease from the clinical examination. J Gen Intern Med. 1993;8:63–8. doi: 10.1007/BF02599985. [DOI] [PubMed] [Google Scholar]
  • 59.Badgett RG, Tanaka DJ, Hunt DK, Jelley MJ, Feinberg LE, Steiner JF, Petty TL. The clinical evaluation for diagnosing obstructive airways disease in high-risk patients. Chest. 1994;106:1427–31. doi: 10.1378/chest.106.5.1427. [DOI] [PubMed] [Google Scholar]
  • 60.Wipf JE, Lipsky BA, Hirschmann JV, Boyko EJ, Takasugi J, Peugeot RL, Davis CL. Diagnosing pneumonia by physical examination: relevant or relic? Arch Intern Med. 1999;159:1082–7. doi: 10.1001/archinte.159.10.1082. [DOI] [PubMed] [Google Scholar]
  • 61.Kalantri S, Joshi R, Lokhande T, Singh A, Morgan M, Colford JM, Pai M. Accuracy and reliability of physical signs in the diagnosis of pleural effusion. Respir Med. 2007;101:431–8. doi: 10.1016/j.rmed.2006.07.014. [DOI] [PubMed] [Google Scholar]
  • 62.Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glaziou PP, Irwig LM, Moher D, Rennie D, Vet HCV, Lijmer JG. The STARD statement for reporting studies of diagnostic accuracy: explanations and elaboration. Ann Intern Med. 2003;138:w1–w12. doi: 10.7326/0003-4819-138-1-200301070-00012-w1. [DOI] [PubMed] [Google Scholar]
  • 63.Simmel DL, Rennie D, Bossuyt PM. The STARD statement for reporting diagnostic accuracy studies: application to the history and physical examination. J Gen Intern Med. 2008;23:768–74. doi: 10.1007/s11606-008-0583-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159–174. doi: 10.2307/2529310. [DOI] [PubMed] [Google Scholar]
  • 65.King DB, Dickinson JA, Boulton MR, Toumpas C. Clinical skills textbooks fail evidence-based examination. ACP J Club. 2006;144:A10–1. [PubMed] [Google Scholar]
  • 66.Vukanovic-Criley JM, Criley S, Warde CM, Boker JR, Guevara-Matheus L, Churchill WH, Nelson WP, Criley JM. Competency in cardiac examination skills in medical students, trainees, physicians and faculty. Arch Intern Med. 2006;166:610–616. doi: 10.1001/archinte.166.6.610. [DOI] [PubMed] [Google Scholar]
  • 67.Mukkamala SG, Gennings C, Wenzel RP. R = 20: bias in the reporting of respiratory rates. Am J Emerg Med. 2008;26:237–9. doi: 10.1016/j.ajem.2007.05.001. [DOI] [PubMed] [Google Scholar]
  • 68.Weber MW, Usen S, Palmer A, Jaffar S, Mulholland EK. Predictors of hypoxaemia in hospital admissions with acute lower respiratory tract infection in a developing country. Arch Dis Child. 1997;76:310–4. doi: 10.1136/adc.76.4.310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Mower W, Sachs C, Nicklin E, et al. Pulse oximetry as a fifth vital sign. Pediatrics. 1997;99:681–6. doi: 10.1542/peds.99.5.681. [DOI] [PubMed] [Google Scholar]

Articles from Journal of General Internal Medicine are provided here courtesy of Society of General Internal Medicine

RESOURCES