Abstract
Objectives
The objective of this study was to develop a valid, reliable, web-based generic feline health-related quality-of-life (HRQoL) questionnaire instrument to measure the affective impact of chronic disease.
Methods
A large initial item pool, obtained through interviews with cat owners, was reduced using predetermined criteria, survey scores for relevance and clarity, and the ability of individual items to discriminate between healthy and sick cats when owners completed a prototype questionnaire. Using these data, factor analysis was used to derive a scoring algorithm and provide evidence for factorial validity. Validity was demonstrated further in a field trial using a ‘known groups’ approach (sick vs healthy cats will have a different HRQoL profile, and the HRQoL profile of cats will deteriorate as comorbidities increase). Test–retest reliability was assessed using intra-class correlation coefficients (ICCs).
Results
In total, 165 items were reduced to 20 and, on the basis of a factor analysis that explained 72.3% of the variation in scores input by 71 owners of 30 healthy and 41 sick cats using the prototype, these were allocated to three domains (vitality, comfort and emotional wellbeing [EWB]) with a scoring algorithm derived using item loadings. Subsequently, the owners of 36 healthy and 58 sick cats completed one or two (n = 48) assessments. Median scores (healthy vs sick) for all domains were significantly different (P <0.001), 78% of cats were correctly classified as healthy or sick and for comorbidities the correlation coefficients were moderate (vitality 0.64; comfort 0.63; EWB 0.50). Test–retest reliability was good (ICC vitality 0.635; comfort 0.716; EWB 0.853).
Conclusions and relevance
This study provides initial evidence for the validity and reliability of a novel HRQoL instrument to aid the assessment and management of chronic diseases of cats.
Introduction
It is the unpleasant feelings (affective component) associated with chronic disease that cause an individual to suffer. The medical profession recognises the importance of the valid and reliable measurement of how people feel and has addressed this through the development of instruments to measure health-related quality of life (HRQoL) for disease detection (discriminative purposes) or to measure change in health status over time (evaluative purposes). 1 Structured questionnaire instruments are developed and tested using well-established psychometric methodology.2–4
Instruments to measure HRQoL in companion animals consist of questions for the owner, who is well placed to report on the subtle changes in behaviour, attitude and demeanour that occur with chronic disease. While several feline disease-specific instruments exist,5–9 to date no validated generic HRQoL instrument exists for the purpose of comparing treatments or disease states. 10 A generic instrument (CHEW [Cat HEalth and Wellbeing]), based on owner-perceived health status has been reported, 11 but not validated, in sick cats. Similarly CatQol (Bijsmans) focuses on general health, eating, behaviour and management, but has been validated only in cats with chronic kidney disease (CKD). 12 More recently, Tatlock et al have described an owner-reported feline quality-of-life (QoL) scale for healthy cats. 13
Briefly, the psychometric approach to instrument design consists of the generation of a pool of items (questions), most often through interviews with key informants, reduction of these by various techniques, including expert judgement of the relevance and adequacy of items, 14 the identification of items that do not discriminate well between known groups of subjects, 15 and the use of a statistical technique called factor analysis (FA), 16 before pretesting and then testing for validity and reliability.
Evidence for any new instrument’s validity and reliability is essential before use in a clinical context. Various kinds of validity may be sought. For example, content validity is a measure of the extent to which the items included in a questionnaire are relevant and adequate for its purpose: it is established during its construction and assessed by expert judgement. Criterion validity is the agreement of a new instrument with some existing ‘gold standard’, but where that does not exist, evidence can be gathered to support concurrent criterion validity (comparison with a validated measure of a related construct) or predictive criterion validity where performance of the new measure successfully predicts that of a later measure. Construct validity – evidence that the instrument is measuring the construct that it is intended to measure – is considered to be the most robust and fundamental form of validity. 1
A construct is something that is not directly observable or measurable, such as ‘happiness’. The construct being measured here is HRQoL, which is the subjective evaluation by an individual of its circumstances, which include an altered health state and the impact of related interventions. 17 Construct validity is established by a process of hypothesis testing, where hypotheses are based upon how an instrument should perform if it is measuring the construct of interest. For example, factorial validity applies if FA of data generated using the instrument reveals an interpretable factor structure that fits the construct the instrument was designed to measure. 18 In a ‘known-groups’ approach to construct validation, predictions are made about how scores obtained with the instrument will differ between groups, such as healthy and sick animals, or will reflect disease burden, and these predictions are tested.
A reliable instrument will produce the same score when an unchanging subject is measured at two time points by the same observer (repeatability/intra-rater reliability), or when two people measure the same subject at one time (reproducibility/inter-rater reliability). 2
Previously a novel generic instrument to measure HRQoL in dogs was developed in which most of the items reported aspects of behaviour that owners believed were expressions of a dog’s subjective experience (feelings),17,19 and evidence for the validity and reliability of a web-based version was reported. 15 Subsequent shortening resulted in a 22-item instrument that retains the capacity of the prototype to measure the animal’s feelings. 20 The aim of this study was to develop an equivalent generic instrument for cats and to provide first evidence for its content validity, construct validity and intra-rater reliability.
Materials and methods
Ethical approval was granted by the University of Glasgow, and all participants gave informed consent.
Item generation and initial selection
Semi-structured interviews were conducted with the owners of healthy cats and cats with conditions likely to affect QoL, recruited through the University of Glasgow Small Animal Hospital (UGSAH) and several veterinary practices. The interviews were recorded and transcribed verbatim to generate items consisting of terms and phrases used by interviewees to describe their cats when healthy and sick. Interviews were continued until no new information emerged and redundancy was reached.2,10 Questions (open and closed) were worded carefully to limit response bias.21,22 Qualitative analysis of the transcripts was conducted using established methods in grounded theory, a methodology commonly used in the social sciences that involves the gathering and analysis of data to construct a theoretical framework for whatever is being studied. This is in contrast to conventional methods, which adopt an existing theoretical framework, and then collect data to show whether or not it applies to the phenomenon being studied. 23
Each item was considered by the authors and excluded if it was deemed to be related to individual personality traits; disease-specific; lacking in clarity/readability; more relevant to clinical examination rather than owner report; not relevant to HRQoL; or where a more appropriate description of that observation had been offered.
Content validation
An online survey (SurveyMonkey, San Mateo, CA, USA) of the remaining items was conducted in which groups of cat owners and clinicians were asked to judge their clarity and their relevance to the measurement of feline HRQoL. Relevance was scored using a 4-point Likert scale (0 = ‘not relevant’, 3 = ‘very relevant’). 24 The clarity of each item was determined using binary response options, ‘clear or not clear’. 25 Participants were asked for feedback on why they found an item not relevant or unclear, and were invited to suggest additional items.
Degree of relevance scores were then dichotomised by assigning scores of 0 and 1 as ‘not relevant’ and scores of 2 and 3 as ‘relevant’. Content validity index scores for relevance (I-CVIR) and clarity (I-CVIC) of each item were derived by averaging the scores given to each item and dividing by the number of respondents. 24 Items were excluded if I-CVIR ⩽0.60 or considered for revision/exclusion if I-CVIC <0.70.
Prototype construction and pre-testing
In the prototype instrument, each item was accompanied by a 7-point Likert scale (0 = ‘not at all’, 6 = ‘couldn’t be more’) to allow the respondent to rate the extent to which the item described his or her cat. Some of these anchors were re-worded to suit the form (word or phrase) of the item.
Software developers, Kyria Ltd (www.kyria.co.uk) produced a web-based prototype instrument for pre-testing with a number of cat owners. The prototype was revised as required to ensure optimal utility, functionality and lack of ambiguity.
To compare owners’ impressions of health status with that of clinicians, which has been found to differ in the dog according to previous work (unpublished), an additional owner question, ‘Is your cat perfectly healthy?’ – Yes/No, was included in the prototype instrument but did not form part of the assessment.
Field test 1 for item reduction, factorial validation and determination of scoring algorithm
Cat owners were recruited from first-opinion practices, feline specialist practices and the UGSAH. Owners completed one assessment for their cat and the attending clinician completed a general health assessment (supplementary material Appendix S1) to verify the cat’s health status.
The research team reviewed dot plots of the response scores (0–6) generated for each of the items and eliminated any item that was judged by all not to discriminate well between healthy and sick cats.
To establish evidence for factorial validity, and to determine a scoring algorithm for the instrument, a FA (principal components method with a varimax rotation) was performed with remaining items. The scores attributed to each item by the owners were used for the FA. The analysis allocates each item to a factor with a loading (0–1), which determines the closeness of its relationship to the factor. Resulting loadings were sorted, and any item with a loading <0.4 was excluded. A scree test and the Kaiser criterion were used to identify the optimum number of factors and the interpretability of a range of factor models was examined. Factors were interpreted based on how those items loading onto a particular factor were related, and a factor model was chosen that accounted for an acceptable amount of the variability in the data, was readily interpretable and did not include any factors containing only one or two items. An algorithm, based on the item–factor associations of the selected factor model, was derived in order to generate a domain score for each of the resultant factors/domains.
Field test 2 for construct validity and reliability testing
A new group of cat owners was recruited from first opinion practices, feline specialist practices and the UGSAH. The attending clinician completed a general health assessment (supplementary material Appendix S1) to verify the cat’s health status.
Owners of healthy and sick cats, grouped according to the clinical judgement of the consulting clinician, completed at least one assessment. A number of owners of healthy cats completed two assessments, 2 weeks apart, and test–retest reliability was assessed using the intra-class correlation coefficient (ICC). A one-way random model was assumed where the subjects (cats) are assumed random. 26
Using the first assessment for each cat, box plots and descriptive statistics were used to identify differences between healthy and sick cats, followed by formal statistical analysis using non-parametric Mann–Whitney tests owing to the non-normality of the data. Linear discriminant analysis was used to determine the ability of the instrument to differentiate healthy from sick cats. The correlation between the number of comorbidities affecting each cat and their HRQoL scores was investigated using linear regression and Pearson correlation coefficient for all three domains was calculated for healthy cats, cats with 1–2 comorbidities and cats with ⩾3 comorbidities.
The following hypotheses were tested: (1) that the HRQoL profile of scores will differ between healthy cats and sick cats; and (2) that the HRQoL profile will be worse for cats with poorer health status, as defined by the number of comorbidities present in individuals.
Results
An overview of the development process is shown in Figure 1.
Item generation and initial selection
Semi-structured interviews conducted with the owners (n = 18) of healthy (n = 19) and sick (n = 10) cats generated an initial pool of 165 items for consideration (Table 1). Table 2 illustrates the format of interview questions. One hundred and six items met our criteria for exclusion or revision (for examples see Table 3). Fifty-one items were retained for content validation.
Table 1.
Semi-structured interview | Field test 1 – online testing of prototype instrument | Field test 2 – testing for validity and reliability |
---|---|---|
• Cat owners (18: 5 males; 13 females)
• Single and multi-cat households (2–4 cats) • Cat age range 1.75–21 years • 19 healthy and 10 sick (four OA and hyperthyroidism; two OA and CKD; four OA only) |
• 71 single owner assessments (30 healthy cats; 41 sick cats) (43 males; 28 females)
• Mean age of healthy cats 6.5 years (range 0.3–16.5 years) • Mean age of sick cats 11.5 years (range 1.2–19.5 years) • 66% DSH; 34% purebred (eg, Maine Coon, Persian, Siamese, etc) • 95% comorbidities (see Table 5 for conditions) • 29% of owners of unhealthy cats misclassified their cats as being healthy – in contrast to clinical assessment |
• 94 single owner assessments (26 healthy cats; 58 sick cats) (48 male cats; 46 female cats)
• 48 repeat assessments • Mean age of healthy cats 4.6 years (range 1–10 years) • Mean age of sick cats 11.7 years (range 1.1–19.9 years) • 86% DSH; 14% purebred • 72% comorbidities (see Table 5 for conditions) • 26% of owners of unhealthy cats misclassified their cats as being healthy – in contrast to clinical assessment |
Demographics of cats include age, health status and presenting conditions, sex and breed. Misclassification between owner impression and clinician report of health status is reported for each study
OA = osteoarthritis; CKD = chronic kidney disease; DSH = domestic shorthair
Table 2.
Examples of questions asked of cat owners |
---|
Please describe your cat’s daily routine |
Can you tell how your cat is feeling, and, if so, describe how? |
How do you know when your cat is unwell? Healthy? Feeling happy? |
How did you first know your cat was unwell? Were there any behavioural changes you noticed specifically? |
How do you monitor that the disease is getting worse? |
How do you know that treatment is working? Or not working? |
What areas of your cat’s life are most impacted by the condition? |
Table 3.
Number of potential items | Rationale | Examples |
---|---|---|
21 eliminated | Explicitly described underlying personality traits | ‘Gentle’, ‘mischievous’, ‘bold’ |
22 eliminated | Specific to one disease | ‘Yowling’, ‘needing manual evacuation’, ‘night howling’ |
Clinical potential items | ‘Doesn’t like joint manipulation at the vet’, ‘muscle wastage’, weight loss/gain’ | |
Specific to individual cat or not easily recognisable | ‘Whiskers fanned out’, ‘easy to give medication to when she’s feeling well’, ‘head butts’ | |
Not relevant to measuring HRQoL | ‘Runs away after difficulty giving him his medication’, ‘cloudy eyes’, ‘bright eyes’ | |
63 eliminated | True synonyms/more appropriate descriptor commonly used | ‘Friendly’ was most commonly reported and synonym of ‘sociable’, ‘follows me around’ and ‘come to greet you’ |
More appropriate descriptor available | ‘Content’ was chosen instead of ‘chilled’ as it was believed more appropriate for a wider audience | |
Another descriptor would adequately capture a behaviour over one that is ‘too specific’ | ‘Interested in his/her food’ was retained, covering ‘loss of appetite’, ‘enjoying food’, ‘less hungry’ | |
11 revised | Revised to improve clarity/readability | ‘Doesn’t go out in winter anymore’ was revised to ‘going out in cold weather’; ‘getting up and down the stairs’ to ‘managing getting up and down the stairs’ |
HRQoL = health-related quality of life
Content validation
Fifty-eight participants (48 owners of 14 sick cats and 34 healthy cats) and 10 clinicians completed surveys assessing the clarity and relevance of the remaining 51 items. As a result of not having met I-CVI criteria, 13 of these items were eliminated and 11 items were revised, with one of those being split into two separate items (Figure 1).
Prototype construction and pre-testing
The prototype for field test 1 consisted of 39 items, 27 single words with the standard response option (0 = ‘not at all’; 6 = ‘couldn’t be more’) and 12 items where response options were reworded to suit the form of the item (eg, ‘hiding away’ – not hiding away at all/ couldn’t be hiding away more).
Pre-testing of the online instrument was conducted with 15 owners of five healthy cats and 10 sick cats. Following this, response options for two items – ‘Jumping or climbing up/down’ and ‘Usual sleeping patterns and/or places’ – were revised to improve readability and comprehension.
Field test 1 for item reduction, factorial validation and determination of scoring algorithm
Using the online prototype instrument for field test 1, 71 owner and clinician assessments from UGSAH, five general practices and one feline specialist practice were completed over a period of 5 months for 30 healthy cats and 41 cats diagnosed with a chronic condition expected to affect QoL (Table 1). Ninety-five percent of cats presented with 1–6 comorbidities (Table 5). Review of dot plots of these item responses suggested that 19 items were unlikely, in the opinion of the research team, to discriminate between sick and healthy cats (Figure 2); these were removed leaving 20 items to be included in the instrument for field test 2 (Table 4).
Table 5.
Presenting conditions | Field test 1 | Field test 2 |
---|---|---|
Degenerative joint disease | 23 | 33 |
Obesity | 8 | 11 |
Painful cancer | 0 | 1 |
Non-painful cancer | 3 | 4 |
Chronic skin disease | 7 | 1 |
Chronic medical condition | 28 | 21 |
Cardiac disease | 11 | 4 |
Neurological disease | 2 | 1 |
Chronic ear disease | 3 | 2 |
Chronic dental disease | 21 | 9 |
Chronic kidney disease | 20 | 16 |
Hyperthyroidism | 6 | 9 |
Chronic lower urinary tract disease | 2 | 6 |
Cat flu | 2 | 3 |
Chronic gastrointestinal disease | 10 | 5 |
Previous physical trauma | 4 | 3 |
Underweight | 12 | 21 |
Other | 7* | 2* |
Comorbid conditions were reported in 95% and 72% of cases in field tests 1 and 2, respectively
Other conditions included: field test 1 – cognitive decline (n = 3), cancer in remission (n = 2), diabetes (n = 2), proliferative gum disease (asymptomatic), liver disease and otitis externa right ear and scratches to pinna; field test 2 – hypertension (n = 2)
Table 4.
1 Please tell us how well this word describes (cat name) as he is today: Active
Couldn’t be more active – Not at all active: 6–0 2 Please tell us how well this word describes (cat name) as he is today: Unsteady Couldn’t be more unsteady – Not at all unsteady: 6–0 3 Please tell us how well this word describes (cat name) as he is today: Energetic Couldn’t be more energetic – Not at all energetic: 6–0 4 Please tell us how well this word describes (cat name) as he is today: Comfortable Couldn’t be more comfortable – Not at all comfortable: 6–0 5 Please tell us how well this word describes (cat name) as he is today: Lethargic Couldn’t be more lethargic – Not at all lethargic: 6–0 6 Please tell us how well this word describes (cat name) as he is today: Showing hunting behaviour Couldn’t be showing hunting behaviour more – Not at all showing hunting behaviour: 6–0 7 Please tell us how well this word describes (cat name) as he is today: Lively Couldn’t be more lively – Not at all lively: 6–0 8 Please tell us how well this word describes (cat name) as he is today: Alert Couldn’t be more alert – Not at all alert: 6–0 9 Please tell us how well this word describes (cat name) as he is today: Sore Couldn’t be more sore – Not at all sore: 6–0 10 Please tell us how well this word describes (cat name) as he is today: Content Couldn’t be more content – Not at all content: 6–0 11 Please tell us how well this word describes (cat name) as he is today: Playful Couldn’t be more playful – Not at all playful: 6–0 12 Please tell us how well this word describes (cat name) as he is today: Uncomfortable Couldn’t be more uncomfortable – Not at all uncomfortable: 6–0 13 Please tell us how well this word describes (cat name) as he is today: Enjoying the things he usually does Couldn’t be enjoying the things he usually does more – Not at all enjoying the things he usually does: 6–0 14 Please tell us how well this word describes (cat name) as he is today: Jumping or climbing up/down as usual Jumping or climbing up/down as usual – Not jumping or climbing up/down as usual: 6–0 15 Please tell us how well this word describes (cat name) as he is today: Exploring Couldn’t be exploring more – Not at all exploring: 6–0 16 Please tell us how well this word describes (cat name) as he is today: Feeling himself Couldn’t be feeling himself more – Not at all feeling himself: 6–0 17 Please tell us how well this word describes (cat name) as he is today: Stiff Couldn’t be more stiff – Not at all stiff: 6–0 18 Please tell us how well this word describes (cat name) as he is today: Happy Couldn’t be more happy – Not at all happy: 6–0 19 Please tell us how well this word describes (cat name) as he is today: Inquisitive Couldn’t be more inquisitive – Not at all inquisitive: 6–0 20 Please tell us how well this word describes (cat name) as he is today: Slow Couldn’t be more slow – Not at all slow: 6–0 |
A FA was conducted using the responses to these 20 items, all of which had loadings >0.4. A model containing three factors was considered to be optimal, accounting for 72.3% of the variance in the owner response data and consisting of factors that could be interpreted as HRQoL domains which were named by the research team as ‘vitality’ (11 items), ‘comfort’ (eight items) and ‘emotional wellbeing’ (EWB) (seven items). Some items loaded onto more than one factor. An algorithm, based on the item–factor associations for the three-factor model, was derived in order to generate three domain scores. However, for commercial reasons a description of the factor composition and the algorithm are not presented.
Field test 2 for construct validity and reliability testing
Using the resulting online instrument for field test 2, the owners of 36 healthy cats and 58 sick cats as determined by clinician assessment, representing a comprehensive range of breeds (Table 1), completed one assessment and, of these 94 owners, 48 owners completed two assessments. According to their responses to a direct question, a total of 26% of owners of sick cats believed their cat to be perfectly healthy, despite a clinician diagnosis of ill health. Sick cats presented with a range of conditions and 72% had 1–6 additional comorbidities (Table 5).
Construct validity
Differences between healthy and sick cats existed for all three domains (Table 6), supporting hypothesis 1 (null hypothesis: no difference in median score between healthy and sick cats, rejected at P <0.01), with greater variability in the sick group than in the healthy group (Figure 3). Linear discriminant analysis (using cross-validation) showed that the instrument correctly classified as either healthy or sick 78% (healthy 71%; sick 89% classified correctly) of the cats assessed. An increase in the number of comorbidities was associated with a deterioration in HRQoL profile (Figure 4). The Pearson correlation coefficients for vitality, comfort and EWB, and comorbidities (healthy, 1–2 and >3) were −0.64, −0.63 and −0.50, respectively.
Table 6.
Domain | Number of cats | Mean ± SD | IQR | Median | Mann–Whitney difference in median (healthy – unwell) |
P value | 95% CI |
---|---|---|---|---|---|---|---|
Vitality | |||||||
Healthy | 36 | 3.30 ± 0.53 | 0.84 | 3.37 | 1.04 | <0.001 | 0.71–1.32 |
Sick | 58 | 2.32 ± 0.75 | 1.08 | 2.23 | |||
Comfort | |||||||
Healthy | 36 | 5.90 ± 0.20 | 0.23 | 6.00 | 0.63 | <0.001 | 0.40–0.98 |
Sick | 58 | 5.06 ± 0.73 | 1.06 | 5.25 | |||
EWB | |||||||
Healthy | 36 | 3.73 ± 0.32 | 0.45 | 3.87 | 0.56 | <0.001 | 0.33–0.77 |
Sick | 58 | 3.15 ± 0.63 | 0.83 |
IQR = interquartile range; CI = confidence interval
Reliability
Forty eight owners completed a second assessment for their cats with a minimum of 14 days between assessments and the ICCs (95% confidence intervals [CIs]) for vitality, comfort and EWB were 0.635 (0.044–0.862), 0.716 (0.256–0.893) and 0.853 (0.615–0.945), respectively.
Discussion
One hundred and sixty-five potential items were collected from the owners of sick and healthy cats using best practice for qualitative research. 2 As information obtained from key informants underpins content validity, comprehensive representation of all relevant populations is necessary. Freeman et al described a generic HRQoL scale for cats, 11 where key informants were restricted to owners/caregivers of healthy cats. Similarly Tatlock et al used pet owners of healthy cats as key informants. 13 However, 29% and 27% of owners (field tests 1 and 2, respectively) in this study thought their cats were healthy when clinicians deemed them to be sick, reinforcing that such judgement may be unreliable. 27 Bijsmans et al used owners as informants for CatQol, but no details are available regarding the health status of these cats. 12 In contrast, the health status of the 19 healthy cats and 10 sick cats belonging to owners recruited as key informants in this study was verified by a veterinary surgeon. Although this number of cats could be considered to be low, interviews were conducted until no new information emerged. In addition, 48 different owners involved in the content validation process were invited to suggest additional items if they felt the collection of items was inadequate. Initial reduction of the 165 items was based on criteria devised by the investigators, an approach considered to be appropriate in human medicine.28–30
In veterinary science, many rely on cognitive debriefing interviews to establish the content validity of an instrument scale,11,31 or simply ask owners to judge whether the instrument appears to be capable of measuring what it is intended to measure (face validity). 32 However, in this study a group of vets, as well as a large group of owners, were involved in the validation process, adding to the robustness of this stage in the process.
In human medicine and the social sciences, the quantification of content validity has been introduced. One approach, used here, asks relevant ‘experts’ to rate the relevance and clarity of items using a rating scale, and those ratings are used to calculate a CVI for each individual item on the scale (I-CVI), providing objective information to guide researchers in revising, deleting, or substituting items. 33 The instrument described here is the first instrument in veterinary science to quantify and establish content validity of each item using this technique.
Following field test 1, 19 items were excluded based on research team judgement that they did not discriminate healthy from sick cats. Although it was considered unlikely that an item that was unable to discriminate healthy from sick cats would prove useful in an evaluative context, the possibility cannot be discounted and items removed at this stage may be reassessed for inclusion if the instrument proves not to be responsive to clinical change in further longitudinal studies.
The remaining 20 items all loaded >0.4 in the FA. Factor loadings of 0.3, 0.5 and 0.7 are generally considered to be low, medium and high, respectively, 34 with loadings of 0.3 deemed to be the minimum consideration level for FA. Increasing the loading threshold to 0.4 may have accounted for the fact that no further items were removed at this stage. Furthermore, the fact that the majority of loadings were >0.6 indicates stability of the factor model. 35 Although FA provides any number of factor models for a given data set, there are established methods of identifying how many factors could sensibly be extracted, including the scree test and the Kaiser criterion, 36 both considered in this study. The three-factor model adopted here, accounting for 72% of the variability in the owner response data, compares favourably with the canine HRQoL (64%) 15 and 72% for the shortened instrument, 20 an 11-factor questionnaire designed to measure the behaviour and temperament of pet dogs (57%) 37 and a four-factor QoL questionnaire regarding infants (45%), 38 and confirms factorial validity.
Scores in all three domains of HRQoL were significantly different between healthy and sick cats. The fact that domain scores in the sick cats showed more variation than the healthy cats is not surprising given the heterogeneity of disease in the sick cat population. Interestingly, the variability in healthy cat vitality was similar to that of the sick cats, but this probably reflects the tendency for the individual variation in vitality that tends to exist in this species. Furthermore, cats with greater comorbidity had lower HRQoL scores, indicating a poorer QoL, with moderate Pearson correlation coefficients for all three domains, 39 thus upholding known-groups hypothesis, providing additional evidence for construct validity.
Evidence for known-group validity relating to health status of other generic HRQoL instruments in cats is sparse. Freeman et al investigated the validity of their scale in a large group of 1303 cats, 11 but only eight of these were categorised as being ‘not very healthy’ or ‘not at all healthy’ by their owners – in any case a judgement that we have shown in this study to be unreliable. Bijsmans et al demonstrated that their instrument detected difference between healthy cats and those with CKD, 12 but this evidence is of limited value in relation to the proposed generic nature of their instrument.
Discriminant analysis indicated an overall misclassification rate of 22% vs that reported for dogs with chronic pain (12% misclassification) 19 and for a proxy instrument for pain measurement in communicatively impaired children (13%). 40 Misclassifications in the study reported here may have been a result of measurement error, or may have occurred because the QoL of some healthy cats was compromised at the time for reasons other than poor health, or because some sick cats may have been experiencing a good QoL at the time.
Criterion validity was not carried out because no gold standard instrument for the measurement of HRQoL exists, but the authors do not discount the possibility of being able to demonstrate concurrent or predictive criterion validity in future studies when suitable measures become available.
Test–retest reliability was carried out on data for healthy cats only, for which health status would be less likely to change over a 2 week period than would that of sick cats. A 2 week period between the completion of questionnaires is commonly chosen for this purpose, being a short enough period for change in health status to be unlikely but being a long enough period for respondents to be unlikely to remember their previous responses. The ICC values for the comfort and EWB domains were >0.7 and >0.8, respectively, indicating that test–retest reliability for those domains was good, and it was moderate for vitality (ICC >0.6). 41
Conclusions
The measurement of feline HRQoL is becoming more necessary as chronic diseases such as CKD, hyperthyroidism, cognitive decline and osteoarthritis affect the QoL of an increasing number of ageing cats, and evidence-based medicine requires that robust measures of clinical impact be developed. This study has provided initial evidence for the reliability and validity of a novel generic instrument that measures the affective component of the chronic disease experience. However, it is important to emphasise that validity is not determined by a single statistic, but by a body of research that supports the claim that the instrument is valid for particular purposes, with defined populations and in specified contexts. 2 Accordingly, future research will seek to provide such evidence, as well as evidence for the instrument’s responsiveness to clinical change including that following treatment. The instrument is available for clinical use and for clinical trials from NewMetrica (www.newmetrica.com). For further information please contact the corresponding author.
Supplemental Material
Veterinary assessment
Acknowledgments
The authors wish to thank all the cat owners, as well as the veterinary surgeons and nurses in practice and at the University of Glasgow Small Animal Hospital, who willingly participated in our studies
Footnotes
Accepted: 16 January 2018
Supplementary material: Appendix S1: Veterinary assessment.
Professor Reid is a shareholder of NewMetrica, which is the developer and supplier of the instrument.
Funding: We acknowledge Scottish Enterprise for the award of a SMART grant, which enabled us to undertake the study, and Boehringer Ingelheim for additional financial support.
References
- 1. Fayers PM, Machin D. Quality of life: the assessment, analysis and interpretation of patient-reported outcomes. John Wiley & Sons, 2007. [Google Scholar]
- 2. Streiner DL, Norman GR, Cairney J. Health measurement scales: a practical guide to their development and use. New York: Oxford University Press, 2015. [Google Scholar]
- 3. Abell N, Springer DW, Kamata A. Developing and validating rapid assessment instruments. Oxford: Oxford University Press, 2009. [Google Scholar]
- 4. Brod M, Tesler LE, Christensen TL. Qualitative research and content validity: developing best practices based on science and experience. Qual Life Res 2009; 18: 1263–1278. [DOI] [PubMed] [Google Scholar]
- 5. Freeman LM, Rush JE, Oyama MA, et al. Development and evaluation of a questionnaire for assessment of health-related quality of life in cats with cardiac disease. J Am Vet Med Assoc 2012; 240: 1188–1193. [DOI] [PubMed] [Google Scholar]
- 6. Zamprogno H, Hansen BD, Bondell HD, et al. Item generation and design testing of a questionnaire to assess degenerative joint disease-associated pain in cats. Am J Vet Res 2010; 71: 1417–1424. [DOI] [PubMed] [Google Scholar]
- 7. Niessen SJM, Powney S, Guitian J, et al. Evaluation of a quality-of-life tool for cats with diabetes mellitus. J Vet Intern Med 2010; 24: 1098–1105. [DOI] [PubMed] [Google Scholar]
- 8. Noli C, Minafò G, Galzerano M. Quality of life of dogs with skin diseases and their owners. Part 1: development and validation of a questionnaire. Vet Dermatol 2011; 22: 335–343. [DOI] [PubMed] [Google Scholar]
- 9. Noli C, Borio S, Varina A, et al. Development and validation of a questionnaire to evaluate the quality of life of cats with skin disease and their owners, and its use in 185 cats with skin disease. Vet Dermatol 2016; 27: 247–e58. [DOI] [PubMed] [Google Scholar]
- 10. Giuffrida MA, Kerrigan SM. Quality of life measurement in prospective studies of cancer treatments in dogs and cats. J Vet Intern Med 2014; 28: 1824–1829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Freeman LM, Rodenberg C, Narayanan A, et al. Development and initial validation of the Cat HEalth and Wellbeing (CHEW) Questionnaire: a generic health-related quality of life instrument for cats. J Feline Med Surg 2016; 18: 689–701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Bijsmans ES, Jepson RE, Syme HM, et al. Psychometric validation of a general health quality of life tool for cats used to compare healthy cats and cats with chronic kidney disease. J Vet Intern Med 2016; 30: 183–191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Tatlock S, Gober M, Williamson N, et al. Development and preliminary psychometric evaluation of an owner-completed measure of feline quality of life. Vet J 2017; 228: 22–32. [DOI] [PubMed] [Google Scholar]
- 14. Osse BHP, Vernooij-Dassen MJFJ, Schadé E, et al. A practical instrument to explore patients’ needs in palliative care: the Problems and Needs in Palliative Care questionnaire – short version. Palliat Med 2007; 21: 391–399. [DOI] [PubMed] [Google Scholar]
- 15. Reid J, Wiseman-Orr ML, Scott EM, et al. Development, validation and reliability of a web-based questionnaire to measure health-related quality of life in dogs. J Small Anim Pract 2013; 54: 227–233. [DOI] [PubMed] [Google Scholar]
- 16. Las Hayas C, Quintana JM, Padierna JA, et al. Use of rasch methodology to develop a short version of the Health Related Quality of life for Eating Disorders questionnaire: a prospective study. Health Qual Life Outcomes 2010; 8: 29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Wiseman-Orr ML, Scott EM, Reid J, et al. Validation of a structured questionnaire as an instrument to measure chronic pain in dogs on the basis of effects on health-related quality of life. Am J Vet Res 2006; 67: 1826–1836. [DOI] [PubMed] [Google Scholar]
- 18. Johnston AL, File SE. Sex differences in animal tests of anxiety. Physiol Behav 1991; 49: 245–250. [DOI] [PubMed] [Google Scholar]
- 19. Wiseman-Orr ML, Nolan AM, Reid J, et al. Development of a questionnaire to measure the effects of chronic pain on health-related quality of life in dogs. Am J Vet Res 2004; 65: 1077–1084. [DOI] [PubMed] [Google Scholar]
- 20. Reid J, Wiseman-Orr ML, Scott EM. Shortening of an existing online health-related quality of life (HRQL) instrument for dogs. J Small Anim Pract 2018; 59: 334–342. [DOI] [PubMed] [Google Scholar]
- 21. Oppenheim AN. Questionnaire design, interviewing and attitude measurement. 2nd ed. London: Bloomsbury Publishing, 2000. [Google Scholar]
- 22. Foddy WH. Constructing questions for interviews and questionnaires: theory and practice in social research. Cambridge: Cambridge University Press, 1993. [Google Scholar]
- 23. Glaser B. Grounded theory methodology. Introd Qual Res Psychol 2013; 3: 69–82. [Google Scholar]
- 24. Lynn MR. Determination and quantification of content validity. Nurs Res; 35: 382–385. [PubMed] [Google Scholar]
- 25. van Schelven AR, Dikken J, Sillekens LGM, et al. Content validation of the Dutch version of the ‘Older Patients in Acute Care Survey’, an instrument to measure the attitude of hospital nurses towards older patients. Int J Clin Med 2015; 6: 7–18. [Google Scholar]
- 26. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull 1979; 86: 420–428. [DOI] [PubMed] [Google Scholar]
- 27. Spofford N, Lefebvre SL, McCune S, et al. Should the veterinary profession invest in developing methods to assess quality of life in healthy dogs and cats? J Am Vet Med Assoc 2013; 243: 952–956. [DOI] [PubMed] [Google Scholar]
- 28. Melzack R. The McGill Pain Questionnaire: major properties and scoring methods. Pain 1975; 1: 277–299. [DOI] [PubMed] [Google Scholar]
- 29. Juniper EF, Guyatt GH, Jaeschke R. How to develop and validate a new health-related quality of life instrument. In: Spilker B. (ed). Quality of life and pharmacoeconomics in clinical trials. 2nd ed. Philadelphia, PA: Lippincott-Raven, 1996, pp 49–56. [Google Scholar]
- 30. Armstrong FD, Toledano SR, Miloslavich K, et al. The Miami pediatric quality of life questionnaire: parent scale. Int J Cancer Suppl 1999; 12: 11–17. [DOI] [PubMed] [Google Scholar]
- 31. Favrot C, Linek M, Mueller R, et al. Development of a questionnaire to assess the impact of atopic dermatitis on health-related quality of life of affected dogs and their owners. Vet Dermatol 2010; 21: 64–70. [DOI] [PubMed] [Google Scholar]
- 32. Walton MB, Cowderoy E, Lascelles D, et al. Evaluation of construct and criterion validity for the ‘Liverpool Osteoarthritis in Dogs’ (LOAD) clinical metrology instrument and comparison to two other instruments. PLoS One 2013; 8: e58125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Polit DF, Beck CT. The content validity index: are you sure you know what’s being reported? Critique and recommendations. Res Nurs Health 2006; 29: 489–497. [DOI] [PubMed] [Google Scholar]
- 34. Shevlin M, Miles JNV. Effects of sample size, model specification and factor loadings on the GFI in confirmatory factor analysis. Pers Individ Dif 1998; 25: 85–90. [Google Scholar]
- 35. MacCallum RC, Widaman KF, Zhang S, et al. Sample size in factor analysis. Psychol Methods 1999; 4: 84–99. [Google Scholar]
- 36. Coste J, Bouée S, Ecosse E, et al. Methodological issues in determining the dimensionality of composite health measures using principal component analysis: case illustration and suggestions for practice. Qual Life Res 2005; 14: 641–654. [DOI] [PubMed] [Google Scholar]
- 37. Hsu Y, Serpell JA. Development and validation of a questionnaire for measuring behavior and temperament traits in pet dogs. J Am Vet Med Assoc 2003; 223: 1293–300. [DOI] [PubMed] [Google Scholar]
- 38. Manificat S. A new instrument to evaluate infant quality of life. Qual Life Newsl MAPI Res Inst 1999; 23: 7–8. [Google Scholar]
- 39. Stallard P, Williams L, Velleman R, et al. The development and evaluation of the pain indicator for communicatively impaired children (PICIC). Pain 2002; 98: 145–149. [DOI] [PubMed] [Google Scholar]
- 40. Mukaka MM. A guide to appropriate use of correlation coefficient in medical research. Malawi Med J 2012; 24: 69–71. [PMC free article] [PubMed] [Google Scholar]
- 41. Rosner B. Fundamentals of biostatistics. 8th ed. Boston: Brooks Cole, 2016. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Veterinary assessment