Keywords

Enteral feeding, patient-reported outcome measures, quality of life, validity

 

Authors

  1. Simpelaere, Ingeborg
  2. White, Anne
  3. Bekkering, Geertruida E.
  4. Geurden, Bart
  5. Van Nuffelen, Gwen
  6. De Bodt, Marc

Abstract

Review question/objective: The objective is to systematically review the psychometric properties and the clinical utility of patient-reported outcome measures (PROMs) and proxy-reported outcome measures that assess health-related quality of life (HRQoL) among patients receiving enteral feeding to make recommendations for use in clinical practice and research. The purpose of this systematic review is to evaluate the psychometric properties and the clinical utility of:

 

* Measures to assess HRQoL among patients receiving enteral feeding, regardless the cause of receiving enteral feeding.

 

* Measures to assess HRQoL in the subgroup "patients receiving enteral feeding as a consequence of dysphagia".

 

* Measures (1 and 2) that are completed by patients' proxy to assess HRQoL among patients receiving enteral feeding.

 

 

The research question is: What are the psychometric properties and the clinical utility of these measures? We will summarize evidence on the following properties: validity (content validity, criterion-related validity, construct validity, floor and ceiling effects), reliability (reproducibility and internal consistency) and responsiveness and clinical utility (interpretability and feasibility to complete the PROM and the proxy-reported outcome measure).

 

Article Content

Background

Patient-reported outcome measures (PROMs) can be used to assess patients' health-related quality of life (HRQoL) from the patients' perspective.1 The Food and Drug Administration of the US Department of Health and Human Services defines PROMs as "any report of the status of a patient's health condition that comes directly from the patient, without interpretation of the patient's response by a clinician or anyone else".2(p.2) Therefore, PROMs can be used to measure patients' own perceptions of their health status,1,3 HRQoL1,3 and individual treatment effects.4 However, self-reporting can be problematic or impossible for some patients.5 In these situations, proxy-reported outcome measures may be used for the assessment of the patient's HRQoL as the proxies can provide relevant information about how they think the patient would report on his or her HRQoL. Patient-reported and proxy-reported outcome measures are also used in the context of enteral feeding.6,7 Enteral feeding is an alternative feeding strategy to be used in patients with insufficient oral food intake,8 such as patients with severe dysphagia and other conditions.9-11 Although patients may benefit from this feeding strategy in terms of nutritional status,9,12 the impact on patients' HRQoL is equally important.12 Therefore, PROMs or proxy-reported (i.e. when self-reporting is complicated) outcome measures should be evaluated in addition to conventional clinical assessments and instrumental investigations.13 As a means to select the most suitable PROM and proxy-reported outcome measure for assessing HRQoL among patients receiving enteral feeding, it is important to evaluate the psychometric properties and the clinical utility of these measurement instruments.14

 

Quality of Life (QoL) is defined by the World Health Organization as follows: "individuals' perceptions of their position in life in the context of the culture and value systems in which they live and in relation to their goals, expectations, standards and concerns. It is a broad-ranging concept incorporating in a complex way the person's physical health, psychological state, level of independence, social relationships, personal beliefs and their relationships to salient features of the environment".15(p.3) There is still a lack of consensus among researchers about the definition of QoL that can explain the different constructs or concepts that are selected for their measurement instruments.16,17 Quality of life varies between populations, time periods and diseases.18-20 Also, the concept is subjective as it is based on a perception by the individual around his or her wellbeing.12 Individuals from the same environment seem to have a similar conception about QoL (e.g. "reaching happiness" and "satisfaction in life" are important constructs of QoL in individuals living in the Western countries).16 Health-related quality of life is distinct from QoL as it is directly associated to a health state of an individual.16,21 Various definitions of HRQoL exist.19,22 However, agreement is reached in that HRQoL is a multidimensional concept (i.e. consists of physical, psychological and social domains of health), it is based on the individual's perception of his or her wellbeing, encompasses the current health situation and the future perspective and is not restricted to the disease.16,19

 

Assessment of HRQoL is useful for measuring the outcome of disease and treatment effectiveness.23 The most common way to measure HRQoL issues is by patient-reported outcome (PRO) instruments, using a standardized questionnaire or an interview.4,13,24-26 Patient-reported outcome measures can be generic instruments, disease-specific instruments or battery of scales. Generic PROMs can be used in diverse patient populations and are suitable for comparing HRQoL and the efficiency of different treatment strategies across these populations.24,27,28 Generic instruments can examine a broad range of domains useful to various health conditions and diseases.19,24 In contrast, disease-specific PROMs assess one specific patient population in which a particular disease manifests, measure the HRQoL domains relevant to that specific patient population and the impact of specific treatments on that population group.19,24,28 Battery of scales comprises multiple scales that measure one particular HRQoL domain (i.e. social functioning).24 Such unidimensional HRQoL instruments will assess a relevant domain in depth, but does not allow to benchmark populations. Moreover, the burden on patients to complete multiple scales can become onerous and time consuming, which may impede results.24,29 In this review, we are interested in multidimensional PROMs, such as the generic and disease-specific scales. These multidimensional scales are intended to measure the constructs of interest and consist of subscales that measure different aspects of health.27 To select the most appropriate PROMs for measuring HRQoL among patients receiving enteral feeding, it is important that these PROMs measure the relevant constructs or concepts that are crucial for patients receiving enteral feeding.30 Clear definitions about HRQoL in enteral feeding and the constructs being measured should have been provided in the studies that have established the psychometric properties of the PROMs of interest. We will collect these definitions (cf. data extraction).

 

Among patients with cognitive, speech and language disorders, or in some other populations (i.e. infants), self-reported methods may be inappropriate or impossible.23,24 Proxies, defined as healthcare providers, spouses, parents or relatives who are closely involved with the patient,31 can act as alternative sources to obtain valuable information about patients' HRQoL.23,32 The proxies are asked to assess the patient's HRQoL in the manner that they believe the patient would report on this.5 The use of proxy-reported outcome measures allows to reveal information about the patient's HRQoL that may otherwise be lost.33 However, these proxy-reported outcome measures are not the same as PROMs as the PRO information is obtained by proxies.21,34 Furthermore, relying on proxy-provided information is only useful if proxies are able to report on the different HRQoL domains and if their ratings are sufficiently reliable.33 The reliability of the proxy ratings can be evaluated by examining patient-proxy agreement. This requires that the PROM and the accompanying proxy-reported outcome instrument measure the same constructs with identical items and that these instruments and items are valid and reliable.35,36 As various terms are available for these proxy versions (i.e. proxy-reported outcome measures, parent proxy-report instruments),34,35 we will make use of the term 'proxy-reported outcome measures' in this systematic review. Considering the patient-proxy agreement, a review of Sneeuw et al.23 revealed that proxy raters tend to report more HRQoL problems than patients do themselves. However, the subsequent underestimation of patient's HRQoL should be tempered as discrepancies in patient-proxy agreement were only found significant in a minority of cases.23

 

There is a growing interest to take into account HRQoL measures in patients receiving enteral tube feeding in hospital and among outpatients as well.12,37 Enteral tube feeding refers to the delivery of complete nutrition into the stomach. The tube can be placed through the nose via the esophagus into the stomach (nasogastric feeding or NG-feeding) or directly into the stomach through the skin on the abdomen (percutaneous gastrostomy (PEG) feeding or percutaneous endoscopic gastrostomy-feeding).11 Enteral tube feeding is an alternative for patients with insufficient oral food intake8 and is most commonly (64.1%)9 used in patients with dysphagia.9,11 Dysphagia or swallowing disorder refers to any dysfunction in the swallowing process as a result of anatomical or physiological deficits in the mouth, pharynx, larynx and/or esophagus.38,39 Enteral feeding therapy can benefit patients with dysphagia, who cannot meet their nutritional needs through any other intervention.9 Although most commonly used in patients with dysphagia, also other conditions can benefit from enteral tube feeding, such as patients with gastrointestinal dysfunction, patients with mental health disorders, patients with psychological problems and patients with increased nutritional requirements.11

 

The provision of enteral feeding can yield nutritional benefits11,40 and allow weight maintenance9 in all populations (infants,41 children,42 adults43 and the elderly9). Some patients with rapidly progressive diseases can also benefit from enteral tube feeding in earlier stages of their disease progress. For example, enteral tube feeding can relieve pressure to eat at mealtimes and can reduce mealtime duration.18 It may also reduce accelerated weight loss due to poor caloric intake.44 However, enteral feeding may also have downsides, for example enteral feeding can result in dependency upon the feeding tube.9 The subsequent psychological and social aspects9,43 of this dependency may affect the patient's HRQoL in a negative way.43 For instance, patients receiving enteral feeding sometimes report feelings of discomfort while dressing and washing, as well as uncomfortable feelings about their body image.12 Also, loss of social interaction associated with feeding (e.g. not visiting their family and not going out because of tube feeding) can take place.9 In a study of Roberge et al.,45 feelings of depression were reported in some patients receiving enteral feeding. Though the feeding technique is typically physically well tolerated,45 some consequences of enteral feeding may occur such as local wound complications, leakage around the insertion site, tube occlusion and increased reflux.46,47 Aspiration pneumonia, which refers to the inhalation of oropharyngeal material into the larynx and lower respiratory tract,48 can also be a common complication.49 Finally, in some populations, enteral feeding is associated with an increased mortality rate, both in hospital50 and in the community.51 Evaluating the impact of enteral feeding on the patients' HRQoL is therefore essential and should be taken into account when considering enteral feeding.

 

A literature review related to the topic of this systematic review has been previously published.12 Brotherton et al.12 reviewed studies that measured the impact of enteral tube feeding on adult patient's HRQoL to identify feeding-related factors that influence patient's HRQoL. This literature review exposed a lack of specific HRQoL assessment tools.12 Our systematic review will evaluate the psychometric properties of PROMs and proxy-reported outcome measures that assess HRQoL among patients receiving enteral feeding. Based upon an initially limited search in MEDLINE, the Cochrane Database of Systematic Reviews and JBI COnNECT+ in February 2016, to our knowledge there are no systematic reviews that have reviewed the psychometric properties and the clinical utility of PROMs and proxy-reported outcome measures assessing HRQoL among patients receiving enteral feeding.

 

The objective of this systematic review is to evaluate the psychometric properties (i.e. validity, reliability and responsiveness) and the clinical utility of PROMs and proxy-reported outcome measures (questionnaires or structured interviews) that assess HRQoL among patients receiving enteral feeding to make recommendations for its use in clinical practice and research.

 

Definitions

To fully understand the psychometric properties, some terms should be defined52,53: (Note that these descriptions are also applicable for the proxy-reported outcome measures.)

 

Validity52 can be defined in terms of:

  

* Content validity refers to the extent to which the different domains of the PROM are comprehensively sampled by the items in the questionnaire.

 

* Criterion-related validity measures the extent to which scores on a particular PROM are related to a gold standard. For most PROMs, it is not possible to measure criterion validity because there is no gold standard.34 A gold standard only exists when a shorter version of the PROM is tested against its complete version. This entire version of the PROM is then considered the gold standard.30

 

* Construct validity examines the extent to which the subscales on a particular PROM are correlated with subscales from other PROMs that assess similar constructs. These comparisons are based on theoretically derived hypotheses concerning the constructs that are being measured. These constructs and hypotheses should be defined in the studies of the PROMs being reviewed. Therefore, the constructs will be used as defined in the studies to evaluate construct validity.

 

* Floor and ceiling effects: In the case of floor and ceiling effects, extreme items are missing in the lower or upper end of the scale. Consequences of these effects are reduced reliability (no ability of the questionnaire to distinguish between patients with highest and lowest scores) and limited responsiveness (changes in subjects cannot be measured).

 

Reliability encompasses "reproducibility"52 and "internal consistency":53

 

Reproducibility can be defined in terms of "agreement" and "reliability":52

  

* Agreement refers to the degree to which the scores on repeated measures are close to each other, more specifically the absolute measurement error. The following estimates of agreement are frequently used:Standard error of measurement (SEM) = measurement error.Systematic differences should be considered as part of the SEM because they should be distinguished from real changes (i.e. due to treatment). Therefore, SEMagreement is preferred.Standard error of measurement could be converted in the smallest detectable change (SDC). The SDC is the smallest within-person change in score that - with P < 0.05 - can be interpreted as a "real change" in one individual (SDCind) with SDCind = 1.96 x [square root]2 x SEM or for a group SDCgroup with SDCind/[square root]n.Limits of agreement (LOA): "equal the mean change in scores of repeated measurements >= (mean change) +/- 1.96 x SDchange".54(p.37) SDchange = standard deviation of these changes.Minimal important change (MIC): "the minimal amount of change in the (sub)scale that is considered to be important".54(p.37)

 

* Reliability refers to the degree to which patients can be distinguished from each other, despite measurement errors (relative measurement error). Reliability is defined as "the proportion of the total variance in the measurements which is because of true differences among patients".55(p.743) The "true" does not refer to the accuracy of the score, but to its consistency. As previously reported,53 it is important to note that "true" should be seen in the context of classical test theory, as one of the two components of any observation, namely the "true score" and the "error associated with the observation".55(p.743) "True" is defined by Mokkink et al. as "the average score that would be obtained if the scale were given an infinite number of times".55(p.743) The preferred estimates of reliability are the intraclass correlation coefficient (ICC) for continuous measures33,52,53 and weighted Cohen's kappa for ordinal measures.52,53

 

Internal consistency refers to the extent to which items in a (sub)scale are intercorrelated and thus measuring the same concept. Factor analysis is applied to assess whether the items form one scale/dimension or more than one. After assessing the number of (homogeneous) (sub)scales, each (sub)scale should be measured by Cronbach's alpha.52

 

Responsiveness52 assesses the ability of a PROM to detect clinically important changes over time, even if these changes are small. Responsiveness is a measure of longitudinal validity and should therefore be examined by testing predefined hypotheses about expected correlations between changes in measures or expected differences in changes between known groups. The PROM should be able to distinguish clinically important change from measurement error. The following estimates of responsiveness are frequently used:

  

* Responsiveness ratio (Guyatt's RR): The degree in which the clinically important change (MIC) is related to the between-subject variability in within-subject changes in stable subjects (SDchange). Terwee et al.52 recommend the RR to be at least 1.96. "At the value of 1.96, the MIC equals the SDCind (= 1.96 x SDchange)".54(p.37)

 

* Area under the receiver operating characteristics curve or AUC: The ability of a questionnaire to distinguish between patients who have changed and who have not, based on an external criterion.

 

Clinical utility is defined in terms of "interpretability" and "feasibility" to complete the PROM and the proxy-reported outcome measure. It is considered as an important characteristic of an instrument for its use in research or clinical practice.56

  

* Interpretability is the extent to which a qualitative meaning can be assigned to quantitative scores. Interpretability can be determined by mean and SD for various subgroups and by determining MIC.52

 

* Feasibility is the facilitation in completing the PROM, the way of administration, time duration, the patient's burden to complete the PROM and/or the proxy's burden to complete the proxy-reported outcome measure.

 

Inclusion criteria

Types of participants

The current review will consider all studies including patients of all ages (infants, children, adults and elderly), who receive enteral tube feeding as a consequence of dysphagia or because of other conditions, and their proxies.

 

Intervention/phenomena of interest

This review will consider primary studies that report on:

  

* Multidimensional PROMs that assess HRQoL among patients receiving enteral feeding.

 

* Multidimensional proxy-reported outcome measures that assess HRQoL among patients receiving enteral feeding.

 

Batteries of scales will be excluded because these are not multidimensional HRQoL instruments.

 

Outcomes

The current review will consider studies that report on the clinical utility and psychometric properties concerning validity, reliability, responsiveness of the PROMs and the proxy-reported outcome measures of interest. It is likely that not all of these outcomes will be present in each of the included articles. This review will include studies that report on at least one of the following outcomes involving validity, reliability, responsiveness and/or clinical utility.

 

For validity, studies reporting on at least one of the following outcomes will be included:

  

* Content validity: Purpose of the instrument: definition of HRQoL related to enteral feeding and the constructs being measured, target population: adults/children/infants/elderly, mode of administration: self-administered/interview based, number of domains, number of items per scale/domain, scales (single-item and multiple-item scales), items related to enteral feeding, item selection (target population and experts or investigators were involved in item selection), range of scores, response options and interpretability of the items (clearly, simple items; recall time = the time period to which the questions refer).

 

* Criterion-related validity: Pearson's or Spearman correlation coefficient or other appropriate statistics.

 

* Construct validity: Constructs and hypotheses are specified in advance, correlation coefficients (i.e. Spearman or Pearson's correlation coefficient) or appropriate statistics according to the stated hypotheses.

 

* Floor and ceiling effects: Percentage (%) of respondents that achieve the lowest or highest possible score.

 

For reliability, studies reporting on at least one of the following outcomes will be included:

  

* Reproducibility: As defined in the background section, reproducibility encompasses "agreement" and "reliability". Therefore, studies reporting on at least one of the following outcomes of agreement and/or reliability will be included:Agreement: SDC (SDCind when applied in individuals and SDCgroup when applied in groups), LOA, convincing arguments that agreement is acceptable, and MIC.Reliability: ICC for continuous measures and weighted Cohen's kappa coefficient for ordinal measures.

 

* Internal consistency: Factor analysis, sample size and Cronbach's alpha.

 

For responsiveness, studies reporting on at least one of the following outcomes will be included: SDC, MIC, LOA, RR and AUC.

 

Clinical utility is defined in terms of interpretability and feasibility to complete the PROM and the proxy-reported outcome measure. Studies that report on at least one of the following outcomes will be included:

  

* Interpretability: Mean, SD and MIC.

 

* Feasibility: Time to administer, mode of administration, facilitation required to complete the PROM, patient's burden to complete the PROM and proxy's burden to complete the proxy-reported outcome measure.

 

Types of studies

The current review will consider any study that reports on multidimensional PROMs and proxy-reported outcome measures that assess HRQoL among patients receiving enteral feeding and addresses validity, reliability, responsiveness and clinical utility.

 

Search strategy

The search strategy aims to find both published and unpublished studies. A three-step search strategy will be utilized in this review. An initial limited search of MEDLINE and CINAHL will be undertaken followed by analysis of the text words contained in the title and abstract, and of the index terms used to describe the article. A second search, using all identified keywords and index terms, will then be undertaken across all included databases. Third, citations and the reference list of all selected studies and relevant reviews will be checked to identify any studies that were missed in our searches.

 

Four concepts will be used to determine the search terms: (1) enteral feeding, (2) PROM, (3) QoL and (4) validity. For each concept, both keywords (e.g. MeSH terms for PubMed) and free-text words will be used. All search terms within each concept will be combined with "OR". Then, all concepts will be combined by means of "AND". The full search strategy in PubMed is represented in Appendix I.

 

The databases to be searched include:

 

MEDLINE (via PubMed), Embase, CINAHL and PROQOLID.

 

The search for unpublished studies will include:

 

Google Scholar, MedNar and relevant homepages of patient-reported and proxy-reported outcome tools. Homepages of international and national societies for PROMs, such as the International Society for QoL Research, will be searched. Dissertations and theses will be hand searched; more specifically, three libraries of university colleges for dieticians (VIVES University College Bruges, HoGent campus libraries and Thomas More University College) will be hand searched. The search for dissertations and theses will also be performed by using the ProQuest database.

 

The search for unpublished studies and screening references will be independently performed by two reviewers (I.S. and A.W.).

 

Selecting studies

Selecting studies will also be independently performed by the two reviewers (I.S. and A.W.). The selection will be conducted in two stages: first, both reviewers will screen titles and abstracts against the inclusion criteria. All articles that potentially fulfill all criteria will be retrieved. Subsequently, the full texts will be screened using the same inclusion criteria. Studies published in English, German, French and Dutch will be considered for inclusion in this review, as these languages are mastered by the review team. All studies published prior to the start of this systematic review will be considered for inclusion in this review. Another search, as an update, will be conducted between the period of starting the review and finishing it.

 

Differences in opinion will be resolved by discussion. If no consensus is reached, a third reviewer will be asked to judge whether or not the article fulfills the inclusion criteria. A PRISMA flow chart will be used to present included and excluded studies.57

 

Assessment of methodological quality

Papers selected for retrieval will be assessed by two independent reviewers for methodological quality prior to inclusion in the review.

 

Study quality will be assessed using the COnsensus-based standards for the selection of health status Measurement INstruments (COSMIN) checklist.14,58 This is a validated and internationally accepted tool for the assessment of the methodological quality of studies.14 The checklist focusses on Health-Related Patient-Reported Outcomes (HR-PROs). The COSMIN tool consists of 12 separate sections ("boxes"), of which 10 will be used to evaluate whether a study meets the standard for good methodological quality. Of these, nine boxes represent psychometric properties, and one box focusses on the methodological quality assessment of studies on interpretability of HR-PRO tools.14 The remaining two boxes contain general requirements (i.e. Generalizability box) and will not be used in this review.

 

The COSMIN tool is a modular tool, which means that the number of boxes to fill out depends on the number of psychometric characteristics reported in a study. It may not be necessary to complete the whole checklist when assessing the methodological quality of a particular study. When a study, for example, only assesses responsiveness and criterion validity, two boxes need to be scored. Concerning the validation assessment of an instrument in different language groups, the same box may need to be scored multiple times in case the design of the study was different among countries.14

 

The COSMIN tool uses a 4-point scale (excellent, good, fair and poor) to rate each psychometric property for a particular study.58 A score can be obtained per box by taking the lowest rating of any item in a box ("worse score counts"). Finally, an overall score, which is determined by the lowest box score, can be obtained to rate the methodological quality of the study. The Interpretability box should be completed only for studies that investigate the interpretability of an instrument. The Interpretability box is also recommended to be used in systematic reviews of psychometric properties as a data extraction tool to extract all available information on interpretability issues (i.e. MIC), and therefore, no scoring was developed for that box.58 Appendix II outlines the COSMIN tool to be used.

 

Any disagreements concerning assessment of study quality that arise between the reviewers will be resolved by discussion with a third reviewer.

 

Studies with low quality, defined as "poor" scores in all boxes, will be excluded from the review.

 

Data extraction

Data will be extracted from papers included in the review by means of an attached self-developed data extraction tool (Appendix III). The data extracted will include specific details about the study characteristics, instrument characteristics and the outcomes of significance to the review question and specific objectives. The "Generalizability" box from the COSMIN tool will also be included in the data extraction tool.14

 

The following data will be extracted from included studies:

  

* Study characteristics: Authors, title and year of publication, aim of the study, study design, patient population characteristics (age, gender, medical history, etiology and/or comorbidities, method of enteral feeding, time on receiving enteral feeding and underlying etiology of receiving enteral feeding), characteristics of controls (age and gender), characteristics of proxy (type of proxy, age and gender) and sample size (= size of the population [patient, controls and proxy] in which the instrument was applied).

 

* Descriptive characteristics of the instrument: Name of the instrument, purpose of the instrument (definition of HRQoL related to enteral feeding and definition of the constructs), study/target population, mode of administration (self-administered or interview based), facilitation required to complete the PROM, number and type of scales/domains, number of items per scale/domain, total number of items, response options, range of scores and time to administer, and items related to enteral feeding.

 

* Outcomes: The relevant data on the psychometric properties and the clinical utility will be extracted, as previously reported, under the "Inclusion criteria" and the "Background".

 

Two reviewers will extract all data independently. Any differences in opinion will be discussed and resolved in a consensus meeting between the two reviewers.

 

Authors of primary studies will be contacted by electronic mail in case of missing information or to clarify unclear data.

 

Data synthesis

To judge the psychometric properties of the PROMs and the proxy-reported outcome measures, the quality criteria proposed by Terwee et al.52 will be used. These criteria are based on the criteria of the "Scientific Advisory Committee of the Medical Outcomes Trust"59 and have explicit criteria for what constitutes good psychometric properties. The checklist was applied by Barten et al.60 for the psychometric evaluations of both questionnaires and semi-structured interviews. All psychometric properties of the PROMs will be rated as "+" (positive rating), "-" (negative rating), "?" (indeterminate rating) and "0" (no information available).52

 

For validity, the following quality criteria will be used:

  

* Content validity: Purpose of the instrument: definitions of HRQoL in enteral feeding and the constructs being measured, target population: adults/children/infants/elderly, mode of administration: self-administered/interview based, multiple domains, multiple items per scale/domain, items related to enteral feeding, item selection: target population and experts or investigators were involved in item selection, large range of scores, interpretability of the items (clearly, simple items), recall time (= the time period to which the questions refer) is stated and justified.

 

* Criterion-related validity: Good validity if correlation with gold standard >=0.70. Convincing arguments that gold standard is gold should be described.

 

* Construct validity: Constructs and hypotheses are specified in advance, correlation coefficients (i.e. Spearman or Pearson's correlation coefficient) or appropriate statistics according to the stated hypotheses. Good validity if statistics (i.e. Spearman's rho) show that at least 75% of the results are in correspondence with these hypotheses, in (sub)groups of at least 50 patients or 50 proxies.

 

* Floor and ceiling effects: Absence of floor and ceiling effects: <=15% of respondents achieve the lowest or highest possible score in a sample of at least 50 patients or 50 proxies.

 

For reliability, the following quality criteria will be used:

  

* ReproducibilityAgreement: Good agreement if MIC < SDC or MIC outside the LOA or convincing arguments that agreement is acceptable. A sample size of at least 50 patients or 50 proxies is needed to assess agreement.Reliability: Good reliability if ICC or Weighted Cohen's kappa is at least 0.70 in a sample size of at least 50 patients or 50 proxies.

 

* Internal consistency: Good internal consistency if factor analysis is first applied on adequate sample size (7 x number of items and >100) AND Cronbach's alpha(s) calculated per dimension AND Cronbach's alpha(s) between 0.70 and 0.95.

 

For responsiveness, good responsiveness is obtained if SDC < MIC OR MIC outside the LOA OR RR > 1.96 OR AUC >= 0.70.

 

Clinical utility is defined in terms of interpretability and feasibility to complete the PROM and the proxy-reported outcome measure. Therefore, the following criteria will be used:

  

* Interpretability: Interpretability is good if mean and SD of scores for at least four relevant subgroups of patients are presented AND MIC is defined in a sample size of at least 50 patients or 50 proxies.

 

* Feasibility: Based on the arguments of the studies, it will be clear if the PROM and the proxy-reported outcome measure is feasible in terms of time to administer, mode of administration (interview based or questionnaire), facilitation required to complete the PROM, patient's burden to complete the PROM and proxy's burden to complete the proxy-reported outcome measure.

 

The data will be presented in a narrative text including the psychometric properties and the contextual data, both presented in the included studies. A meta-analysis will not be performed, because it is inappropriate for determining the psychometric properties of the PRO-tools.

 

The results of the review will be presented for each review question separately. Tables will be used in the presentation of the results.

 

Appendix I: Search strategy

Appendix II: Critical appraisal tool (COSMIN checklist)

(Terwee et al.58)

 

Appendix III: Data extraction tool

References

 

1. Marshall S, Haywood K, Fitzpatrick R. Impact of patient-reported outcome measures on routine practice: a structured review. J Eval Clin Pract 2006; 12 5:559-568. [Context Link]

 

2. US Department of Health and Human Services, Food and Drug Administration. Guidance for industry patient-reported outcome measures: use in medical product development to support labeling claims. 2009; 2. http://www.fda.gov/downloads/Drugs/Guidances/UCM193282.pdf. Accessed August 2015. [Context Link]

 

3. Garrett A, Schmidt L, Mackintosh A, Fitzpatrick R. Quality of life measurement: bibliographic study of patient assessed health outcome measures. BMJ 2002; 324 7351:1417. [Context Link]

 

4. Vanderwegen J, Van Nuffelen G, De Bodt M. The validation and psychometric properties of the Dutch version of the Swallowing Quality-of-Life Questionnaire (SWAL-QoL). Dysphagia 2013; 28 1:11-23. [Context Link]

 

5. Pickard AS, Knight SJ. Proxy-evaluation of health-related quality of life: a conceptual framework for understanding multiple proxy perspectives. Med Care 2005; 43 5:493-499. [Context Link]

 

6. Oozeer NB, Corsar K, Glore RJ, Penney S, Patterson J, Paleri V. The impact of enteral feeding route on patient-reported long term swallowing outcome after chemoradiation for head and neck cancer. Oral Oncol 2011; 47 10:980-983. [Context Link]

 

7. Needham DM, Dinglas VD, Bienvenu OJ, Colantuoni E, Wozniak AW, Rice TW, et al. BMJ 2013; 346 19:f1532. [Context Link]

 

8. Martin L, Blomberg J, Lagergren P. Patients' perspectives of living with a percutaneous endoscopic gastrostomy (PEG). BMC Gastroenterol 2012; 12 1:126. [Context Link]

 

9. Sura L, Madhavan A, Carnaby G, Crary MA. Dysphagia in the elderly: management and nutritional considerations. Clin Interv Aging 2012; 7:287-298. [Context Link]

 

10. Bakheit AMO. Management of neurogenic dysphagia. Postgrad Med J 2001; 77 913:694-699. [Context Link]

 

11. National Collaborating Centre for Acute Care (UK). Nutrition support for adults: oral nutrition support, enteral tube feeding and parenteral nutrition. 2006; London: National Collaborating Centre for Acute Care (UK), http://www.nice.org.uk/guidance/cg32http://www.nice.org.uk/guidance/cg32. Accessed December 2015. [Context Link]

 

12. Brotherton AM, Judd PA. Quality of life in adult enteral tube feeding patients. J Hum Nutr Diet 2007; 20 6:513-522. [Context Link]

 

13. Zraick RI, Atherson SR, Ham BK. Readability of patient-reported outcome questionnaires for use with persons with swallowing disorders. Dysphagia 2012; 27 3:346-352. [Context Link]

 

14. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res 2010; 19 4:539-549. [Context Link]

 

15. World Health Organization. Programme on mental health: WHOQOL user manual. 1998; World Health Organization: Geneva, 3. Revision 2012. http://www.who.int/iris/handle/10665/77932. Accessed December 2014. [Context Link]

 

16. Romero M, Vivas-Consuelo D, Alvis-Guzman N. Is health related quality of life (HRQoL) a valid indicator for health systems evaluation? SpringerPlus 2013; 11 2:664. [Context Link]

 

17. Skevington SM, Lotfy M, O'Connell KA. WHOQOL Group. The World Health Organization's WHOQOL-BREF quality of life assessment: psychometric properties and results of the international field trial. A report from the WHOQOL group. Qual Life Res 2004; 13 2:299-310. [Context Link]

 

18. Brotherton A, Abbott J, Aggett P. The impact of percutaneous endoscopic gastrostomy feeding upon daily life in adults. J Hum Nutr Diet 2006; 19 5:355-367. [Context Link]

 

19. Testa MA, Simonson DC. Assessments of quality-of-life outcomes. N Engl J Med 1996; 334 13:835-840. [Context Link]

 

20. Barnett DB. Assessment of quality of life. Am J Cardiol 1991; 67 12:41C-44C. [Context Link]

 

21. Deshpande PR, Rajan S, Sudeepthi BL, Nazir CPA. Patient-reported outcomes: a new era in clinical research. Perspect Clin Res 2011; 2 4:137-144. [Context Link]

 

22. Patrick D, Erickson P. Health policy, quality of life: health care evaluation and resource allocation. In Romero M, Vivas-Consuelo D, Alvis-Guzman N. Is health related quality of life (HRQoL) a valid indicator for health systems evaluation? SpringerPlus 2013; 11 2:664. [Context Link]

 

23. Sneeuw KC, Sprangers MA, Aaronson NK. The role of health care providers and significant others in evaluating the quality of life of patients with chronic disease. J Clin Epidemiol 2002; 55 11:1130-1143. [Context Link]

 

24. de Haan R, Aaronson N, Limburg M, Hewer RL, van Crevel H. Measuring quality of life in stroke. Stroke 1993; 24 2:320-327. [Context Link]

 

25. Lemmens J, Bours GJ, Limburg M, Beurskens AJ. The feasibility and test-retest reliability of the Dutch Swal-QoL adapted interview version for dysphagic patients with communicative and/or cognitive problems. Qual Life Res 2013; 22 4:891-895. [Context Link]

 

26. Timmerman AA, Speyer R, Heijnen BJ, Klijn-Zwijnenberg IR. Psychometric characteristics of health-related quality-of-life questionnaires in oropharyngeal dysphagia. Dysphagia 2014; 29 2:183-198. [Context Link]

 

27. Janssens A, Rogers M, Thompson Coon J, Allen K, Green C, Jenkinson C, et al. A systematic review of generic multi-dimensional patient-reported outcome measures for children, Part II: Evaluation of psychometric performance of English-language versions in a general population. Value Health 2015; 18 2:334-345. [Context Link]

 

28. Teckle P, Peacock S, McTaggart-Cowan H, van der Hoek K, Chia S, Melosky B, et al. The ability of cancer-specific and generic preference-based instruments to discriminate across clinical and self-reported measures of cancer severities. Health Qual Life Outcomes 2011; 9 1:106. [Context Link]

 

29. Peters JB, Daudey L, Heijdra YF, Molema J, Dekhuijzen PN, Vercoulen JH. Development of a battery of instruments for detailed measurement of health status in patients with COPD in routine care: the Nijmegen Clinical Screening Instrument. Qual Life Res 2009; 18 7:901-912. [Context Link]

 

30. Abma IL, van der Wees PJ, Veer V, Westert GP, Rovers M. Sleep Med Rev 2016; 28:14-27. [Context Link]

 

31. Novella JL, Jochum C, Jolly D, Morrone I, Ankri J, Bureau F, et al. Agreement between patients' and proxies' reports of quality of life in Alzheimer's disease. Qual Life Res 2001; 10 5:443-452. [Context Link]

 

32. Sprangers MA, Aaronson NK. The role of health care providers and significant others in evaluating the quality of life of patients with chronic disease: a review. J Clin Epidemiol 1992; 45 7:743-760. [Context Link]

 

33. Sneeuw KC, Aaronson NK, Sprangers MA, Detmar SB, Wever LD, Schornagel JH. Comparison of patient and proxy EORTC QLQ-C30 ratings in assessing the quality of life of cancer patients. J Clin Epidemiol 1998; 51 7:617-631. [Context Link]

 

34. US Department of Health and Human Services, Food and Drug Administration. Guidance for industry patient-reported outcome measures: use in medical product development to support labeling claims. 2009; http://www.fda.gov/downloads/Drugs/Guidances/UCM193282.pdfhttp://www.fda.gov/downloads/Drugs/Guidances/UCM193282.pdf. Accessed December 2015. [Context Link]

 

35. Varni JW, Limbers CA, Burwinkle TM. Parent proxy-report of their children's health-related quality of life: an analysis of 13,878 parents' reliability and validity across age subgroups using the PEDsQLTM 4.0 Generic Core Scales. Health Qual Life Outcomes 2007; 5:2. [Context Link]

 

36. Cella D, Hahn EA, Jensen SE, Butt Z, Nowinski J, Rothrock N. Methodological issues in the selection, administration and use of patient-reported outcomes in performance measurement in health care settings. Washington, DC: National Quality Forum (NQF); 2012. [Context Link]

 

37. Anis MK, Abid S, Jafri W, Abbas Z, Shah HA, Hamid S, et al. Acceptability and outcomes of the percutaneous endoscopic gastrostomy (PEG) tube placement - patients' and caregivers' perspectives. BMC Gastroenterology 2006; 6 37:1-5. [Context Link]

 

38. Crary MA, Groher ME. Introduction to adult swallowing disorders. Philadelphia, PA: Butterworth Heinemann; 2003. [Context Link]

 

39. Bremen, Ickenstein GW. Diagnosis and treatment of neurogenic dysphagia. 1st ed.2011; UNI-MED Verlag AG, [Context Link]

 

40. Ney DM, Weiss JM, Kind AJ, Robbins J. Senescent swallowing: impact, strategies, and interventions. Nutr Clin Pract 2009; 24 3:395-413. [Context Link]

 

41. Flidel-Rimon O, Friedman S, Lev E, Juster-Reicher A, Amitay M, Shinwell E. Early enteral feeding and nosocomial sepsis in very low birthweight infants. Arch Dis Child Fetal Neonatal Ed 2004; 89 4:F289-F292. [Context Link]

 

42. Mahant S, Friedman JN, Connolly B, Goia C, Macarthur C. Tube feeding and quality of life in children with severe neurological impairment. Arch Dis Child 2009; 94 9:668-673. [Context Link]

 

43. Schneider SM, Pouget I, Staccini P, Rampal P, Hebuterne X. Quality of life in long-term home enteral nutrition patients. Clin Nutr 2000; 19 1:23-28. [Context Link]

 

44. Miller RG, Jackson CE, Kasarskis EJ, England JD, Forshew D, Johnston W, et al. Practice parameter update: the care of the patient with amyotrophic lateral sclerosis: multidisciplinary care, symptom management, and cognitive/behavioral impairment (an evidence-based review): report of the quality standards subcommittee of the american academy of neurology. Neurology 2009; 73 15:1227-1233. [Context Link]

 

45. Roberge C, Tran M, Massoud C, Poiree B, Duval N, Damecour E, et al. Quality of life and home enteral feeding: a French prospective study in patients with head and neck or oesophageal cancer. Br J Cancer 2000; 82 2:263-269. [Context Link]

 

46. Finucane TE, Christmas C, Travis K. Tube feeding in patients with advanced dementia: a review of the evidence. JAMA 1999; 282 14:1365-1370. [Context Link]

 

47. Roche V. Percutaneous endoscopic gastrostomy. Clinical care of PEG tubes in older adults. Geriatrics 2003; 58 11:22-26.28-9. [Context Link]

 

48. Marik PE. Aspiration pneumonitis and aspiration pneumonia. N Engl J Med 2001; 344 9:665-671. [Context Link]

 

49. Gomes GF, Pisani JC, Macedo ED, Campos AC. The nasogastric feeding tube as a risk factor for aspiration and aspiration pneumonia. Curr Opin Clin Nutr Metab Care 2003; 6 3:327-333. [Context Link]

 

50. Grant MD, Rudberg MA, Brody JA. Gastrostomy placement and mortality among hospitalized Medicare beneficiaries. JAMA 1998; 279 24:1973-1976. [Context Link]

 

51. Elia M, Russell CA, Stratton RJ, Holden CE, Micklewright A, Barton A, et al. British Association Parenteral and Enteral Nutrition, Trends in home artificial nutrition support in the UK during 1996-2000. A report by the British Artificial Nutrition Survey (BANS). London:2001. [Context Link]

 

52. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol 2007; 60 1:34-42. [Context Link]

 

53. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol 2010; 63 7:737-745. [Context Link]

 

54. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol 2007; 60 1:37.

 

55. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol 2010; 63 7:743.

 

56. Glad J, Jergeby U, Gustafsson C, Sonnander K. Social work practitioners' experience of the clinical utility of the home observation for measurement of the environment (HOME) Inventory. Child Fam Soc Work 2012; 17 1:23-33. [Context Link]

 

57. Moher D, Liberati A, Tetzlaff J, Altman DG. PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med 2009; 6 7:e1000097. [Context Link]

 

58. Terwee CB, Mokkink LB, Knol DL, Ostelo RW, Bouter LM, de Vet HC. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res 2012; 21 4:651-657. [Context Link]

 

59. Scientific Advisory Committee of the Medical Outcomes Trust. Assessing health status and quality-of-life instruments: attributes and review criteria. Qual Life Res 2002; 11 3:193-205. [Context Link]

 

60. Barten JA, Pisters MF, Huisman PA, Takken T, Veenhof C. Measurement properties of patient-specific instruments measuring physical function. J Clin Epidemiol 2012; 65 6:590-601. [Context Link]