A validity-driven approach to the understanding of the personal and societal burden of low back pain: development of a conceptual and measurement model
- Rachelle Buchbinder†1, 2,
- Roy Batterham†3,
- Gerald Elsworth3,
- Clermont E Dionne4, 5,
- Emma Irvin6 and
- Richard H Osborne3
© Buchbinder et al.; licensee BioMed Central Ltd. 2011
Received: 31 May 2011
Accepted: 20 September 2011
Published: 20 September 2011
While the importance and magnitude of the burden of low back pain upon the individual is well recognized, a systematic understanding of the impact of the condition on individuals is currently hampered by the lack of an organized understanding of what aspects of a person's life are affected and the lack of comprehensive measures for these effects. The aim of the present study was to develop a conceptual and measurement model of the overall burden of low back pain from the individual's perspective using a validity-driven approach.
To define the breadth of low back pain burden we conducted three concept-mapping workshops to generate an item pool. Two face-to-face workshops (Australia) were conducted with people with low back pain and clinicians and policy-makers, respectively. A third workshop (USA) was held with international multidisciplinary experts. Multidimensional scaling, cluster analysis, participant input and thematic analyses organized participants' ideas into clusters of ideas that then informed the conceptual model.
One hundred and ninety-nine statements were generated. Considerable overlap was observed between groups, and four major clusters were observed - Psychosocial, Physical, Treatment and Employment - each with between two and six subclusters. Content analysis revealed that elements of the Psychosocial cluster were sufficiently distinct to be split into Psychological and Social, and a further cluster of elements termed Positive Effects also emerged. Finally, a hypothesized structure was proposed with six domains and 16 subdomains. New domains not previously considered in the back pain field emerged for psychometric verification: loss of independence, worry about the future, and negative or discriminatory actions by others.
Using a grounded approach, an explicit a priori and testable model of the overall burden of low back pain has been proposed that captures the full breadth of the burden experienced by patients and observed by experts.
Low back pain affects 80 to 85% of people at some stage in their life [1, 2] and is a major source of morbidity throughout the world. This condition is one of the most common causes of disability, lost work-days and visits to primary care practitioners in high-income countries [4–8]. Not only does low back pain have physical, psychological, social and economic consequences for the individual, its impact upon families, communities, industries and governments is enormous [4, 9, 10]. Recent epidemiological studies indicate that severe low back pain increases into old age and may be increasing in prevalence in adolescence [11, 12], demonstrating a growing public health concern.
While the importance and magnitude of the burden of low back pain upon the individual is well recognized, a systematic understanding of the impact of the condition on individuals is currently hampered by the lack of an organized understanding of what aspects of a person's life are affected, and also by the lack of comprehensive measures of these effects. The burden of a disease is commonly defined in terms of mortality, morbidity (incidence and prevalence), cost and, more recently, disability and quality of life. While these are recognized as components of disease burden, none alone are sufficient for quantifying the overall burden of low back pain from the perspective of the individuals affected.
To date the measurement of the burden of low back pain has been based on indicators such as those mentioned above rather than on empirical reflections of the way in which back pain affects the lives of individuals with the condition and those associated with them. In part this relates to a general problem in measurement development, where measures are often based on theory or historically convenient indicators and tools. Measures developed using this process rarely provide a complete view of an issue and they are usually incomplete in unknown ways. The psychometric literature refers to the failure to cover all aspects of an issue as 'construct under-representation', and highlights this as a serious threat to the validity of any measurement tool [14, 15].
The greater danger is that measures based upon incomplete coverage of a problem may then become widely used, which in turn affects the care provided and the outcomes that are valued (and funded). In relation to back pain, there is a mismatch between traditional approaches to measurement of impact, which have little focus on social issues, and evidence showing that social issues and complex interactions between social, psychological, physical and functional issues are the norm [16, 17].
The present paper has two equal and interacting aims. First, the article aims to develop a conceptual framework that can be generalized cross-culturally, to estimate the various impacts and overall burden of low back pain from the perspective of individuals with this condition and to explore the pathways by which the individual burden of low back pain becomes a burden for society. This conceptual model will then guide the development of the new measure.
The second aim of the paper is to demonstrate, using the example of low back pain, a process for concept definition and instrument development that is consciously and deliberately directed by modern approaches to validity, from the initial stages of conceptualization through all stages of application of the resultant tool.
In trying to capture these interacting aims, we have adopted the term validity-driven to describe a process that includes: grounded approaches to a concept definition that includes consultation with a broad range of stakeholders and deliberately eschews prevailing theories until later in the development process; stakeholder participation in the organization of ideas into groups that form the basis for hypothesizing scales to be included in the measurement tool; the development of a priori hypotheses about the way in which items co-vary and can be used to form measurement scales; recognition that construct validation is an ongoing process, and that an instrument is never validated but that each interpretation of the scores needs to be validated; and the specification of a program of research to support the valid application of the tool in relation to an increasing range of interpretations (uses).
In keeping with this process, the end point of the present paper is the detailing of the hypothesized measurement model of the overall burden of back pain from the perspective of individuals with this condition and the description of a proposed program of validation research. The approaches described in this paper have evolved in the instrument development and application work of members of the research team over more than a decade [18–24]. However, this is the first time that the whole process was formalized in advance, as a comprehensive approach to instrument development.
Materials and methods
Study design and participants
A grounded approach to conceptualization and the identification of draft items maximizes the likelihood that the resultant tool will fully cover the construct; in this case, the burden of low back pain. Our process for grounded conceptualization included three concept mapping groups that utilized processes modified from the methods developed by Trochim. Concept mapping is a formal group process tool for identifying and organizing ideas on a topic of interest. The steps include development of a seeding statement, generation of statements (brainstorming), sorting of the statements, generation of a concept map and revision of the concept map.
The Cabrini Human Research Ethics Committee approved the study (No. 13-02-03-09) and all patients who participated in the study provided written informed consent.
Naming groups of items that are (or are hypothesized to be) related
There are many options for naming groups of items, including clusters, domains, factors, scales and dimensions. We chose not to use the term 'dimensions' because it has a specific meaning when using multidimensional scaling (MDS), which relates to the number of spatial dimensions in which the MDS software seeks to fit the distances between items. We also chose not to use the term 'factors' because it relates to a specific type of statistical technique - factor analysis.
We use the term 'clusters' when we refer to the outcomes of concept mapping and the term 'domains' when we refer to a refined, hypothesized structure for a proposed instrument. These domains are referred to technically as latent variables during psychometric analysis using structural equation modeling. We use the term 'scales' after the psychometric properties of the instrument have been established.
We consider that the matching between clusters, domains (latent variables) and scales is one of the critical elements in demonstrating construct validity of the final tool. We also use the term 'statements' to refer to the ideas generated by participants in the concept mapping groups, and use the term 'items' when we have begun to redraft these statements into a form that is suitable for a questionnaire.
Concept mapping workshops with patients and professionals
We conducted two face-to-face concept mapping workshops in Melbourne, Australia. We sought patients from typical clinical and community settings, with the intention of capturing a broad range of experiences. One workshop included patients with low back pain of varying duration and severity recruited from a community-based rheumatology private practice as well as individuals who had identified themselves as having back pain from a research database of people with chronic conditions who have participated in chronic disease self-management education programs across Australia, held at the Centre for Rheumatic Diseases, University of Melbourne (n = 8).
The other workshop included a diverse range of clinicians and health policy-makers from government, WorkSafe (a government-operated workers' compensation insurance scheme in Victoria, Australia) and private health insurers, identified through professional networks and snowball recruitment (n = 10). We separated the patient and professional groups in order to facilitate frank discussion, and broad and rapid brainstorming.
To maximize the richness and depth of the data obtained, we used a nominal group process, a method for obtaining the most comprehensive possible range of ideas from individuals on a topic of interest. Usual practice in qualitative data collection is to sample to saturation, which is the point at which no new ideas are emerging. The concept mapping process goes to great lengths to be as exhaustive as possible within each group, and therefore saturation is often reached after a small number of groups.
A carefully crafted seeding statement was presented to individuals in each group, who were then asked to work alone for 5 minutes to generate ideas in response to the statement. The seeding statement for patients was: 'Thinking as broadly as you can, generate statements about how low back pain affects your life (considering both yourself and those around you)'. For the health professional group, the seeding statement was slightly different: 'Thinking as broadly as you can, generate statements about how low back pain affects the life of people with the condition and the community'. Participants were asked to write down their responses according to the following rules: one idea per statement, use bullet points, make the statements brief, and work alone. The nominal group technique uses a facilitator who then asks that the ideas be presented to the group in an egalitarian manner, whereby each participant in turn presents one item on their list, starting with the first, until all items have been presented. Participants were discouraged from passing judgments about the statements but were encouraged to seek clarification of the nature or content of the statement if necessary. The critical advantage of this approach is that the perspective of individuals is collected in a manner that is not influenced or biased by the researcher nor influenced by other, and at times dominant, group members.
Once all statements had been presented, participants were asked to sort the statements into conceptually similar groups according to any system that made sense to them. For this step, they were asked to work alone. MDS and cluster analysis were then used to process participants' input and generate two-dimensional maps of key concepts related to low back pain impact and the interrelationships among these clusters.
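The way individual sorts feed the MDS and cluster analysis can be illustrated with a minimal sketch. It assumes the usual concept mapping convention in which each participant's piles are turned into pairwise co-occurrence counts that are summed across participants and then converted to dissimilarities; the function names, the five statements and the three example sorts are hypothetical and are not taken from the study data.

```python
from itertools import combinations


def co_occurrence(sorts, n_statements):
    """Count how many participants placed each pair of statements in the same pile."""
    counts = [[0] * n_statements for _ in range(n_statements)]
    for piles in sorts:                      # one participant's complete sort
        for pile in piles:                   # one pile = list of statement indices
            for i in pile:
                counts[i][i] += 1            # a statement always co-occurs with itself
            for i, j in combinations(pile, 2):
                counts[i][j] += 1
                counts[j][i] += 1
    return counts


def to_distances(counts, n_sorters):
    """Convert co-occurrence counts into dissimilarities between 0 and 1."""
    n = len(counts)
    return [[0.0 if i == j else 1.0 - counts[i][j] / n_sorters for j in range(n)]
            for i in range(n)]


# Hypothetical example: three participants sort five statements (indices 0 to 4).
sorts = [
    [[0, 1], [2, 3, 4]],
    [[0, 1, 2], [3, 4]],
    [[0], [1, 2], [3, 4]],
]
counts = co_occurrence(sorts, 5)
distances = to_distances(counts, len(sorts))
```

Statements that every participant sorted together end up at distance 0, and statements that were never sorted together end up at distance 1, which is the form of input that MDS and cluster analysis typically expect.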
Participants were asked to independently consider and label each group of statements and to check that each of the statements fit within that group. If a statement or statements were not considered to fit within the group, participants were asked to nominate the appropriate grouping. They were also asked to consider whether any of the groups should be joined. After this had been completed on an individual basis, we again used a nominal group approach to organize the final groupings, their labels and the included statements. We also checked for any missing domains/concepts.
Concept mapping with international experts
A similar concept mapping exercise was conducted via email and through a face-to-face workshop at the 10th International Forum for Primary Care Research on Low Back Pain held in Boston in 2009. The expertise of the expert international group was broad and included primary care, rheumatology, occupational health, physiotherapy, chiropractics, epidemiology, public health and health policy.
Prior to the Forum, an email was sent to all participants who had been allocated to the workshop (n = 31) asking them to generate statements in response to a similar seeding statement: 'Thinking as broadly as you can, generate statements about ... how low back pain affects the life of people with the condition and those around them'. Forty-five percent (14/31) of participants responded to this task.
The statements from the patient group, from the clinician/health policy group and from the Forum workshop participants were then combined and redundancies were removed. This final set of statements was then sent to Forum participants in a second email requesting that they sort the statements into conceptually similar groups according to any system that made sense to them. They were also asked to rank each of the statements in order of importance. Fifty-eight percent (18/31) completed this task.
The same process of multidimensional scaling and cluster analysis was used to process participants' input and generate two-dimensional maps of key clusters of low back pain impact and the interrelationships among these clusters.
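As an illustration of how a dissimilarity matrix becomes a two-dimensional map, the following sketch implements classical (Torgerson) MDS in plain Python. This is a stand-in chosen for brevity: the study does not specify its MDS variant at this point, and concept mapping software commonly uses non-metric MDS instead. The function name, the power-iteration approach and the rectangle example are assumptions made for illustration only, and the sketch assumes the largest-magnitude eigenvalues of the centred matrix are positive.

```python
import math


def classical_mds(dist, n_dims=2, iters=500):
    """Place items in n_dims dimensions so Euclidean distances approximate `dist`."""
    n = len(dist)
    sq = [[dist[i][j] ** 2 for j in range(n)] for i in range(n)]
    row_mean = [sum(row) / n for row in sq]
    grand_mean = sum(row_mean) / n
    # Double centring of squared distances gives the inner-product matrix B.
    b = [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean)
          for j in range(n)] for i in range(n)]

    coords = [[0.0] * n_dims for _ in range(n)]
    for d in range(n_dims):
        v = [float(i + 1) for i in range(n)]       # deterministic start vector
        for _ in range(iters):                     # power iteration for top eigenvector
            w = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
        vnorm2 = sum(x * x for x in v)
        # Rayleigh quotient estimates the eigenvalue belonging to v.
        eigval = sum(v[i] * b[i][j] * v[j] for i in range(n) for j in range(n)) / vnorm2
        if eigval <= 1e-12:
            break                                  # no further positive dimension found
        scale = math.sqrt(eigval / vnorm2)
        for i in range(n):
            coords[i][d] = v[i] * scale
        # Deflate B so the next pass finds the next eigenvector.
        for i in range(n):
            for j in range(n):
                b[i][j] -= eigval * v[i] * v[j] / vnorm2
    return coords


# Hypothetical check: four points at the corners of a 3 x 4 rectangle.
points = [(0, 0), (3, 0), (0, 4), (3, 4)]
dist = [[math.hypot(p[0] - q[0], p[1] - q[1]) for q in points] for p in points]
coords = classical_mds(dist, n_dims=2)
```

Because the example distances are exactly Euclidean in two dimensions, the recovered coordinates reproduce them up to rotation and reflection. Dissimilarities derived from sorting data will generally be reproduced only approximately, which is one reason why solutions with more dimensions can be worth examining.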
At the Forum we presented the results of the patient and clinician/health policy-maker workshops and the final concept map that was generated by the Low Back Pain Forum workshop participants. Participants were asked to independently consider and label each group of statements and to check that each of the statements fit within that group. If a statement or statements were not considered to fit within the group, participants were asked to nominate the appropriate grouping. They were also asked to consider whether any of the groups should be joined. After this had been completed on an individual basis, the group worked together to organize the final groupings, their labels and the included statements. We also checked for any missing domains or concepts.
Integration of the three concept maps
At this point we had three concept maps: two from the initial groups and one from the international expert group. The process of integrating the three maps included a number of steps. In addition to the two-dimensional MDS that underlies the concept maps, we undertook three-dimensional and four-dimensional MDS using the Clustan software and repeated the cluster analysis on the outputs of these analyses. Sometimes a three-dimensional or four-dimensional MDS can more accurately capture the similarities between statements and lead to cleaner (more self-evidently homogeneous) clusters. The output of the MDS and cluster analysis is viewed as a tree diagram, which allows every cluster solution to be examined, from a single cluster up to as many clusters as there are items. This diagram allows us to examine the division of items each time a cluster is split into two smaller clusters to determine whether this split has substantive meaning. Through this process we looked to determine the smallest number of clusters (most general concepts) that made sense, the largest number of clusters (most refined concepts) that made sense, and the items that are considered most typical of each refined concept.
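The idea of reading every cluster solution off a single tree can be sketched as follows. The code builds a merge history by agglomerative clustering and then replays it to obtain any requested number of clusters. Average linkage is an assumption made here for brevity; the linkage rule used by the study software is not stated in the text, and the function names and the five-statement dissimilarity matrix are hypothetical.

```python
def average_linkage_tree(dist):
    """Return the merge history of agglomerative clustering with average linkage."""
    clusters = {i: [i] for i in range(len(dist))}
    merges = []
    while len(clusters) > 1:
        best = None
        keys = sorted(clusters)
        for pos, a in enumerate(keys):
            for b in keys[pos + 1:]:
                # Average dissimilarity between all cross-cluster pairs.
                d = sum(dist[i][j] for i in clusters[a] for j in clusters[b])
                d /= len(clusters[a]) * len(clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merges.append((sorted(clusters[a]), sorted(clusters[b]), d))
        clusters[a] = clusters[a] + clusters.pop(b)
    return merges


def cut(merges, n_items, k):
    """Replay the merge history until exactly k clusters remain."""
    clusters = [[i] for i in range(n_items)]
    for left, right, _ in merges[: n_items - k]:
        clusters = [c for c in clusters if c != left and c != right]
        clusters.append(sorted(left + right))
    return sorted(clusters)


# Hypothetical dissimilarities between five statements.
dist = [
    [0.0, 0.2, 0.7, 1.0, 1.0],
    [0.2, 0.0, 0.6, 1.0, 0.9],
    [0.7, 0.6, 0.0, 0.8, 0.7],
    [1.0, 1.0, 0.8, 0.0, 0.1],
    [1.0, 0.9, 0.7, 0.1, 0.0],
]
merges = average_linkage_tree(dist)
two_clusters = cut(merges, 5, 2)
three_clusters = cut(merges, 5, 3)
```

Moving from the two-cluster to the three-cluster solution splits exactly one cluster in two. Each such split is the point at which a researcher would ask whether the division has substantive meaning.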
At the level of the most general concepts, the results from different concept mapping groups tend to be similar. This means that the results can be combined at this level and the results from the different concept mapping groups provide different details under these high-level concepts. These results for each group analysis are displayed as mind maps (Mindjet Mind Manager software, 2010, MindJet Ltd, Sydney, New South Wales, Australia). The mind maps are then combined so that the common general concepts form the first level of detail and the branches represent each substantively meaningful split identified through examination of the tree diagrams.
Throughout this process the researchers attempted to use the cluster names assigned by the original group participants. The mind map aims to provide a clear hierarchical overview of the burden of low back pain as seen by the participants. This hierarchical representation does not, however, show the richness of the relationships between the clusters as well as the original maps. For this reason, the integrated mind map needs to always be considered in conjunction with the original maps.
Refinement of the structural model
The next step in refining the structural model was to check the proposed domains against the original item pool. The researchers classified every statement produced by the three concept mapping groups according to the proposed domains. In performing this classification we were looking for: items that cannot be classified - these may indicate the need for additional domains; items that seem to relate to more than one domain - these may be ambiguous items or may indicate a relationship between the hypothesized domains; domains that still seemed to contain multiple concepts and may need to be split; and match between domain names and the item content - a poor match may require renaming the hypothesized domain.
Number of participants contributing to concept mapping and the number of statements produced
Number of participants
Number of statements
Low back pain patient group
Low Back Pain Forum workshop participants
14 generated statements
18 completed sorting and rating of final set of statementsa
Some of the most notable features of the map shown in Figure 1 are: the large number of statements related to the interaction between the reactions of others and the person's psychological state (seen in the top right-hand corner); the variety of statements related to the effort of living (down the right-hand side), which range from having to think about and plan daily activities and the physical weariness of many activities to having to make enduring changes in lifestyle; the burden related to peoples' interactions with societal institutions, including workplaces and treatment services (left-hand side); and the concepts that have both individual and health service aspects, such as effects of treatment and health states (central clusters). The maps produced by the other groups had a similar range of concepts and a similar emphasis on issues associated with the reactions of others and the effort of daily living.
Clusters, subclusters and representative statements
Clusters and subclusters
Loss of expectations
Limitations on fulfillment of goals in life
Loss of enjoyment
Loss of enjoyment in life
Loss of self-confidence
Low self-esteem, especially from loss of roles
Feel helpless when people stop you doing things
Irritation, anger and frustration
Worry and negative beliefs about the future
Worry about the future
Fear that severe back pain will occur again
Secondary health effects
Difficult to address other health issues
May lead to weight gain
Effort of life/daily grind
Makes you feel old
Loss of motivation in life
Always having to think about what you can and cannot do
Domestic psychosocial challenges
Loss of family and intimate involvement
Left out of family activities
Difficulty caring for others
Loss of independence
Need to ask for help to do things
Challenged integrity/feeling believed
May be seen as a malingerer
Wrongly considered lazy by others
Self-worth degraded by how you feel others see you
Always trying to hide pain from family so they do not worry
Feel like a burden on workmates
Negative/discriminatory actions by others
Bullied by others
May lose friends
Functioning outside the home
Makes it hard to travel
Leisure activities are limited
Specific physical limitations
Daily living is hard including basic self-care
Difficulty lifting things
Hard to sit
General physical impact
More and more physically unfit
Frustration of treatment (quality)
Waste time and money on dubious treatments and practitioners
Unnecessary surgery and the problems this causes
Frustration with healthcare providers
Doctors not understanding there is anything wrong with you
Back pain can make you distrust the medical profession
Impact on othersb
Need help from carers
Dependent on more and more medication
Costs of treatment and equipment (necessary and unnecessary)
Challenges when out of work
Difficult to get back to paid employment
Reduced employment options, now and for the future
Challenges when workingc
Many limitations on what tasks can be done
Effects of employment challenges
Difficult to get health insurance
Reduced income resulting in poverty
As shown in Table 2 we identified four clusters (Psychosocial, Physical, Treatment and Employment), and each cluster included a variable number of subclusters. For example, within the Psychosocial cluster there were six subclusters including loss, negative affect, worry and negative beliefs about the future, global malaise, domestic psychosocial challenges, and negative reactions.
Validity-driven instrument development
Our approach to construct definition and instrument development is based on the tenet that construct validity needs to be the primary concern of all instrument development activities and of all proposed applications of instruments. This is consistent with the descriptions provided by Pedhazur and colleagues, and the Standards for Educational and Psychological Testing developed jointly by the American Educational Research Association, the American Psychological Association and the (American) National Council on Measurement in Education [29, 30]. The Standards describe validation as an ongoing process that commences with the conceptualization and continues each time someone proposes an additional interpretation or application of the tool.
While it is common practice in health research to refer to a tool as either validated or unvalidated, it is not tools but only their interpretations and applications that are validated. To maximize the likelihood of producing valid data in relation to a range of possible interpretations and applications of a tool, there are development processes that seek to protect the instrument against two categories of error: measuring less than the proposed construct (construct underrepresentation) or measuring more (construct irrelevant variance). Protection against the first type of error requires rigor in the processes of conceptualization and definition and the identification of a range of indicators. Protection against the second type of error requires rigor in psychometric analysis. We believe that three disciplines help achieve this necessary rigor: the use of grounded approaches for construct definition; the development of a priori structural hypotheses (that define relevant versus irrelevant variance); and the development of a priori, relational hypotheses as a basis for future construct validation.
Standards relating validity to interpretations
A rationale should be presented for each recommended interpretation and use of test scores, together with a comprehensive summary of the evidence and theory bearing on the intended use or interpretation
The test developer should set forth clearly how test scores are intended to be interpreted and used. The population(s) for which a test is appropriate should be clearly delimited and the construct that the test is intended to assess should be clearly described
If validity for some common or likely interpretation has not been investigated, or if the interpretation is not consistent with the available evidence, that fact should be made clear and potential users should be cautioned against making unsupported interpretations
If a test is to be used in a way that has not been validated, it is incumbent on the user to justify the new use, collecting new evidence if necessary
An important initial step in scale development, and the final step in development of the hypothesized model, involves writing (hypothesized) descriptors about characteristics of people with a high score and people with a low score on scales related to each hypothesized domain. This exercise helps to clarify whether the domain can be represented as a scale or whether it is simply a checklist of possible characteristics, the desired range of item difficulty, and possible relationships between scale scores and other variables (other scales, demographic and clinical variables, outcomes of interventions). This final point is an important and often neglected step in preparing for construct validation by developing a broad range of a priori hypotheses about the behavior of the scales in relation to other variables (the so-called nomothetic web) [28, 31].
Proposed interpretations/applications and evidence required to support the measure's validity for low back pain burden
Evidence of validity or activities to obtain this evidence
Interpretations/applications applied to groups - supported through initial development processes
Describe the burden of low back pain on a set of scales that reflects the full range of the experience of people with low back pain
Thorough, grounded identification of the range of issues that contribute to low back pain burden
Iterative process of organizing these into domains and potential scales
Comparison with interview data at a number of stages of development
Quantify variations in the effects of low back pain across a broad range of sufferers on a range of scales
Cluster analysis to identify score profiles and qualitative confirmation of these
Tests of structural invariance across groups
Interpretations/applications applied to groups - supported through subsequent applications of the tool a
Describe the relative importance of different domains of low back pain burden in comparing one population with another (for example, needs identification)
Accumulated evidence about what is a high average score and what is a low average score for each scaleb
Establishment of whole of population norms and subgroup norms
Tests of structural invariance
Validly assess changes in low back pain burden in a group over time or as a result of interventions
Application for a range of evaluation purposes including comparison with other subjective and objective indicators of change
Development of estimates of meaningful change
Interpretations/applications applied to individuals
Assess the relative needs of an individual with low back pain across a range of domains
Attention to item scaling properties during psychometric development
Comparison with other subjective and objective indicators of status
Measure changes in individuals over time or in response to interventions
Comparison with other subjective and objective indicators of change
Development of estimates of meaningful change
Implications for the measurement of the burden of low back pain
One of the primary reasons for conducting this research was the observation that existing instruments inadequately capture the range of impacts of low back pain that are commonly reported by people with low back pain and the clinicians who work with them. This project has produced a conceptual framework that includes many concepts not included in the tools most commonly used to assess needs and/or outcomes for people with low back pain.
At one end of the spectrum, because low back pain has until recently been thought mainly a work-related problem, outcome measures have often been limited to occupational aspects of burden: most of all, measures of absence from work, and the consequent financial costs. Such measures only capture part of the burden of low back pain.
At the other end of the spectrum, Deyo and colleagues proposed a core set of six indicators for routine clinical use that included pain symptoms, function, well-being, disability, social role and satisfaction with care. Bombardier proposed another core set of measures for evaluating the effectiveness of treatment in clinical trials and routine care. Recognizing the importance of the patient's perspective, she proposed the following five domains: back-specific function, generic health status, pain, work disability, and patient satisfaction. Similar to these proposals, the Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials group recommended a core set of six outcome domains be considered in chronic pain clinical trials: pain, physical functioning, emotional functioning, participant ratings of global improvement and satisfaction with treatment, symptoms and adverse events, and participant disposition.
More recently, Kopec and colleagues proposed a web-based computerized adaptive test (CAT-5D-QOL) to measure five domains of health-related quality of life (Daily Activities, Walking, Handling Objects, Pain or Discomfort, and Feelings) for patients with back pain based upon item banks developed for these domains relevant to arthritis. Many measures have been developed to specifically quantify the limitations that low back pain places upon functional status. For example, in a 2004 systematic review Grotle and colleagues identified a total of 36 back-specific questionnaires. The authors classified the content of the questionnaires based upon the World Health Organization's International Classification of Functioning, Disability and Health (ICF); they found that while most of the questionnaires had a focus on activity limitations, there was a wide variation in their underlying constructs and content. Many questionnaires also included constructs of pain and symptoms, sleep disturbances, psychological dysfunction, physical impairment and social functions.
The brief and comprehensive ICF core sets for low back pain, based upon the ICF framework, are further attempts to develop a standardized set of indicators that encompass the key functional problems of patients with low back pain. They are envisaged for a variety of purposes, including clinical studies and multidisciplinary assessment in clinical care. The core sets were formed by consensus among a group of international clinical experts comprising physicians and occupational and physical therapists, who integrated evidence from three sources: a Delphi exercise to identify the most relevant ICF categories in patients with chronic health conditions including back pain; a systematic review to identify the concepts contained in outcome measures in clinical trials of musculoskeletal disorders and chronic widespread pain; and a study in a convenience sample of people undergoing rehabilitation for one of several chronic conditions, including low back pain, who were administered the ICF checklist. The comprehensive and brief ICF core sets include 78 and 35 categories, respectively, which cover not only aspects related to pain but also a wide spectrum of activities, social and environmental factors that affect functioning. In keeping with our conceptual model, these core sets recognize the importance of support and relationships, attitudes of significant others and health professionals as predictors of disability in people with low back pain.
A Norwegian study in a convenience sample of 118 patients with low back pain, however, has identified gaps in the comprehensive ICF core set with respect to capturing problems of importance to patients. This study examined the relationship between health problems rated by health professionals using the comprehensive ICF core set and patients' self-reported health problems identified by the Oswestry Disability Index and the World Health Organisation Disability Assessment Schedule II. Relevant domains not covered by the ICF included the subjective domain related to the impact of back pain and the feeling of being a burden to their family, while problems with sexual functions and relationships were poorly reflected in the health professionals' assessments.
Our model for the measurement of the burden of low back pain aims to comprehensively capture all of the various impacts of this condition on the individual. The model includes several domains that have not until now been considered important to measure in patients with low back pain, although they may contribute significantly to the individual's burden; these include loss of independence, worry about the future, negative or discriminatory actions by others, and secondary health effects.
The new tool will have a wide range of potential applications for researchers, clinicians, policy-makers and insurance agencies; and for a range of purposes, including needs identification, service planning, evaluation, research and, eventually, for individual clinical assessment and monitoring. In suggesting such a range of applications, we are aware of our responsibility to consider the evidence for validity in relation to each interpretation and application [29, 30].
To strengthen potential generalizability, we have used both a local approach and an international approach to scope and define low back pain burden, drawing on nominal group approaches and concept mapping. The questionnaire is being developed with input from an international team of experts in the field. To facilitate comparison of the burden of back pain between countries and between studies, steps are being taken to ensure its wide applicability and cross-cultural generalizability.
In assessing health priorities, allocating resources, and evaluating the potential costs and benefits of public health interventions, governments often consider the burden of a disease and its contribution to the overall health of the population. Information obtained from a single comprehensive measure of back pain burden will greatly enhance research efforts to identify major determinants of back pain burden and the population groups that are most affected, and to ensure efficient allocation of resources. This information may also inform the development and evaluation of novel interventions that could improve patient-relevant outcomes.
While the measurement model (Figure 3) does test for a single underlying latent variable, which we have called the burden of low back pain, we expect the questionnaire will be used as a multidimensional tool providing a profile of scores across the various scales. We will not be attempting to provide a scoring mechanism to gain a single overall score. In our experience it is more useful to be able to use profiles of scores to describe the needs of different patient groups and to distinguish the benefits of different types of interventions than to generate a global indicator that is at such a high level of abstraction that no one will be clear what it means. A profile of scores will also serve to highlight the critical psychosocial aspects of the burden of low back pain that have not been adequately addressed in existing tools. It is hoped that this profile of scores will support a greater clinical emphasis and increased research focus on these aspects of the burden experienced by people with back pain.
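The idea of reporting a profile of scale scores rather than one global score can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration only: the item identifiers (`psy1`, `soc1`, and so on), the item-to-domain mapping, the numeric response coding and the use of a simple mean per scale are hypothetical, and are not the scoring rules of the questionnaire under development. Only the six domain names are taken from the hypothesized model.

```python
from statistics import mean

# Hypothetical mapping of item identifiers to the six hypothesized domains.
# The real item pool and scale composition are not specified here.
DOMAIN_ITEMS = {
    "Psychological": ["psy1", "psy2"],
    "Social": ["soc1", "soc2"],
    "Physical": ["phy1", "phy2"],
    "Treatment": ["trt1"],
    "Employment": ["emp1"],
    "Positive Effects": ["pos1"],
}


def score_profile(responses):
    """Return one score per domain (mean of answered items).

    `responses` maps item identifiers to numeric answers. A domain with
    no answered items is reported as None. No overall total is computed,
    so the result stays a profile rather than a single global indicator.
    """
    profile = {}
    for domain, items in DOMAIN_ITEMS.items():
        answered = [responses[i] for i in items if responses.get(i) is not None]
        profile[domain] = mean(answered) if answered else None
    return profile


if __name__ == "__main__":
    # Illustrative (invented) responses for one person.
    example = {"psy1": 4, "psy2": 2, "soc1": 5, "phy1": 1, "phy2": 2}
    for domain, score in score_profile(example).items():
        print(f"{domain}: {score}")
```

Keeping the output as a dictionary keyed by domain means that two patient groups with similar overall burden but different patterns (for example, high Social and low Physical scores versus the reverse) remain distinguishable.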
The present paper has described the process of developing a strong, a priori hypothesis of a measurement model for a multidimensional measurement of the burden of low back pain. The model will now be tested with a sample of approximately 600 people and may be refined on the basis of structural equation modeling analysis of the data. The refined tool will be retested on a separate (validation) sample of another 600 people. These are all foundational steps in a process of establishing construct validity for an expanding range of applications of the tool.
This paper has demonstrated how the application of a rigorous set of disciplines, by which grounded consultation and conceptualization processes lead to a strong a priori hypothesis relating to measurement, provides a firm foundation for building the evidence of validity for a wide range of potential interpretations and applications. The conceptualization process has led to a much richer and more extensive set of concepts relevant to assessing the needs of people with back pain than is captured in the outcome tools previously applied.
ICF: International Classification of Functioning, Disability and Health
The authors would like to acknowledge all participants in the concept mapping workshops for their valuable contribution to this work. RBu is supported in part by an Australian National Health and Medical Research Council Practitioner Fellowship, and RHO is supported in part by an Australian National Health and Medical Research Council Population Health Career Development Award.
- World Health Organisation: The burden of musculoskeletal conditions at the start of the new millennium. World Health Organ Tech Rep Ser. 2003, 919: i-x. 1-218, back cover
- Von Korff M, Dworkin SF, Le Resche L, Kruger A: An epidemiologic comparison of pain complaints. Pain. 1988, 32: 173-183. 10.1016/0304-3959(88)90066-8.
- Hoy D, Bain C, Williams G, March L, Brooks P, Blyth F, Woolf A, Vos T, Buchbinder R: Global prevalence of low back pain. Arthritis Rheum. 2011,
- Rapoport J, Jacobs P, Bell NR, Klarenbach S: Refining the measurement of the economic burden of chronic diseases in Canada. Chronic Dis Can. 2004, 25: 13-21.
- Ricci JA, Stewart WF, Chee E, Leotta C, Foley K, Hochberg MC: Back pain exacerbations and lost productive time costs in United States workers. Spine. 2006, 31: 3052-3060. 10.1097/01.brs.0000249521.61813.aa.
- Freburger JK, Holmes GM, Agans RP, Jackman AM, Darter JD, Wallace AS, Castel LD: The rising prevalence of chronic low back pain. Arch Intern Med. 2009, 169: 251-258. 10.1001/archinternmed.2008.543.
- Walker B, Muller R, Grant W: Low back pain in Australian adults. Prevalence and associated disability. J Manipulative Physiol Ther. 2004, 27: 238-244. 10.1016/j.jmpt.2004.02.002.
- Walker B, Muller R, Grant W: Low back pain in Australian adults. Health provider utilization and care seeking. J Manipulative Physiol Ther. 2004, 27: 327-335. 10.1016/j.jmpt.2004.04.006.
- Dionne C, Dunn K, Croft P: Does back pain prevalence really decrease with increasing age? A systematic review. Age Ageing. 2006, 35: 229-234.
- Walker B, Muller R, Grant W: Low back pain in Australian adults: the economic burden. Asia Pac J Public Health. 2003, 15: 79-87. 10.1177/101053950301500202.
- Jeffries LJ, Milanese SF, Grimmer-Somers KA: Epidemiology of adolescent spinal pain. A systematic overview of the research literature. Spine. 2007, 32: 2630-2637. 10.1097/BRS.0b013e318158d70b.
- Jette A: Toward a common language for function, disability, and health. Phys Ther. 2006, 86: 726-734.
- Briggs AM, Buchbinder R: Back pain: a national health priority area in Australia?. Med J Aust. 2009, 190: 499-502.
- Messick S: Validity. Educational Measurement. Edited by: Linn RL. 1989, New York: Macmillan, 13-103. 3
- Messick S: Standards-based Score Interpretation: Establishing Valid Grounds for Valid Inferences. 1994, Princeton, NJ: Educational Testing Service
- Main C, Foster N, Buchbinder R: How important are back pain beliefs and expectations for satisfactory recovery from back pain?. Best Pract Res Clin Rheumatol. 2010, 24: 205-218. 10.1016/j.berh.2009.12.012.
- Hayden J, Dunn K, van der Windt D, Shaw W: What is the prognosis of back pain?. Best Pract Res Clin Rheumatol. 2010, 24: 167-180. 10.1016/j.berh.2009.12.005.
- Hawthorne G, Richardson J, Osborne R: The Assessment of Quality of Life (AQoL) instrument: a psychometric measure of health-related quality of life. Qual Life Res. 1999, 8: 209-224. 10.1023/A:1008815005736.
- Batterham R, Southern D, Appleby N, Elsworth G, Fabris S, Dunt D, Young D: Construction of a GP integration model. Soc Sci Med. 2002, 54: 1225-1241. 10.1016/S0277-9536(01)00092-2.
- Osborne RH, Elsworth GR, Whitfield K: The Health Education Impact Questionnaire (heiQ): an outcomes and evaluation measure for patient education and self-management interventions for people with chronic conditions. Pat Ed Counsel. 2007, 66: 192-201. 10.1016/j.pec.2006.12.002.
- Osborne RH, Norquist JM, Elsworth GR, Busija L, Mehta V, Herring T, Gupta SB: Development and validation of the Influenza Intensity and Impact Questionnaire (FluiiQTM). Value Health. 2011, 14: 687-699. 10.1016/j.jval.2010.12.005.
- Busija L: The avoidable burden due to arthritis in Australia. PhD thesis. 2010, University of Melbourne
- Jordan J: Understanding the role and impact of health literacy on patient health outcomes to facilitate effective health interventions. PhD thesis. 2010, University of Melbourne
- Ciciriello S: Development and testing of a methotrexate multimedia patient education module. PhD thesis. 2011, University of Melbourne
- Trochim W: An introduction to concept mapping for planning and evaluation. Eval Program Plann. 1989, 12: 1-16. 10.1016/0149-7189(89)90016-5.
- van der Ven A, Delbecq A: The effectiveness of nominal, Delphi and interacting group decision making processes. Acad Management J. 1974, 17: 605-621.
- Wishart D: ClustanGraphics 8. 2005, Edinburgh: Clustan Ltd, 8
- Pedhazur E, Pedhazur Schmelkin L: Measurement, Design and Analysis: An Integrated Approach. 1991, Hillsdale, NJ: Lawrence Erlbaum Associates
- American Educational Research Association, American Psychological Association, Joint Committee on Standards for Educational and Psychological Testing (US), National Council on Measurement in Education: Standards for Educational and Psychological Testing. 1999, Washington, DC: American Educational Research Association
- American Psychological Association, American Educational Research Association, National Council on Measurement in Education, American Psychological Association: Standards for Educational & Psychological Tests. Standards for Educational and Psychological Testing. 1985, Washington, DC: American Psychological Association
- Cronbach L, Meehl P: Construct validity in psychological tests. Psychol Bull. 1955, 52: 281-302.
- Deyo RA, Battie M, Beurskens AJ, Bombardier C, Croft P, Koes B, Malmivaara A, Roland M, Von Korff M, Waddell G: Outcome measures for low back pain research. A proposal for standardized use. Spine. 1998, 23: 2003-2013. 10.1097/00007632-199809150-00018.
- Bombardier C: Outcome assessments in the evaluation of treatment of spinal disorders. Summary and general recommendations. Spine. 2000, 25: 3100-3103. 10.1097/00007632-200012150-00003.
- Turk DC, Dworkin RH, Allen RR, Bellamy N, Brandenburg N, Carr DB, Cleeland C, Dionne R, Farrar JT, Galer BS, Hewitt DJ, Jadad AR, Katz NP, Kramer LD, Manning DC, McCormick CG, McDermott MP, McGrath P, Quessy S, Rappaport BA, Robinson JP, Royal MA, Simon L, Stauffer JW, Stein W, Tollett J, Witter J: Core outcome domains for chronic pain clinical trials: IMMPACT recommendations. Pain. 2003, 106: 337-345. 10.1016/j.pain.2003.08.001.
- Kopec JA, Badii M, McKenna M, Lima VD, Sayre EC, Dvorak M: Computerized adaptive testing in back pain: validation of the CAT-5D-QOL. Spine. 2008, 33: 1384-1390. 10.1097/BRS.0b013e3181732a3b.
- Grotle M, Brox JI, Vollestad NK: Functional status and disability questionnaires: what do they assess? A systematic review of back-specific outcome questionnaires. Spine. 2005, 30: 130-140.
- Cieza A, Stucki G, Weigl M, Disler P, Jackel W, van der Linden S, Kostanjsek N, de Bie R: ICF core sets for low back pain. J Rehabil Med. 2004, 36: 69-74. 10.1080/16501960410016037.
- Weigl M, Cieza A, Cantista P, Reinhardt JD, Stucki G: Determinants of disability in chronic musculoskeletal health conditions: a literature review. Eur J Phys Rehabil Med. 2008, 44: 67-79.
- Brockow T, Cieza A, Kuhlow H, Sigl T, Franke T, Harder M, Stucki G: Identifying the concepts contained in outcome measures of clinical trials on musculoskeletal disorders and chronic widespread pain using the international classification of functioning, disability and health as a reference. J Rehabil Med. 2004, 30-36.
- Ewert T, Fuessl M, Cieza A, Andersen C, Chatterji S, Kostanjsek N, Stucki G: Identification of the most common patient problems in patients with chronic conditions using the ICF checklist. J Rehabil Med. 2004, 22-29.
- Røe C, Sveen U, Bautz-Holter E: Retaining the patient perspective in the International Classification of Functioning, Disability and Health Core Set for low back pain. Patient Pref Adherence. 2008, 2: 337-347.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.