CHAPTER 13 PROGRAM EVALUATION IN HEALTH CARE
Program evaluation is concerned with finding out how well programs work by using social and behavioral science research techniques to assess information of importance to program administrators and public policy makers. The fundamental purpose of program evaluation is to provide information for decision making. Ultimately, evaluation is a judgment of merit or worth about a particular person, place, or thing.
The term research refers to systematic inquiry that leads one to discover or revise knowledge about a particular subject. Basic research is generally focused on discovering facts, relationships, behaviors, and underlying principles. Applied research often deals with the same phenomena, but the focus is usually less on the discovery of basic knowledge and more on the development of tools or the application of knowledge to develop solutions to actual problems. Evaluation is an example of applied research. Administrators, educators, policy makers, and others face questions (problems) about designing, implementing, continuing, and improving social, educational, health, and other programs. Evaluators assess or evaluate those programs to discover or revise knowledge about them and the problems they were designed to address so that informed judgments can be made, modifications can be implemented, and solutions can be achieved.
As researchers, program evaluators engage in scientific inquiry. They use tests, questionnaires, and other measurement devices. They collect and analyze data systematically by using common statistical procedures. Finally, they typically describe their findings in formal reports.1
An important difference between basic research and evaluation research is the generality of the findings. Ideally, the basic scientist is searching for basic knowledge; the more fundamental, the better. Fundamental facts and principles, such as Einstein’s theory of relativity, have broad applicability. They generalize across wide areas of knowledge. Most applied scientists, and program evaluators in particular, usually deal with specific problems in specific settings. Their findings or conclusions can seldom be generalized to “similar” problems.
To elaborate on this distinction between the basic science researcher and the evaluator, consider the role each individual might play in the testing of a fluoride rinse. In examining the value of fluoride rinse, the basic science researcher would probably be concerned with the effects of fluoride on teeth, the strength of the solution necessary to produce a reduction in caries, and whether the conclusions could be generalized across the population. The evaluator would be more concerned with determining whether the actual mouth-rinse program, initiated to test the researcher’s conclusion, was run correctly and met its stated objectives. The evaluator’s interest in the fluoride rinse itself is only secondary. Once the evaluator has judged whether the program is an accurate test of the fluoride rinse, secondary findings might then address the positive or negative effects of the rinse. In other words, the particular program’s operation is of prime importance to the evaluator, and the effect of fluoride is important only in terms of its results as applied to a realistic, closely monitored program.
Determining the value of things is another difference between evaluation and basic research. Evaluation eventually comes down to making a decision about what should be done or which course of action is best. Basic researchers strive only to obtain accurate, truthful information. There is no requirement to attach assessments of merit to the discovered knowledge.1 Theoretically, the basic scientist’s task does not involve making value judgments. The evaluator walks a fine line when it comes to value judgments. By its nature, evaluation research is based in a value context: the ultimate question, after all, is whether the subject (program) being studied is “of value.” The evaluator must understand the value context within which he or she works. The best evaluation studies are those in which the evaluator is fully cognizant of this value context and is then able to “do objective science” that addresses critical questions.
Evaluation studies ultimately focus on the goals, objectives, or intent of the program or activity being studied. At the simplest level we ask, Does this program do what it was designed to do? There are, of course, many other facets to evaluation. One of the most useful frameworks for looking at the evaluation research task has been put forward by Donabedian.2 He suggests that assessment or evaluation can profitably look at structure, process, and outcome.
Structure refers to the program setting and logistics (i.e., facilities, equipment, financing, human resources). Process refers to the techniques or methods employed in the provision of program services (i.e., delivering health care, educating children). Outcome refers to the “real world” impacts, effects, and changes brought about as a result of the program being evaluated.
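As a concrete sketch of how an evaluator might organize findings under Donabedian's three categories (a hypothetical illustration only; the class and field names are invented, not part of Donabedian's framework):

```python
from dataclasses import dataclass, field

# Hypothetical record grouping evaluation findings by Donabedian's
# three categories: structure, process, and outcome.
@dataclass
class EvaluationFindings:
    structure: dict = field(default_factory=dict)  # setting and logistics
    process: dict = field(default_factory=dict)    # service-delivery methods
    outcome: dict = field(default_factory=dict)    # real-world effects

# Invented example entries for a fluoride rinse program:
findings = EvaluationFindings()
findings.structure["staffing"] = "2 trained paraprofessionals per site"
findings.process["protocol_adherence"] = 0.92   # fraction of sessions done correctly
findings.outcome["caries_reduction"] = 0.15     # observed change vs. baseline
```

Keeping the three categories separate in this way makes it easier to see where a problem lies: a weak outcome figure can be traced back to a structural or process deficiency rather than treated in isolation.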
Donabedian rightly sees structure, process, and outcomes as inextricably linked: the interrelationships are critical to the program’s ability to meet its goals or fulfill its intent. Examining structure, process, and outcomes allows the evaluator to identify more clearly where problems and program liabilities lie and, hence, where corrections can be made if goals are to be met. Looking at goals, structure, process, and outcomes should be the primary focus for the evaluator. A second set of concerns also exists, however. These questions might be classified as “client” questions; that is, for whom and why is the evaluation research being conducted? These are not trivial questions. The researcher must understand, for example, the hierarchy of authority in the organization involved, what its interests and objectives are in requesting an evaluation, and what sorts of questions need to be asked. Often, one of the evaluator’s biggest contributions lies in his or her ability to help administrators clarify their thinking about the need for and use of evaluation research.
By way of illustration, consider a situation in which a dental school implements a new curriculum for its students. An evaluator who is brought in designs and carries out a carefully planned study to determine if the program has the resources it needs (structure), how well the program is running (process), and how successful the graduates are (outcomes). Such an evaluation is appropriate if the client’s interest is to determine if the curriculum is functioning properly and meeting its goals. The design would not be appropriate, however, if the client wanted to know if the graduates of the new curriculum were better-trained professionals than those of the old curriculum. The evaluator must understand the client’s focus. Without such an understanding, valuable time and resources may be wasted on a study that answers neither the fundamental questions nor the client’s needs.
Individuals interested in the results of evaluation may include program developers, program staff, program directors, policy makers (state or federal bureaucrats), program directors in other similar agencies, or epidemiologists.3 Different groups of people have different needs and thus seek different information. Program developers seek information about ways to improve specific parts of programs that affect them directly. The director of the program is usually interested in knowing the overall effectiveness of the basic program, although he or she is generally more concerned with finding out what specific modifications will be needed to improve the organization and operation of the program. Financial issues are usually of concern to policy makers, who question whether a program should be continued as is, given more resources, or canceled. Costs and benefits are of paramount concern to them. Staff from other programs are interested in whether the program can be generalized for possible adaptation or adoption. Epidemiologists may seek to compare the effect of different program principles and generalize about the factors responsible for success.
Clearly, the evaluator faces a number of potentially competing interests. In responding to those interests the researcher must distinguish between different types of evaluation. As we have seen, Donabedian’s framework allows us to focus on the critical features or components that make up a program. These factors must be taken into account if evaluation efforts are to be successful and useful. At the same time, Scriven4 draws our attention to the fact that evaluation research may be one of two types. He uses the terms formative and summative to describe these types.
Formative evaluation refers to the internal evaluation of a program. It is an examination of the processes or activities of a program as they are taking place. It is usually carried out to aid in the development of a program in its early phases.
The following situation is one in which a formative evaluation is appropriate: a fluoride rinse program is initiated at a neighborhood health center in which paraprofessionals are trained to administer three types of fluoride rinses under a strict sequence of procedures. After 3 days of operation, the work of the paraprofessionals is observed to determine the extent of adherence to that sequence. The observation and determination of correct or incorrect procedure sequence provide an example of examining the activities of a program as they are occurring (formative evaluation). If the sequence is incorrect, formative evaluation allows the program to make remedial changes at that point and thereby improve performance. Such a strategy is much better than waiting until the program is completed and then announcing that there were procedural errors. Formative evaluation is used primarily by program developers and program staff members concerned with whether various components of a program are workable or whether changes should be made to improve program activities.
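The adherence check in this example can be sketched in code (a hypothetical illustration; the step names and observed data are invented): each observed sequence of steps is compared against the required protocol, and deviations are flagged for remedial training.

```python
# Hypothetical required protocol for administering a fluoride rinse:
REQUIRED_SEQUENCE = ["dispense_rinse", "timed_swish", "expectorate"]

def check_adherence(observed_steps):
    """Return a list of (position, required, observed) tuples where the
    observed sequence deviates from the protocol; an empty list means
    full adherence."""
    deviations = []
    for i, required in enumerate(REQUIRED_SEQUENCE):
        observed = observed_steps[i] if i < len(observed_steps) else None
        if observed != required:
            deviations.append((i, required, observed))
    return deviations

# A formative check after a few days of operation (invented observations):
ok = check_adherence(["dispense_rinse", "timed_swish", "expectorate"])
bad = check_adherence(["dispense_rinse", "expectorate"])  # skipped a step
```

Run while the program is still in progress, a report like `bad` identifies exactly which step was missed, so retraining can happen immediately rather than after the program ends.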
Summative evaluation, by contrast, judges the merit or worth of a program after it has been in operation. It is an attempt to determine whether a fully operational program is meeting the goals for which it was developed. Summative evaluation is aimed at program decision makers, who will decide whether to continue or terminate a program, and also at decision makers from other programs who might be considering adoption of the program.
Different evaluation designs are needed to carry out these two types of evaluation. Different types of measures and time schedules also are required. Because most programs are ongoing, with changes often being made “on the fly,” a discernible end point or completion date may not exist. In such cases the dichotomy between formative and summative evaluation may not be as precise as described here, and formative evaluation may continue to be important as the program develops and matures.
Most health programs can be divided into four phases of implementation, which should occur in sequence: (1) the pilot phase, the development of which proceeds on a trial-and-error basis; (2) the controlled phase, in which a model of a particular program strategy is run under regulated conditions to judge its effectiveness; (3) the actualization phase, in which a model of the program strategy is subjected to realistic operating conditions; and (4) the operational phase, in which the program is an ongoing part of the organizational structure. Often this ideal progression from phase 1 to phase 4 does not occur, and a program becomes lodged at one stage of development. Each phase has different objectives to be met and thus different evaluation designs by which to best assess achievement of program objectives. Formative evaluation plays an important part in both the pilot phase and the controlled phase of program implementation. Summative and formative evaluations are used during the actualization phase, whereas the final operational phase is evaluated with a summative evaluation design.5
One generalization that can be made of health program evaluation is that it is primarily concerned with how well a program is meeting its goals, either at some formative stage (so that the information can be fed back into the program) or at the end. The first step in evaluation, then, is to discover what the program goals are and to then restate them as clear, specific objectives written in measurable terms.
This first step is often a formidable task. Many program directors and staff members develop only general goals expressed as vague abstractions. They find it difficult to translate them into concrete specifications of the changes in behavior, attitude, knowledge, or health outcome that they hope to effect. In addition, programs often have multiple goals. Some are more important than others, some are more immediate (as opposed to long range), some are easier to study, and some may be incompatible with others. Yet all program directors and staff members must establish a sense of goal priorities if they, or external evaluators, are to assess the operation of their program. In many instances directors and staff members are unable to sort out goals, objectives, and priorities clearly, and they find it useful to bring in outside evaluators or administrative consultants to assist in this process.
Because goal statements are so often ambiguous and poorly stated, many observers have been led to speculate about the underlying reasons for this state of affairs. One view is that it usually requires support from diverse groups and individuals to get a program accepted. Program goals must be formulated in ways that satisfy the diversity of interests represented. Another speculation is that program planners lack experience with expressing their thoughts in measurable terms and concentrate mainly on the specifics of program operation. In one sense ambiguous goal statements serve a useful function: they hide differences among diverse groups by allowing for a variety of interpretations. However, such differences between groups and staff or within the staff can be disruptive when the program is implemented. Once a program has been initiated, if there is lack of true consensus as to what the program is specifically attempting to achieve, progress is difficult. Each staff member may be pulling in a different direction and trying to implement a different interpretation of the goal. As an outside agent or more objective observer, the evaluation study director can make a substantial contribution to program planning and administration in formulating goals, clarifying priorities, and reconciling divergent viewpoints related to program direction.
Ultimately, of course, evaluation attempts to measure the outcomes of a particular program. If a program’s goals cannot be operationalized (stated in a precise, measurable manner), it becomes nearly impossible to determine whether the desired outcomes of a program have been achieved. In other words, without clearly stated goals and objectives, evaluation becomes an imprecise tool of questionable usefulness.
One common difficulty in specifying desired objectives is that objectives are often long range in nature, making it extremely difficult to measure success in meeting them. In the interim, evaluation is conducted by relying on surrogate measures of attitudes, knowledge, skills, or behaviors that presumably are related to the ultimate objectives.
Often, it is not until an evaluation study is started that the depth of the problem is discovered. That is, the program was implemented on the basis of important but nonetheless vaguely expressed goals that cannot be addressed effectively until they are reworked, a process that may involve administrators, boards of directors, advocacy organizations, and others. In some cases, programs may be designed to produce certain intermediate changes on the assumption that they are necessary for the attainment of ultimate goals. Probably the best that evaluation can do in such a situation is to discover whether intermediate goals are being met. Only after the more global “goals” are clearly identified and articulated can one begin the larger and more intensified research effort needed to determine the relation between these goals and desired final outcomes.
To evaluate the effectiveness of health programs, specific measurement instruments must be set up for systematic collection of data on the attainment of each program objective and program goal. These procedures follow accepted principles of biostatistical and research design, which are discussed in Chapters 14 and 15.
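To illustrate the final point (a minimal sketch with invented objectives and data, not a procedure from the chapter): once each objective has been stated with a measurable target, attainment can be tabulated directly from the systematically collected data.

```python
# Hypothetical measurable objectives for a school fluoride rinse program.
# Each maps to a target value and the value actually measured during the
# evaluation period (all figures invented for illustration).
objectives = {
    "children screened": {"target": 500, "measured": 520},
    "rinse-protocol adherence rate": {"target": 0.80, "measured": 0.92},
    "caries reduction": {"target": 0.15, "measured": 0.11},
}

def attainment_report(objs):
    """Mark each objective met or unmet by comparing measured value to target."""
    return {name: vals["measured"] >= vals["target"] for name, vals in objs.items()}

report = attainment_report(objectives)
```

Here the first two objectives are met and the third is not, which is precisely the kind of per-objective finding that clearly operationalized goals make possible; with vague goals, no such tabulation could be constructed.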