Planning an Effective Test

 

Writing an effective test starts with good planning.  The most important goal is to match your test items with your course objectives.  Your course objectives are most likely aligned with Bloom’s Taxonomy.  Your expectation of the level of student learning will help determine the types of test items to be used.

 

Knowledge

·         Remembering previously learned material, such as terms, facts, trends and sequences, methodology, principles and theories.

 

Comprehension

·         Grasping the meaning of material

·         Converting from one form to another

·         Explaining or summarizing material

·         Extending the meaning beyond the data

 

Application

·         Using information in concrete situations

 

Analysis

·         Breaking down material into its parts

·         Identifying the parts and relationships

·         Identifying the organization

 

Synthesis

·         Putting parts together into a whole

·         Production of a plan or proposed set of operations

·         Derivation of a set of abstract relations

 

Evaluation

·         Judging the value of a thing for a given purpose using definite criteria

 

 

Outline your unit to be tested into major categories based on your objectives.  Determine how many questions should be asked in each category and at what level the student should be tested.  Remember that testing is a matter of sampling.  Select a representative sample of learning tasks and include an adequate number of items on your test.  The size of the domain of information covered by the objectives should help determine the length of the test.  For reliability, create parallel test items that assess the same objective.

 

You may want to construct a table to help you either construct a new test or evaluate an existing test.  Note in the following sample planning chart that there are four major objectives to be tested.  Within each category, different levels of learning will be examined.  The numbers indicate either how many questions the instructor wishes to ask for each outcome and at each cognitive level on a new test or how many are present on an existing test.

 

Table of Specifications for a Summative Test

 

Content/Outcomes

Knows

Comprehends Principles

Applies Principles

Total Number of Items

Terms

Facts

Procedures

Role of Tests in Instruction

4

4

 

2

 

10

Principles of Testing

4

3

2

6

5

20

Norm Referenced vs. Criterion Referenced

4

3

3

 

 

10

Planning the Test

3

5

5

2

5

20

Total Number of Items

15

15

10

10

10

60

 

 

Once you have constructed this chart, you will know how many questions to ask about the various course objectives, and you will know the level of learning you wish to test.  The next step is to determine the type of test questions that will best measure student learning at the appropriate level.

 

The following list includes commonly used types of test items.  For each of these test item types, we’ll look at the kind of content for which it is best suited, the level of Bloom’s Taxonomy that can be assessed by that type, and the advantages and disadvantages of each type.

 

TRUE/FALSE

 

Best suited for:  Naturally dichotomous content, or content that clearly has only two possible choices.

Bloom levels:  Knowledge, comprehension, and application (however, almost always written at knowledge level.

Advantages:  Typically easier to write than MC (usually because they are written at knowledge level), easily scored, can be submitted to statistical item analysis.

Disadvantages:  Test takers have 50/50 chance at correct answer by guessing; more difficult to write at comprehension and application levels.

 

 

MATCHING

 

Best suited for:  Understanding of homogeneous content (use of tools, types of illnesses, etc.)

Bloom levels:  Knowledge and comprehension (seldom written beyond knowledge level).

Advantages:  Easy to write, easily scored, can be submitted to statistical item analysis.

Disadvantages:  Limited to lowest levels of Bloom’s Taxonomy; if phrases listed in Column A are not homogeneous, matching items becomes relatively easy; test takers use process of elimination.

 

 

MULTIPLE CHOICE

 

Best suited for:  Any type of content.

Bloom levels:  Knowledge, comprehension, application, analysis (synthesis and evaluation require original responses).

Advantages:  Can assess content at a variety of levels, probability of guessing correctly is lower than with T/F questions, useful for diagnostic testing using appropriate distractors, easily scored, can be submitted to statistical item analysis.

Disadvantages:  Difficult and time-consuming to write, can’t assess objectives that require the test taker to recall information unassisted, can’t assess at synthesis and evaluation levels.

 

 

FILL-IN ITEMS

 

Best suited for:  Recalling or creating a correct answer rather than recognizing it.

Bloom levels:  Knowledge, comprehension, application (usually written at knowledge level).

Advantages:  Easy to write, good for recalling information.

Disadvantages:  Suitable only for answers that require a word or short phrase, correctness of an answer can be debated so judgment of scorer is called upon.

 


SHORT ANSWER

 

Best suited for:  Recalling information unassisted or creating original responses of relatively short length.

Bloom levels:  All levels except possibly evaluation (evaluation questions might need to be longer).

Advantages:  Able to elicit original responses from test takers, lower level questions are easier to write than MC questions, good for assessing objectives that can’t be assessed with closed-ended questions.

Disadvantages:  Very difficult to score reliably; evaluation prone to errors due to halo effects, scorer fatigue, and handwriting; scoring is time consuming; more time consuming for test takers.

 

 

ESSAY

 

Best suited for:  Objectives that require original, lengthy responses; good for assessing writing skills.

Bloom levels:  All levels of Bloom’s Taxonomy.

Advantages:  Capacity to assess the highest cognitive levels.

Disadvantages:  Very difficult to score reliably; evaluation prone to errors due to halo effects, scorer fatigue, and handwriting; scoring is time consuming; more time consuming for test takers.  Use only when the cognitive level of the objective requires it.

 

 

QUANTITATIVE PROBLEM-SOLVING

 

Best suited for:  Fields such as mathematics, the sciences, and engineering.

Bloom levels:  Application, analysis, synthesis, evaluation.

Advantages:  Capacity to assess at all cognitive levels, low probability of guessing

Disadvantages:  Can be difficult and time consuming to score/grade, evaluation prone to errors due to halo effects, scorer fatigue, handwriting, more time consuming for test takers.

 

 

Erwin, T. Dary, Assessing Student Learning and Development, Jossey-Bass Publishers, 1991.

Gronlund, Norman E., How to Construct Achievement Tests, 4th ed., Prentice-Hall, Inc., 1988.

Shrock, Sharon A., and William C.C. Coscarelli, Criterion-Referenced Test Development, Addison-Wesley Publishing Company Inc., 1989.