How to stop grading unfairly: 9 ways

It may be only the second week of class, but I have a stack of 45 tests to grade. The students had to answer 10 of 20 questions that generally could be answered in 3 to 5 sentences. Now I have to grade them. How can I possibly do this fairly? What I want is for the points I give a test to reflect what the student knows about the material. What could get in the way?

A lot of problems could arise, I am reminded from the workshop on grading that I went to right before classes began. Beth Fisher had some wise answers: have clear questions; have a grading rubric; make sure your test matches what you have taught. But in a way what most impressed me were the colleagues in attendance who were struggling with fair grading. This came out most strongly in the courses that were so large they had different teaching assistants grading different people. One young professor who was determined to improve mentioned that the teaching assistants had inadvertently both graded some students. Sounds fine, but the problem was that they both wrote their scores on the top of the page, somehow not noticing the other’s score. And the grades each gave were quite divergent, according to the professor. I wonder what kind of pandemonium in the class this led to.

Arbitrariness in the grade is horrifying to the students, for it might affect their whole career. I suppose to us faculty it is our dirty little secret, something few are as open to addressing as my forthright colleague. What can we do? Here are a few thoughts.

1. Have students put their student ID or course assigned number on the test and not their name. It has been very well documented that once we know our students, we give the ones that are generally talkative in class, or have a record of good performance better marks. We see things in their answers that are not there. Conversely, we are more critical of those we think less highly of. If we don’t know who they are, we will grade the questions more objectively. I don’t know many in the class at this early point, but I still had to make myself go back over the answers of the few I know to make sure I wasn’t being too stingy or generous.

2. Grade in clear-cut categories as much as possible. My students have 10 questions, each worth 10 points, with the whole test being worth about 6.7% of the course grade. Instead of grading each question on the scale of 1 to 10, I generally only use 3 numbers, 0, 5, and 10. They get a 0 if nothing is correct about their answer, a 5 if they have some correct information, and some wrong information, or are incomplete. If they are entirely correct and complete, they get 10 points. I’m not a total stickler, so many of them get 10 for most of their questions. A very few situations will give a student a 3 or a 7. This kind of grading helps reduce bias because the judgements are easier.

3. Have the same person grade all of the test, or of a section of the test. If this is not done, then there will be great inconsistencies, no matter how carefully the other parts are attended to. Some people get together after a test to grade together, each person taking a question or a page. This will not always be possible, though, in the large classes we have today. Lab notebooks in particular were mentioned as needing multiple graders. What to do?

4. If there must be multiple graders, standardize them on every assignment. Make a few copies of a few papers and have all the graders grade them. Compare scores, discuss, try it with a new set of papers and repeat until the graders are very uniform. Does this sound like too much trouble? Just think for a moment how important fair grades are for the students.

5. Have a clear rubric and a key. A rubric is just a list of things the assignment calls for and points assigned. If it is too detailed, it will make things harder. The more separate sections you have, the better. For my test, the rubric is quite simple. For the Wikipedia articles the students write, the rubric will be more complicated. Here is my rubric for the first things the students do on Wikipedia, evaluate articles already posted, below. You can see that most of the points are for completion of the category.

Grading rubric: 70 points in all, 14 points per organism, 5 organisms

For each organism:

5 points: What are the strengths of this entry? What have you learned that is most interesting?

5 points: Name 3 general categories in the outline that are missing and could be included. Explain why for each.

4 points: Look at the talk page. Comment on the details here, including the ranking and importance of the article.

Full points will be given to entries in each category that are thorough, exhibit careful thinking, and tie to the material of the course. Your writing should be intelligible without going back to the original Wikipedia page.

6. Grade a given section or question all in one sitting. We change how we grade according to mood. Just look at the decisions from an even more important arena: our justice system. Ed Yong reports on a study of parole granting and finds judges that have eaten recently or are at the beginning of a session are more lenient, to large effect. I purposely did not continue grading after the 5K I ran yesterday when I was feeling extremely mellow.

7. Grade the test or project question by question, piece by piece. You will be more consistent if you grade all of question 5 before grading any other question. Likewise, with a lab report, grade all of a certain section before grading any other section.

8. Mix up the order of the papers. If you are grading question by question, you can easily shuffle the papers a bit when you finish a question. That way each student will get the benefit and cost of position. You may be differently lenient at the beginning when you haven’t seen all the possible answers. You may be desperate for a break towards the end.

9. Be aware of inadvertent bias and try to avoid it. All of these things assume you want to be a fair grader and are trying hard. They address things inherent to human nature. Following these, and I’m sure there are other good tips, will simply make the learning process and its evaluation a more accurate reflection of what a given student is demonstrating on a given assignment.

About Joan E. Strassmann

Evolutionary biologist, studies social behavior in insects & microbes, interested in education, travel, birds, tropics, nature, food; biology professor at Washington University in St. Louis

View all posts by Joan E. Strassmann →

6 Responses to How to stop grading unfairly: 9 ways

µ says:

September 7, 2014 at 8:51 AM

re: “…other good tips”

To check for consistency:
For a large class, grade the first ten exams/problems, but don’t write the scores onto the papers, but record separately; add these ten exams to the bottom of the stack, then grade these again at the end; check for consistency between your first and second score. This pseudo-self-replication works best for large classes where you are most likely to forget the first set of scores by the time you are done grading the entire stack.

To reduce chance of students complaining:
Grade fairly each question/problem and write the score next to each question; write onto the top of the exam a sum-total score that is slightly greater (e.g., by 1-2%) than the actual sum if scores were tallied correctly for all questions. Students typically figure out that there is a discrepancy between what they deserve (based on correct sum total from all questions), and what was recorded incorrectly as the total; students then tend not complain about individual questions because they don’t want to risk discovery of the 2% grading “error” that benefits them. This works great for the professional complainers who would try to argue points no matter how fair you grade.

I agree that grading question-by-question maximizes consistency; much better than grading student-by-student.

- Joan E. Strassmann says:
  
  September 7, 2014 at 9:35 AM
  
  Wonderful trick, the few extra points. Furbo, as the Italians would say. Great tip on grading and regrading the questions. Onward and upward in the name of fairness!
  
- Camille Miller says:
  
  December 8, 2014 at 1:08 PM
  
  Much of this advice makes sense (especially if you are subject to mood swings), but the 0, 5, or 10 system is not a good idea. Be accurate and don’t lump answers of varying quality into one number category. That is just lazy. Even grading from 1 to 10 (by integers) is insufficient. Grade by tenths of a point. 8.8 is a little better than 8.7. Once you have graded all the answers, order the tests from highest to lowest, read them again, and adjust grades as necessary. Also, make notes on your key as you go. Students may surprise you, so a pre-planned rubric may need adjustments. Also, that helps you be consistent on certain mistakes or certain good ideas (say, -0.8 for claiming that Napoleon was crazy for thinking that he was Napoleon– or +1.1, depending on your style).
  
  - Joan E. Strassmann says:
    
    December 8, 2014 at 11:19 PM
    
    The advantage to the 0 5 10 system for short essays is it reduces error. A second reading is likely to come up with the same scores. Even if you read them twice, exactly hitting the example of 8.8 vs. 8.7 is impossible. The other issue is that a grader’s time is also valuable. A one time read through with the 0 5 10 system can work really well. The thing is, these are generally good students, generally get most of the points. If they were not, it might be different. Also the quizzes are not worth very much of the total grade.
Pingback: Friday links: falsificationist vs. confirmationist science, transgendered scientists, lizard vs. iPhone, and more | Dynamic Ecology
Camille Miller says:

December 9, 2014 at 11:57 AM

So if you grade a problem the same wrong way twice in a row, then you didn’t make an “error”? By that logic, if you give every answer, say, 8 every time, then you never make “errors,” because your scores are reproducible 100% of the time. 8 is a good general score, because you reward the student with most of the points, so they can’t complain TOO much, but it always leaves them room for improvement, so that they never stop striving. And then grading is really easy! (You can do it at the bar, even late at night.) Giving everyone the same grade is also fair, because it doesn’t unduly reward students have been given advantages in life that made them better students, or punish those who have been disadvantaged. Just to keep them on their toes, now and then you can give everyone a 9 or a 7. Otherwise they might catch on!

Mu’s scheme of “accidentally” adding up grades wrong is funny, but it will decrease your students’ respect for you if they think you can’t do third-grade arithmetic. Better to give the “professional complainers” the staredown, and then sneer, “So you want me to regrade your paper?” Watch them scurry away!