I’ve written before about my experience teaching my graduate course this semester, but I haven’t talked about one big experiment I tried: working towards standards-based grading. I started hearing about different implementations of this over the last year or so, and thanks to a Project NExT panel at the joint meeting from Eric Sullivan and Benjamin Braun (links go to their slides), I thought I’d take the plunge.
The idea behind typical standards, specs, or mastery-based grading is to track students’ progress with mastering skills directly, instead of filtered through a weighted average of percentages on assignments. In addition, students have multiple opportunities to improve their work and demonstrate mastery of these skills. Final grades are usually assigned based on how many skills have been mastered.
I didn’t do a full implementation of standards-based grading this semester, but I did get a good start. I skipped the really hard part of this – listing all the learning objectives for my students – because I’d never taught the course before and some of my goals for the class felt kind-of nebulous. This class is algebra for teachers, so a lot of the course is about teaching students to think abstractly, generalize, and write simple proofs. Writing these goals in simple, assessable terms seemed like too much work for a first attempt.
Instead, I started with the easy part of standards-based grading: the actual grading part. I graded each homework problem on a scale of 0-2. A 2 is mastery. Maybe not completely perfect, but it’s clear that the student has demonstrated that they have the skills I’m assessing. A 0 is work that is completely on the wrong track, and 1 is somewhere in between – maybe undeveloped or incomplete, but with a good start. I liked this scale a lot, because it reduces the variance in my grading: if I try to grade a problem out of ten points, the difference between a 5 and a 7 might depend more on my mood than on the actual quality of the work. But the lines between a 0, a 1, and a 2 are almost always clear.
I had a very small class, so I let my students resubmit their work as many times as they wanted. Others put a time limit on resubmissions to prevent a glut of grading at the end of the semester. But the idea is that students can keep refining their work until they are proficient at the skills we want them to learn. And it forces students to learn from their mistakes instead of just shoving a bad grade in the back of a folder and never thinking about it again.
I assigned final grades based on how many 0s, 1s, and 2s students had by the end of the semester. An A was more than 95% 2s, a B more than 85% 2s, C more than 75%. Students knew exactly what they needed to do in order to get the grade they wanted and (most) would resubmit accordingly. One student was terrified that she was going to fail in the beginning of the semester, but after working with her extensively she ended up bringing her grade up to an A.
I have to admit, this was not an unqualified success though. These were graduate students, most of them current teachers with families, so I was very lenient on deadlines. I figured everybody would turn in the required resubmissions by the end of the semester without a lot of hand-holding. That was not the case. If I had this to do over again, I would have been more proactive and made sure that every student knew if they were underperforming.
I will definitely be doing this again for my smaller classes. The simpler scale saved a ton of time and effort, so the re-grading didn’t feel like that much extra work. Next time I’ll even try to align my assignments with learning objectives. I’m also interested to see how undergraduates respond to this method – my guess is that they’ll jump on board a little more easily, but I’m not sure. I’ll report back next year.
Thanks for writing about your experience with new assessment techniques. I’ve been interested in applying specs grading in my classes, which I am preparing to do in the Fall. I still haven’t decided if I wanted to do the full implementation, or just partial implementation. I really like the 0-2 point scale, I definitely am going to use that, at the very least!
I have been working with SBG since 2012. I’m so glad to see you working towards this goal! You are right that implementing SBG is not a replacement for motivation. Though grading techniques are a huge part of SBG, it is about backwards planned assessments that truly make it feasible to integrate this style of grading! Please let me know if you have questions!
I like the idea of the 2 point scale, but I have a couple of questions:
1. On the grading itself: Was there any difference between how the 0s and 1s were incorporated into the final grade? You only mention percentages of 2s. Also, were there exams too, and, if so, how were they incorporated into the final grade?
2. One concern I’d have without synchronized deadlines would be that once enough students starting getting 2s then with enough time their answers would percolate out to the other students. Did you address this possibility in any particular way?
Thanks!
Thanks for reading! Initially I had a fairly complex rubric for grades that took 0s and 1s into account as well, but I ended up scrapping it at the end of the semester for simplicity’s sake. It probably wouldn’t have changed the grades much anyway. There were no exams, but there was a final presentation with a similar 0-1-2 rubric. To get a particular grade, students had to get a certain percentage of 2s on both the homework and the presentation. No student did dramatically better on one portion than the other, so I lucked out on not having to resolve cases like that.
Sharing answers didn’t appear to be much of an issue, possibly because these were graduate students who didn’t know one another very well. I think if I had noticed a lot of copying, I would have re-emphasized that I’m fine with students discussing general ideas about the assignments with each other, but the actual writing of solutions needs to be done independently. Then I would have escalated to making noises about honor code violations if it came to that.