{"id":1825,"date":"2019-05-31T18:20:43","date_gmt":"2019-05-31T22:20:43","guid":{"rendered":"http:\/\/blogs.ams.org\/inclusionexclusion\/?p=1825"},"modified":"2019-05-31T18:20:43","modified_gmt":"2019-05-31T22:20:43","slug":"set-theory-on-reading-student-evaluations-of-teaching","status":"publish","type":"post","link":"https:\/\/blogs.ams.org\/inclusionexclusion\/2019\/05\/31\/set-theory-on-reading-student-evaluations-of-teaching\/","title":{"rendered":"SET Theory: On reading student evaluations of teaching"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-1826\" src=\"http:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/comicSET.jpg\" alt=\"\" width=\"750\" height=\"376\" srcset=\"https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/comicSET.jpg 750w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/comicSET-300x150.jpg 300w\" sizes=\"auto, (max-width: 750px) 100vw, 750px\" \/>The school year is over, commencement has come and gone, grades are in, and the summer lies ahead of us, with all of its promise of research or rest or travel, and only one potential obstacle looms in the horizon \u2013 the dreaded teaching evaluations. We have all been traumatized and scarred by teaching evals at some point in our lives. If you\u2019re in a privileged position like mine, with tenure, chair of your department, and no promotion coming any time soon (I am only eligible to go up for promotion in like three years), you can avoid the trauma using one simple trick: just don\u2019t look, until you have to. \u00a0You know no one else will, either. But this is not the case for early career tenure-track faculty, postdocs and other visiting faculty, or for \u201ctenure exempt\u201d faculty (Linse, 2017). The fact is that so-called Student Evaluations of Teaching (SET) are still heavily used for reappointment and promotion, and sometimes requested by hiring committees. But another fact is that this data could potentially be useful even to senior faculty \u2013 for our own teaching but also as colleagues and mentors to these more vulnerable faculty. Last week, I attended a workshop at my institution run by my colleague in Chemistry <a href=\"https:\/\/www.bates.edu\/faculty-expertise\/profile\/lynn-a-mandeltort\/\">Dr. Lynn Mandletort<\/a>, designed to help us make the most of these teaching evaluations. In this post, I summarize some of my main takeaways from this workshop, and some suggestions and resources for further reading.<\/p>\n<p><!--more--><\/p>\n<p><strong>First, a disclaimer:<\/strong> This is NOT a post about how teaching evaluations are biased and awful \u2013 in my personal opinion they are, and there is a lot of writing on this very issue you can find elsewhere. For example, see this great post by <a href=\"https:\/\/blogs.ams.org\/blogonmathblogs\/2019\/04\/08\/do-evaluations-really-add-up\/\">Anna Haensch on the AMS Blog on Math Blogs<\/a> for a few resources. Also, after doing some of the reading recommended by Lynn, it is possible that evals are not as biased and awful as I thought, but more on that later. <strong>This is a post about what we do with the measures we already have.<\/strong><\/p>\n<p><strong>But why is this on the i\/e blog? <\/strong>Good question, dear reader. Like I said above, there is a lot of research showing that SETs are biased, especially against women professors and professors of color. This is so well known by now, that there are even comedic pieces about it, see this <a href=\"https:\/\/www.mcsweeneys.net\/articles\/reviewing-course-evaluations-the-drinking-game\">McSweeney\u2019s piece for a \u201cfun\u201d drinking game on this topic<\/a>\u2026 Linse, in her article <a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0191491X16300232\">\u201cInterpreting and using student ratings data\u201d<\/a>, claims that much of this research is overblown and maybe gives too much weight to what she calls \u201crare\u201d data or outliers. I am not a sociologist nor am I an expert in these matters. But the fact is that even if not making a statistically significant difference to our ratings, the impact on women faculty, faculty of color, and other marginalized groups seen as \u201coutsiders\u201d to academia, is not just about statistical significance. This is emotionally taxing, on top of other emotionally taxing things that these faculty already have to do. I also wonder about how a person&#8217;s teaching\u00a0 is affected when they are constantly exposed to micro and macro aggressions from other faculty and students. At any rate, if you are faculty of color or woman faculty, \u201cwell, your low ratings might not ONLY be due to your gender\/ethnicity\/race\u201d is not really all that helpful.<\/p>\n<p>SO, that said, after being through that workshop last week, I realized that there were ways to digest the data that were more useful and less traumatic to me, and that I should share this with the i\/e community. The bonus is that this should be helpful to everyone, not just mathematics instructors from minoritized groups.<\/p>\n<p><strong>Finally, the workshop!<\/strong><\/p>\n<p>Lynn started by emphasizing a new term\/language for what we have so far called SETs \u2013 instead, we should call them Student Ratings of Instruction (SRIs). She gave us three guiding principles for what the data are (and are not), similar to those in (Linse, 2017):<\/p>\n<ul>\n<li>Students ratings are student <em>perception<\/em> data.<\/li>\n<li>Student ratings are <em>not<\/em> faculty evaluations.<\/li>\n<li>Student ratings are <em>not<\/em> measures of student learning.<\/li>\n<\/ul>\n<p>So, how do these student ratings (or student perceptions) help us with our own teaching? First, we have to be able to organize the data in ways that are useful. Lynn led us through three main steps for this process.<\/p>\n<p><strong>Step 1: Big picture<\/strong><\/p>\n<p>Bates does online teaching evaluations, and we can download a .csv file with all of the data (quantitative and qualitative). After downloading, Lynn recommended we sort the data by student. When you sort the data by question or item, you might miss the \u201ctrolls\u201d \u2013 students with a negative perception who are not representative of your class as a whole, or students with strong biases against you. We also did conditional formatting on the spreadsheet to the numerical scores to look for clumps of different colors. (See image below). This was so much easier to scan than numbers, and the feeling of seeing lots of green on my spreadsheet was great \u2013 even thought there were some yellows and reds. She then asked us to try to answer the questions: \u201cAre there individual students who seemed to have an overall \u201cbad\u201d time with a string of low numbers in their responses?\u201d<\/p>\n<div id=\"attachment_1827\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1827\" class=\"size-large wp-image-1827\" src=\"http:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/color-coding-933x1024.jpeg\" alt=\"\" width=\"640\" height=\"702\" srcset=\"https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/color-coding-933x1024.jpeg 933w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/color-coding-273x300.jpeg 273w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/color-coding-768x843.jpeg 768w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/color-coding.jpeg 982w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><p id=\"caption-attachment-1827\" class=\"wp-caption-text\">Student 9 was not a total fan, but not a total hater either&#8230;<\/p><\/div>\n<p>She advised us to disregard the numerical data (especially the dreaded box plot comparing you to the whole campus, see below for the fuel of my nightmares), unless we had been teaching a course for many years, and only then to compare our new data to our old data, not to others (not even those teaching the same course in the same way).<\/p>\n<div id=\"attachment_1828\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1828\" class=\"size-large wp-image-1828\" src=\"http:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/boxplot-1024x324.jpeg\" alt=\"\" width=\"640\" height=\"203\" srcset=\"https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/boxplot-1024x324.jpeg 1024w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/boxplot-300x95.jpeg 300w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/boxplot-768x243.jpeg 768w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/boxplot.jpeg 1314w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><p id=\"caption-attachment-1828\" class=\"wp-caption-text\">When you&#8217;re below the 25th percentile, but you also scored above a 4 out of 5&#8230; This is also a typical response to a &#8220;flipped&#8221; format, since class meetings are not everything&#8230; (see Hodges and Stanton below)<\/p><\/div>\n<p><strong>Step 2: Specifics<\/strong><\/p>\n<p>Once the data is sorted, we can \u201cinspect the comments for anomalous negative comments \u2013 obvious trolling, comments on your appearance or personality, or anything that can\u2019t be interpreted as actionable feedback on learning.\u201d We were advised to disregard those and put them aside. (I once had a comment that basically said &#8220;Adriana is nice, but she&#8217;s not worth the whatever thousand dollars my parents pay a year&#8221; &#8212; umm, ok?).<\/p>\n<p>Then we categorized the remaining comments and connected them to aspects of the course or the student experience. I particularly liked creating a table like the one shown below (I did this for both Number Theory and my Great Ideas in Mathematics course, but the latter had so few comments it wasn\u2019t super useful).<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-1829\" src=\"http:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/IMG_0518-1024x447.jpg\" alt=\"\" width=\"640\" height=\"279\" srcset=\"https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/IMG_0518-1024x447.jpg 1024w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/IMG_0518-300x131.jpg 300w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/IMG_0518-768x335.jpg 768w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/p>\n<p>Another recommendation I really liked was to look for places where students seem to disagree, and in particular positive comments that may counteract the negative ones. She mentioned \u201c<a href=\"https:\/\/en.wikipedia.org\/wiki\/Fear_conditioning\">fear conditioning<\/a>\u201d and the fact that it is easier to commit negative experiences (and comments) to memory (see comic above!).<\/p>\n<p>Then she prompted us to highlight any comments that refer to carefully thought out parts of the course, or anything that aligns closely with our teaching philosophy or other teaching goals. \u201cThis is the fodder for your dossier \u2013 either laying the groundwork for future changes or information that connects to earlier efforts.\u201d<\/p>\n<p><strong>Step 3: Check in<\/strong><\/p>\n<p>We wrapped up by checking in with each other. In particular, she pointed out, as I did earlier, that SRIs are emotionally taxing for newer faculty and non-tenured faculty, and that we should take care in having these conversations with each other.<\/p>\n<p>At the beginning of the workshop I thought this would be the part where I would be crying (the imposter syndrome does not really go away, no matter how tenured or furniture-like you are). But I was super satisfied. Not because my comments were overwhelmingly positive (although they were better than I expected \u2013 I suspect that I would have felt they were worse had I not done this workshop), but because I found this process to be illuminating. It was easy for me to see areas where I underperformed (I did not return work on time \u2013 I knew that already, but it is clear the students were unhappy about this, and understandably so), and areas that I was being particularly mindful of were noticed and valued (lots of comments about feeling respected and included in the classroom).<\/p>\n<p><strong>Advice for administration, tenure and promotion and hiring committees<\/strong><\/p>\n<p>Most of this post has been to share an activity that I found helpful, and thought could be helpful to readers of this blog. However, it is important to remember that this is a huge source of anxiety for faculty. I also want to emphasize that we DO care about how students feel and these measures are probably not going away soon. However, we can change the culture around these instruments. For example, be very clear that faculty are evaluated individually (not in relation to each other, otherwise we end up in Lake Wobegon where everyone is expected to be above average). That biases extend to committees, and are not just held by students, how do we make sure we evaluate our colleagues fairly, given that academia is structurally unfair? That SRIs are but ONE measure of a teacher, and more data need to be gathered to get a full picture. That for this to be actually useful to newer faculty, it needs to be considered a <em>formative<\/em>, not <em>summative<\/em> assessment \u2013 observe someone\u2019s ratings for a given course through the years. Chairs and other letter writers need to provide context for outlier courses (and by the way, we all have them). T&amp;P committees need to look at more than just an average of scores (numerical ratings of teaching will have a tail at the low end of the scale, this should not then be evaluated using a mean).<\/p>\n<p>Finally, be kind to yourselves \u2013 we are all learning. And lean on your colleagues and friends \u2013 I think a key part of why this workshop was successful for me is that I was with other people who were as anxious as me.<\/p>\n<p>Do you, dear readers, have any other strategies for reading your evals, both the quantitative and qualitative pieces?<\/p>\n<p>For example: My friend Megan Cook, English professor at Colby College, finds word clouds useful.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-1831\" src=\"http:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/megan-1017x1024.jpeg\" alt=\"\" width=\"640\" height=\"644\" srcset=\"https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/megan-1017x1024.jpeg 1017w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/megan-150x150.jpeg 150w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/megan-298x300.jpeg 298w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/megan-768x773.jpeg 768w, https:\/\/blogs.ams.org\/inclusionexclusion\/files\/2019\/05\/megan.jpeg 1208w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/p>\n<p>Please share any ideas in the comments section below!<\/p>\n<p><strong>References<\/strong><\/p>\n<p><a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0191491X16300232\">Angela R. Linse, \u201cInterpreting and using student rating data: Guidance for faculty serving as administrators and on evaluation committees\u201d, Studies in Educational Evaluation 54 (2017) 94\u2014106.<\/a><\/p>\n<p><a href=\"http:\/\/provost.ucsd.edu\/SIXTHDOCS\/CAT_Hodges&amp;Stanton_Article.pdf\">Linda C. Hodges and Katherine Stanton, \u201cTranslating Comments on Student Evaluations into the Language of Learning\u201d, Innov High Educ (2007) 31: 279\u2014286.<\/a><\/p>\n<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>","protected":false},"excerpt":{"rendered":"<p>The school year is over, commencement has come and gone, grades are in, and the summer lies ahead of us, with all of its promise of research or rest or travel, and only one potential obstacle looms in the horizon &hellip; <a href=\"https:\/\/blogs.ams.org\/inclusionexclusion\/2019\/05\/31\/set-theory-on-reading-student-evaluations-of-teaching\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" data-url=https:\/\/blogs.ams.org\/inclusionexclusion\/2019\/05\/31\/set-theory-on-reading-student-evaluations-of-teaching\/><\/div>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[55,54,52],"tags":[],"class_list":["post-1825","post","type-post","status-publish","format-standard","hentry","category-student-evaluations-of-teaching","category-student-ratings-of-instruction","category-teaching"],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p7Y6qR-tr","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/posts\/1825","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/comments?post=1825"}],"version-history":[{"count":6,"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/posts\/1825\/revisions"}],"predecessor-version":[{"id":1836,"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/posts\/1825\/revisions\/1836"}],"wp:attachment":[{"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/media?parent=1825"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/categories?post=1825"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.ams.org\/inclusionexclusion\/wp-json\/wp\/v2\/tags?post=1825"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}