Learn About: 21st Century | Charter Schools | Homework
Home / Edifier

The EDifier

August 28, 2014

Success of new teacher evaluation systems in districts’ hands

In just the past few years just about every state has revamped how their teachers are evaluated. In 2010 the vast majority of teachers were evaluated by being observed for 45 minutes every couple years. Now, most teachers are evaluated annually based on multiple measures of their effectiveness. Although these new comprehensive evaluation systems have the opportunity to significantly impact the overall quality of our nation’s teachers, a new report from Bellwether Education Partners shows that they are still in need of improvement.

While these new evaluation systems are superior to previous evaluation systems, the report points out there is still room for improvement and provide five major findings with lessons for policymakers:

  1. Districts are starting to differentiate between poor, fair, and great educator performance, rather than treating all teachers as interchangeable widgets.
  2. Schools are using higher-quality classroom observation rubrics to provide teachers with better, timelier feedback.
  3. Despite state policy changes, many districts still don’t factor student growth into teacher evaluation ratings.
  4. Districts have wide discretion even under “statewide” evaluation systems—meaning that evaluation systems within the same state may look very different from one another.
  5. Districts continue to ignore performance when making decisions about teacher hiring, compensation, tenure, and dismissal.


As you can see “district” is explicitly mentioned in four of the five lessons and the fifth lesson about classroom observations typically falls under the domain of districts as well. This pretty much means the success or failure of these new evaluation systems depends in large part on our nation’s school boards as they are the policymakers at the district level.  As the report points out, even in states that mandate the use of a statewide evaluation system school districts have significant discretion over how their teachers are evaluated.

As I argue in our report Trends in Teacher Evaluations it is imperative that districts have flexibility in how they evaluate their teachers. And it is good to see that this is the case, as the Bellwether report stated, “Evaluation reform has not meant the end of local discretion.”

While flexibility is necessary, districts also need support too. Few districts have the resources and expertise to implement an accurate and effective teacher evaluation system on their own. Districts need support from their states to help them align these new evaluation systems to the unique needs of their district.

While responsibility for designing teacher evaluation systems was originally placed on states, this new report clearly shows that these new teacher evaluation systems will only be successful if school boards are provided the resources not only to implement these evaluation systems but also to provide professional development opportunities that are aligned with the results of each teacher’s evaluation. Without proper support for districts, teacher evaluations are unlikely to have much of an impact on the quality of our nation’s teachers. – Jim Hull

Filed under: Teacher evaluation,teachers — Tags: , , — Jim Hull @ 10:37 am

July 12, 2013

Greater flexibility in teacher assignments has its benefits

Research has consistently shown that the most disadvantaged students are less likely to have an effective teacher than their more advantaged peers. This is not only true across districts but within districts as well. Some have blamed this phenomenon—at least in part—on policies that prevent school leaders from replacing ineffective teachers and have argued that if principals were given the flexibility to move an ineffective teacher to another school within the district (called forced transfer) more disadvantaged students would have access to high quality teachers.

To date, there has been little research on whether such flexibility would lead to the intended outcomes. But a recent study found in fact that flexibility can lead to greater teacher effectiveness in low-performing urban schools. The study is not definitive since it is based on only one school district (Miami-Dade). But it found that principals in low-performing schools identified low-performing teachers– as measured by their value-added scores and number of days the teacher was absent—for transfer and were able to replace those teachers with more effective teachers in terms of both test scores (value-added scores) and fewer teacher absences. While the less effective teachers were more likely to be transferred to a much higher performing school with fewer disadvantaged students, unfortunately they were no more effective in their new school. Of course this leads to bigger discussion about what districts should do with ineffective teachers which is a topic for another post.

Such findings give credence to the theory that greater district flexibility in teacher assignments will lead to a more equitable distribution of teachers so disadvantaged students can get their fair share of effective teachers. In the long-term such a policy can have a big effect on narrowing achievement gaps as research has consistently shown the tremendous impact effective teachers have on student achievement.—Jim Hull

Filed under: Achievement Gaps,research,Teacher evaluation,teachers — Tags: , , — Jim Hull @ 4:07 pm

June 11, 2013

A teacher’s misconceptions of using test scores to evaluate teachers

I certainly understand teachers’ angst when it comes to being evaluated based on their student test scores. Being evaluated is not a fun experience but when that evaluation is also based on the performance of others it makes the process that much more stressful. To make matters worse there appears to be a number of misconceptions regarding exactly how test scores are used to evaluate teachers.

USA Today editorial by Alexandria, VA high school teacher Patrick Welch is an example of some of the misconceptions many teachers have about using tests to evaluate them. Mr. Welch’s observations are far from unique. I have heard similar misconceptions from other teachers and even researchers which makes identifying such misconceptions all the more important.

New teacher evaluation systems more accurate than previous evaluations systems

First of all, it must be recognized that there is no such thing as a perfect evaluation system in any profession. So while even the best evaluation systems will certainly identify some truly effective teachers as ineffective it doesn’t mean teachers shouldn’t be evaluated at all. However, evaluation systems should be designed to minimize the chances of such occurrences.  As explained in our Building a Better Evaluation report, there are a number of tools available to make evaluations more accurate.

Test scores can accurately measure a teacher’s performance when used with other measures

Mr. Welch cites research showing the inaccuracies of using test scores to evaluate teachers. For example, some teachers are identified as effective using one test but identified as ineffective when based on a different test. While the research he cites is sound, they do not tell the full story because they are based on using one test to measure a teacher’s effectiveness. In no state that requires teachers to be evaluated based on their impact on student achievement is a teacher ever evaluated based on one measure. In fact, every state requires at least one measure of instructional practice such as classroom observations to make up at least half a teacher’s evaluation. Even in states where half of the teacher’s evaluation is based on student achievement measures, those states require multiple measures of student achievement, not just results from the state assessment.

Teachers are not penalized for teaching lower-performing students

Keep in mind when evaluating teachers based on measures of student achievement it doesn’t mean that teachers are evaluated based on their students’ overall scores. Instead teachers are evaluated on the growth of their students’ test scores from one year to the next. So teachers are evaluated based on the growth their students made while in their teacher’s class.

This is something Mr. Welch appears to misunderstand. This is typified by his story about how he improved the test scores from his Advanced Placement class by encouraging lower-performing students to dropout. He also quotes Jesse Rothstein who said of ranking teachers based on test scores “rewards or penalizes teachers for the kids they teach, not just for how well they do it.”

Fortunately for teachers and students teaching lower-performing students will not likely hurt a teacher’s evaluation. A well designed teacher evaluation system will not penalize teachers for teaching the hardest to teach students. Even if student test scores are a significant factor in the teacher’s evaluation. That is because teachers are evaluated based on the growth their students make from year to year and not on their overall achievement.

Truly rigorous evaluations will assess student growth using a value-added model (VAM) that takes into account each students prior achievement and other background factors to determine if they made as much or more growth than if those students had an average teacher. By making such comparisons, teachers are evaluated based on the quality of their instruction not on the students assigned to their class that particular year. So while Mr. Welch would be correct to worry about being evaluated on his students’ overall test scores, this is not what is happening across the country— although a report CPE will release later this summer finds teachers in some states are more likely to be evaluated accurately than in other states.

Recent research shows teachers can be evaluated accurately

While no evaluation system is perfect and all states should continually update their systems to make them more accurate, the Measures of Teacher Effectiveness (MET) study shows that combining measures of student growth with measures of instructional practice to evaluate teachers provides a reasonably accurate measure of a teacher’s true effectiveness. And when the results are looked at over multiple years they are a more accurate measure.

So while it is understandable that Mr. Welch and many teachers across the country are apprehensive about being evaluated in part on their student test scores, such anxiety should be put at ease knowing they are being evaluated based on the growth their students make and not on the overall performance. And those teachers who are evaluated using value-added scores should have even less anxiety as they provide the most accurate measure of a teacher’s true effectiveness especially when combined with measures of instructional practice. – Jim Hull

Filed under: Teacher evaluation,teachers — Tags: , — Jim Hull @ 11:34 am

May 3, 2013

Exciting possibilities: Coursera and professional development courses

Coursera, an organization currently facilitating free online access to courses taught by college professors, has announced it will be dipping its toes into the professional development arena.  I have to admit that when I read this headline, I was thrilled.  For teachers to have free, online access to courses offered by experts on education research and teaching methods is a step in the right direction.

First, these courses could allow schools to have resources for teachers to improve their skills that are differentiated for the specific content teachers teach.  Because hiring consultants is expensive, districts often rely on generic workshops that they offer to all teachers.  I’ve sat through my fair share of these: classroom management, assessment, alignment.  However, research shows that teachers aren’t interested in generic professional development, and it doesn’t have an impact on teacher practice or student achievement.  On the other hand, professional development that is tailored to the content one teaches, specifically exploring the elements of the course students struggle with, has been shown to make a real difference in teachers’ practice and students’ learning.  With free online courses, teachers could focus on courses tailored to their content area.

Furthermore, each teacher brings his or her own unique set of strengths and weakness to the profession.  Teaching is a job that demands a lengthy list of skills which are both emotional and cognitive.  Just as some students have more natural talents in certain areas than others, the same is true with teachers.  When I co-taught a class with another teacher, I got to see this full force.  My co-teacher managed the emotional needs of a class flawlessly, while my own strengths were in lesson planning.  Working together, we got to improve our areas of weakness.  Having online courses which are free for teachers allows teachers to think about what areas they need to improve on, taking courses focused on those areas instead of sitting through PD sessions not tailored to their area of need.  Just like we urge teachers to differentiate for students, recognizing that not all students are the same, access to online PD taught by experts allows for differentiation for teachers.

Second, it could save districts lots of professional development money that they can spend more wisely.  There’s a decent amount of evidence to show that districts spend a substantial amount of money on professional development, anywhere from 2 to 7% of their total budget.  Unfortunately, most of that spending is going towards one-shot, generic workshops.  Consultants are expensive, certainly one reason that districts can only afford to have whole-school, generic sessions instead of content-specific sessions.  Nonetheless, by spending copious amounts of money on consultants and staff for workshops, districts often don’t build in professional development support as teachers aim to implement those new skills into the classroom.  The reason that’s problematic is that research studies consistently show that teachers struggle immensely with new skills during implementation of those skills in the classroom, and that without support at this stage, teachers are likely to get frustrated and simply abandon the new skill altogether.  Of course, this makes sense.  Learning how to write is easier than actually writing; learning how to ride a bike is easier than actually riding a bike. Implementation is challenging.  Therefore, schools need to develop support during the implementation stage.   When schools do this, through individual instructional coaches who observe and conference with teachers or through time for collaboration, teachers improve their teaching and students learn more.

However, having teachers meet with coaches or collaborate with colleagues takes time, and teacher time is exceptionally expensive.  School districts either have to buy this time in a teacher’s contract, pay substitutes to cover classes, or hire more staff to reduce teaching loads.  Despite that fact, research on professional development shows that opening up this time and having teachers supported during implementation of new skills is exceptionally important.  In an analysis of over 1,300 studies of professional development programs, researchers found that programs that were less than 14 hours had no impact on student achievement .   But if schools were able to cut down on some of their consultant costs by having teachers participate in free, open Coursera courses, schools might be able to buy more teacher time for deep learning experiences such as coaching or collaboration.

Of course, in all discussions about the role of online courses, it’s important to note that they can never stand alone as one’s only exposure to learning, something that’s been validated repeatedly .  However, there’s good reason to think these courses could be a nice addition to a school’s professional development tool kit.  –Allison Gulamhussein

Filed under: CPE,instruction,online learning,teachers — Tags: , — Allison @ 10:06 am

April 4, 2013

Blame the adults not the tests

I have no problem with a debate about the proper role of standardized testing in our public schools. Standardized tests are not perfect but play a vital role in improving student achievement. However, exactly what role is not clear and is certainly open for debate.

While Eugene Robinson of the Washington Post admits that standardized tests are a vital tool in improving student achievement, he essentially blames them for the cheating crisis in Atlanta and elsewhere. Arguing, the reliance on standardized test scores to make decisions about bonuses drove otherwise honorable adults in Atlanta’s classrooms to cheat.

I whole heartily disagree with Mr. Robinson’s thesis. The tests didn’t make the adults cheat, the adults made that decision. Whether it was the teachers themselves or administrators that encouraged such actions, no one is to blame for the cheating than those adults who took part in the cheating directly or ordered that wrong answers be changed. Those are the ones to blame not the tests.

It should be noted Atlanta made significant gains on NAEP– where there is no evidence of cheating –during this time as well which shows that most Atlanta teachers and administrators responded to the pressures to raise student achievement by actually teaching and not cheating. Unfortunately, their hard work is being overshadowed by a number of teachers and administrators who decided to cheat instead of facing the reality they were not meeting the needs of their students.

It is true that Atlanta may have put too much emphasis on test scores alone. As I showed in Building a Better Evaluation System and CPE encourages with data-first.org, no one indicator can accurately evaluate the true effectiveness of a teacher, administrator or school. If Atlanta wanted to reward their most effective teachers and administrators with bonuses their effectiveness should have be measured using multiple measures such as observations, student learning objectives, or portfolios along with standardized test scores. Using multiple measures not only provides a more accurate measure of  true effectiveness but it would also lessen the likelihood those being evaluated can manipulate the results by cheating.

Another reason using multiple measures would help is while teachers should be evaluated on how much they improve their students’ achievement—as measures by their students’ standardized test scores—such measures do not provide any feedback on how they can improve their students’ performance in the future. As such, a teacher who received a low rating based solely on their students’ test scores may resort to cheating to save their job or get a bonus while a teacher who received a low rating but was provided feedback on how to improve their performance is more likely to modify their instructional techniques instead.

Using multiple measures to evaluate teachers and administrators lessens the chances of cheating but that does not mean that Atlanta’s reliance on test scores caused those involved to cheat. If only those teachers and administrators who cheated had more faith in their students and their abilities to teach, more Atlanta students would be better off today. – Jim Hull

Older Posts »
RSS Feed