Learn About: 21st Century | Charter Schools | Homework
Home / Edifier


The EDifier

January 26, 2012

Working through further misconceptions about value-added measures

Filed under: Growth Models,Teacher evaluation,teachers,Uncategorized — rstandrie @ 4:53 pm

In previous posts, I’ve taken issue with many of the criticisms leveled by Diane Ravitch and Valerie Strauss, who disputed the findings from the recent study by Harvard and Columbia economists. That study found students who had teachers with high value-added scores were more successful later in life than students who had teachers with lower value-added scores.

Since I believe many of the criticisms were based on common misconceptions about value-added measures, let’s keep going:

Criticism 6: The study predominately studied teachers and students in the 1990’s when there was no test-based accountability. Value-added measures as well as future student outcomes may differ if consequences were attached to test scores since teachers may focus more on raising scores than providing richer instruction and would likely negatively impact students’ future outcomes.

Response 6: Basing the study on teachers and students from the 1990’s was necessary in order to examine the impact of teachers on elementary students ten years after they graduated from high school. So the findings on the long-term impact are actually based on very recent data.

 It is also true that the results may differ if high stakes were placed on the test results.  Keep in mind, however, that if the data did indeed come from New York City (as Ravitch and Stauss assume), test-based accountability had been in place since the late 1980’s, and that accountability was particularly focused on low-performing schools enrolling predominately poor students.  Even if the data is not from New York City, the fact that the district was testing students and collecting so much data shows it is likely there was some sort of test-based accountability in place at the time.

Yet the study found the impact of having a high value-added teacher was the same for students of all income levels. So even the poorest students, who were much more likely to attend schools where teachers were under pressure to raise test scores, benefited as much from high value-added teachers as higher-income students, who were less likely to attend a school that faced any accountability pressure.

 Criticism 7: The study doesn’t show that value-added can accurately identify individual teachers as effective or ineffective.

Response 7: As I stated previously, no one seriously argues that that individual teachers should be evaluated using value-added measures alone. I am not aware of any proposed or current teacher evaluation systems that have value-added scores account for more than 50 percent of a teacher’s overall evaluation. As the Center for Public Education report Building a Better Evaluation System states, value-added scores can be an effective tool in accurately identifying effective and ineffective teachers, but they should be used within the context of a comprehensive evaluation system that includes observations and other qualitative measures of a teacher’s performance.

Tomorrow I’ll finish the list of misconceptions and criticisms by Strauss and Ravitch. – Jim Hull






January 25, 2012

More misconceptions about value-added measures

Filed under: Growth Models,Teacher evaluation,teachers,Uncategorized — rstandrie @ 4:17 pm

Yesterday I responded to some common criticisms of value-added measures to evaluate teachers. As I mentioned yesterday, these criticisms have been used by critics such as Diane Ravitch and Valerie Strauss to  dispute the findings from the recent study by Harvard and Columbia economists. That study found students who had teachers with high value-added scores were more successful later in life than students who had teachers with lower value-added scores. While many, including me, believed that the study provides strong evidence that value-added measures can be an effective tool in identifying effective teachers, Ravitch and Strauss took exception to such a conclusion.

To be clear, neither Ravitch nor Strauss is questioning the report’s methodology or its findings.  As Ravitch notes “The problems of the study are not technical, but educational.” However, I believe that the criticisms they raise stem more from common misconceptions about value-added scores and how they are used. Here are some additional criticisms, with my responses.

Criticism 3: Teachers in subjects not subject to testing, such as art, music, physical education, and many times social studies do not have value-added scores.

Response 3: I took on this issue in the Center’s Building a Better Evaluation System report. It is a limitation of value-added models, but there are systems in place that still are able to evaluate teachers in non-tested subjects. One technique is to use the school’s value-added score in place of the individual teacher’s value-added score, since all teachers contribute to the success of the school. The school’s value-added score, combined with other measures like observations, would provide a more accurate measure of teacher effectiveness. Not even the most ardent support of value-added scores believes that a decision about an individual teacher should be made based on a value-added score alone.

Criticism 4: It is no wonder higher performing students are more likely to go to college.

Response 4: The key word here is ‘higher.’ If the study found that previously high-performing students were more likely to go on to college, there’d be little to talk about. But it didn’t. The study found a student who had a teacher with a high value-added score was more likely to go to college than if that same student had had an average teacher, and much more likely than if that student had had a teacher with a low value-added score.

Criticism 5: The impact of highly effective teachers on student incomes is relatively small at $250 per year.

Response 5: The number that Ravitch and Strauss cite is the impact of having one high value-added teacher for one year and having average teachers for all other years. Much like compound interest, having more than one highly effective teacher really starts to pay off, especially when you compare it to the impact of poor teachers. For instance, having four highly effective teachers instead of four teachers with low value-added scores would improve a student’s future income by $2,000 per year. That’s not chump change! And the income difference was calculated at age 28 – just before the income gap between college and non-college students really starts to widen.

Keep reading tomorrow for the final set of criticisms. – Jim Hull






Answering the critics: Misconceptions about value-added measures

Filed under: Growth Models,Teacher evaluation,teachers — Jim Hull @ 10:04 am

Neither education historian Diane Ravitch or Washington Post blogger Valerie Strauss are fans of using value-added measures to evaluate teachers.  [Note: value-added is a statistical term describing the measure of a teacher’s impact on their student’s academic growth – see our report for a further explanation.] Both Ravitch and Strauss are particularly upset with the attention given a recent study on value-added measures, which I wrote about earlier this month.

Apparently, Ravitch and Strauss do not believe, as I do, that the results are that big of a deal. They argue that despite the study’s positive results, using value-added measures to evaluate teachers is a bad idea. Their criticisms pretty much capture the general consensus of value-added critics. But many of these criticisms, though well-intended, are based on misunderstandings of value-added measures, especially when used in teacher evaluation formulas.

In the next few posts, I’ll examine the merits of common criticisms of value-added measures that Ravitch, Strauss and others have highlighted, and point out the misconceptions.

Criticism 1: Studies have shown value-added measures to be unreliable, invalid, and unfair.

Response 1: This is an overstatement. Yes, there are several rigorous studies showing that this is the case, but only if you use a single value-added score to evaluate an individual teacher.

Nobody is seriously proposing to use value-added measures this way. There is no teacher evaluation system I am aware of that even proposes using a value-added score for more than 50 percent of a teacher’s total evaluation. At least half of a teacher’s evaluation would be based on qualitative measures such as principal and peer observations – which, by the way, correlate highly with value-added scores. Other systems propose using statistical techniques that make value-added scores more reliable, such as averaging a teacher’s scores over multiple years.

Keep in mind, too, that although value-added measures are not perfect they are better at identifying the true effectiveness of teachers than the teacher evaluation systems in place now as I show in our report Building a Better Evaluation System.

Criticism 2: Teachers would avoid teaching the most challenging students and avoid teaching in the most challenging schools and districts if teachers were evaluated using value-added.

Response 2: Value-added measures were designed specifically to combat this problem. Yes, previous attempts to evaluate teachers using quantitative measures did result  in teachers avoiding challenging positions. However, value-added measures more accurately isolate a teacher’s impact on students’ test scores by explicitly taking into consideration students’ prior achievement. This means, for instance, that teachers who teach low-performing students are compared to other teachers of low-performing students. In addition, value-added measures are based on the amount of growth students make in a year – not their overall score at the end of the year, as previous methods did.

Strauss adds that value-added can’t possibly measure a teacher’s true effectiveness, since 22 percent of children are in poverty and that poverty is strongly correlated to student achievement. I guess she is assuming that value-added doesn’t take into account a student’s socioeconomic status, but this is untrue. Value-added measures account for all student characteristics, including poverty level. Strauss is correct there is a strong correlation between poverty and a student’s achievement level – that is, a student’s achievement at one point in time. But there is little correlation between poverty and achievement growth — the change in student achievement over time. And value-added measures are based on achievement growth, not level. It’s this focus on growth that makes value-added measures so valuable – and why you should come back tomorrow to read more answers to the criticisms about value added.

– Jim Hull






January 19, 2012

Evaluating teachers: look at the data, please

Filed under: Teacher evaluation,teachers,Uncategorized — rstandrie @ 3:55 pm

A new report by the Measures of Effective Teaching project and supported by the Bill & Melinda Gates Foundation looks into the accuracy of classroom observations, as compared to and combined with other things such as value-added measures. The results are too long to discuss in a blog post, but one point caught my eye.

Much of the report compared the “predictive power” of different evaluations. That is, it looked at how likely any given type of evaluation (classroom observations, rigorous student surveys, value-added measures, or a combination), was to predict student gains on state tests in math and reading.

The graph that caught my eye compared the predictive value of a combination of these evaluations vs. the predictive value of graduate degrees or years of experience, the two measures used most often in determining raises and layoffs. For graduate degrees and experience, the predictive value was slight; but for the combined evaluation measures, the predictive value was noticeably more significant. Students who had the top 25% of teachers (as identified by the combined evaluation) gained roughly between one and five more months’ worth of learning than those who had teachers in the bottom 25%.

The point is this: a combined evaluation, using multiple measures of a teacher’s effectiveness including value-added measures, was more effective in identifying the teachers who boosted students learning than any of the traditional measures such as experience or degrees. And the report showed elsewhere that a combined evaluation was more effective than any single evaluation measurement.

My takeaway? Please, let’s use data in evaluating teachers. It doesn’t just have to be numbers — for instance, those rigorous student surveys did a good job, too. But as we argued in Building a Better Evaluation System, value-added models have a place in evaluating teachers as part of a system of multiple measures.

It’s time to build a better evaluation system. It won’t be perfect, but the evidence is mounting that it will be better than what we’ve got. –Rebecca St. Andrie

 






January 12, 2012

More States Improved Their Standards

Filed under: Public education,Report Summary — Jim Hull @ 12:05 pm

This morning, Education Week (EdWeek) released its annual special report Quality Counts 2012, which included its annual State of the States report card. For the fourth year running, Maryland earned top honors with a B plus while the nation as a whole once again received a C. The report card shows that states have been taking steps to improve their standards, assessments, and accountability systems: 20 states improved their grades in this area since it was last reported in 2010. Mainly, the improvement was due to 19 earning perfect marks in the Standards subcategory. On the other hand, states remained stagnant in their teacher policies, where most states earned grades similar or lower than the ones they received in 2010.

Here are some of the key findings from this year’s report card:

Summative Grades

How did the nation as a whole and each individual state perform across all policy and performance areas?

  • Overall, the nation received a grade of a C across all policy and performance areas, which remained the same as a year ago.
  • Maryland earned the highest grade (B plus) for the fourth consecutive year, followed by New York, Massachusetts, and Virginia, who all earned a B.
    • The vast majority of states (41) earned grades between a C minus and a C plus.
    • No states received an F. South Dakota earned a D plus.

Chance for Success

What are the odds that the average child who grows up in a particular state will do as well as the average child in the top-ranked state, at each stage of his or her educational life? (these stages are: the early childhood years, participation and performance in formal education, and educational attainment and workforce outcomes during adulthood)

  • Massachusetts ranked first for the fifth consecutive year by being the only state to receive an A, while once again New Hampshire and New Jersey each received an A minus. 
    • This means that children in Massachusetts have the best chance of achieving positive life outcomes.
  • On the other hand, children in Nevada, New Mexico, and Mississippi have the least chance of achieving positive life outcomes by earning a D and D pluses, respectively.
  • The nation as a whole earned a C plus.

K-12 Achievement

How do states compare on the academic achievement of their students in elementary through high school?

  • Overall, our nation’s schools improved from a D plus to a C minus in the academic preparation of school children.
    • The grade is based on the academic status and growth over time in math and reading scores, narrowing of poverty-based achievement gaps, as well as high school graduation rates and the performance on the advanced placement test.
  • Once again, Massachusetts received the highest grade with a B. Maryland and New Jersey scored slightly lower, but still earned B’s. 
  • Louisiana, West Virginia, Mississippi, and the District of Columbia all received failing marks, just as they did a year ago.

Transition and Alignment

How do states compare on implementing various education policies to better coordinate the connections between K-12 schooling and other segments of the education pipeline, such as early-childhood education, college readiness, and links to the world of work?

  • Arkansas, Maryland, Tennessee, Texas, and West Virginia are leaders in ensuring students are ready to move up the education ladder, earning A’s for their policy work in this area.
  • In contrast, six states earned D’s, while Nebraska earned an F, as it did last year.
  • Although the nation as a whole earned just a C plus, 33 states have defined college readiness, which is an increase of 13 states since 2009. 

 

School Finance

How much do states spend on their schools? Is the spending distributed equitably?

  • Although no state received an A, seven states received B minuses for adequately funding their schools.
    • On the other hand, four states — Idaho, South Dakota, Nevada, and North Carolina –received grades of D or D minus. No state received a failing grade.
  • As a whole, the nation received a C on funding education. However, on average, the nation spends more money on wealthier school districts than poorer districts nationwide.
    • Just six states spent as much or more on their poorer districts as on their wealthier counterparts. 





Older Posts »
RSS Feed