Former Philly principal details destructive impacts of test cheating

In July 2010, when Saliyah Cruz was named principal of Communications Technology High, state test scores said the small school in Southwest Philadelphia was one of the best in the city.

Everything else said something different.

SAT scores were poor. Summer enrichment programs were empty. Loads of kids tested into remedial reading and math. According to Cruz, even the police complained that many students had spent much of the previous school year at the nearby Penrose Plaza strip mall instead of in class.

"Those kinds of things didn't add up for me," she said. "If my kids were out in the street when they were supposed to be in school, how were [75] percent scoring proficient?"

Two years later, an answer appeared: A mountain of circumstantial evidence now suggests that Comm Tech's results on the 2010 Pennsylvania System of School Assessment (PSSA) exams were inflated by adult cheating.

The school, which takes students from all over the city, is one of 53 District schools and four area charters involved in a state-led investigation that has prompted questions about the validity of test results between 2009 and 2011.

Cruz, who left the Philadelphia school system in frustration and is now a middle school principal in Delaware, says the suspect scores at Comm Tech hurt students. She also believes they reflected a district-wide culture at that time that rewarded improbable PSSA gains while dismissing steady improvement.

"The message quite clearly was 'here's what's expected in the School District of Philadelphia,'" Cruz said, talking about a time period when Arlene Ackerman was still schools superintendent. "All the principals, all the teachers, all the kids need to be able to make these giant leaps forward."

Testing experts say the ripple effects of inflated scores go even wider, especially because the District continues to rely on data that are likely tainted to measure success and make high-stakes policy decisions.

"I think the implications are pretty profound," said Jonathan Supovitz, a University of Pennsylvania professor who co-directs the Consortium for Policy Research in Education (CPRE).

"If we can't assume the stability of the data, then any sense of guidance about what we're doing well or not well is broken down."

Sudden spikes, sudden drops

District officials declined to be interviewed for this story.

"It is too early to say how the PSSA scores have been affected by the allegations of testing improprieties," wrote spokesman Fernando Gallard in a statement, citing the ongoing investigation.

Each year, students in grades 3-8 and 11 take the PSSA in reading and math. Their scores are used to determine whether schools meet federally mandated performance targets, known as adequate yearly progress (AYP). In Philadelphia, they're also used to make big decisions, such as which schools get closed or converted to charters.

In 2010, 75 percent of 11th graders at Comm Tech scored proficient or above in reading. That was a 22 percentage-point jump over the previous year.

In math, 70 percent of Comm Tech 11th graders scored proficient or above, 40 points higher than the year before.

An analysis commissioned by the Pennsylvania Department of Education suggested the results may be illegitimate. In both 2009 and 2010, a high number of student response sheets at Comm Tech had suspicious patterns of "wrong-to-right" erasures – a telltale sign of adult cheating.

Before the 2010-11 school year, Comm Tech's principal, Barbara McCreery, was replaced. That year, under Saliyah Cruz, the suspicious erasures went away. The school's scores tanked, dropping 38 points in reading and 45 points in math.

McCreery, now the principal at Bok Technical High, declined to comment for this story.

Cruz says she wasn't sure what to think when she walked into Comm Tech.

"I thought I was taking the helm of a high-performing school," said Cruz. "Although there were some red flags."

She says she tried talking to Comm Tech staff to get a handle on what was going on:

"'Guys, help me understand this. What were we doing last year that accounted for the kind of academic performance the kids had?'"

In response, says Cruz, staff pointed to "Study Island," a computer-based test prep program used at many District schools.

"It didn't make any sense," she said.

Despite her skepticism, Cruz says the 2010 PSSA results still led her to believe that only a small proportion of Comm Tech's students needed remedial help. Rather than overhaul staffing patterns and course schedules to allow for a schoolwide intervention, she expanded use of Study Island.

But early indicators signaled disaster. Reports generated by Study Island suggested that students didn't understand the material. Interim tests used to predict PSSA performance pointed to huge score drops. Cruz's own eyes told her that students weren't learning.

Some of her staff refused to believe any of it, she says.

"I got a lot of pushback," said Cruz. "'I don't care what all this data is saying, our PSSA scores say something different.'"

Her efforts to get some staff to change their instruction or re-teach content were rebuffed.

"I felt like I was running into a brick wall," said Cruz.

As a result, says Cruz, students at Comm Tech got a Band-Aid when what they really needed was surgery.

"I don't think the kids got the supports they needed," she said flatly.

Shortly after beginning her second year at Comm Tech, Cruz left the district.

'Wow'

Supovitz, the head of CPRE, has studied educational testing for 15 years.

A strong believer in standardized tests, he says that exams like the PSSA provide a reasonably accurate look over time at whether kids across a school or district are learning what they're supposed to learn.

The sharp spike in the 2010 Comm Tech scores should have provoked a closer look from central office, he said.

"That's the usefulness of these kinds of data," said Supovitz. "An administrator overseeing 250 schools can look and ask questions."

That's not generally how the scores have been used in Philadelphia, however.

Cruz says that if anyone inside Ackerman's central staff took a critical look at Comm Tech's PSSA results, she wasn't aware of it.

"I was not a part of any conversations like that," she said.

Instead, she says, district officials held up schools with improbable results as exemplars.

Cruz recalls vividly a citywide principals' meeting in 2010 at which former Roosevelt Middle School principal Stefanie Ressler was invited to present on her school's astronomical test score gains.

"I'm sitting there going, 'Well, how in the heck did she do that?'" recalled Cruz, who had just been removed as principal of West Philadelphia High. "I have the same resources, and I'm pulling my hair out, and I can't make those kinds of leaps."

Several members of Roosevelt's staff later alleged widespread adult cheating at their school. The state-commissioned analysis found overwhelming signs of suspicious erasures in every tested grade and subject at Roosevelt between 2009 and 2011. An investigation is ongoing. Ressler has previously declined comment.

Accounts from the unfolding cheating scandal have been hard to swallow, says Cruz.

Despite making widely praised improvements in the climate at West Philadelphia High, she was told during the 2009-10 school year that test scores weren't rising fast enough. Superintendent Arlene Ackerman designated the school for a complete overhaul as part of her Renaissance turnaround initiative. Cruz was ousted.

Since 2010, 26 schools - including West - have been either converted to "Promise Academies" or handed over to charter operators, largely on the basis of poor test scores. Last year, the District closed eight schools, based in part on the same scores.

While it's impossible to undo any of those decisions, it is not too late to "build a system that produces stable data we can have confidence in," said Supovitz.

No action yet from city or state

To date, though, both the Pennsylvania Department of Education (PDE) and the School District have declined to address the distorting effects of artificially inflated PSSA scores.

Both continue to use the three years of questionable results to guide significant policy decisions.

In September of this year, Secretary of Education Ronald Tomalis contended that 2012 is the first year in which the public can be confident that "PSSA scores are a true reflection of student achievement and academic progress."

Regardless, PDE does not appear to have adjusted the past AYP status of any district or school. Roosevelt Middle, for example, is still deemed to have met its performance targets in both 2009 and 2010 – and thus to have a more favorable AYP status now – despite the likelihood that its results from those years were tainted by cheating.

Through a spokesman, Tomalis did not respond to interview requests.

In an email, PDE spokesman Timothy Eller suggested that the state is waiting for its cheating investigation to conclude before making any decisions about adjusting AYP determinations.

"The department is considering the various options, and decisions will be announced when they are made," he offered.

The District has taken a similar stance.

No moves have yet been made to either remove the suspect data from use or to adjust it. Officials in Philadelphia still plan to use AYP status to help determine which schools to close this year. They also will apparently continue feeding questionable PSSA results into their School Performance Index, used to rank schools.

"The District will wait for the [investigation] findings before providing further comment on this issue," wrote Gallard.

What might have been

Saliyah Cruz has been affected by it all as much as anyone.

From her office at Kirk Middle School in Newark, Del., she still wonders where Philadelphia schools would be now if officials had promoted and supported a "realistic target" for test score growth, instead of touting implausible gains as the norm.

"When you set a system like that," Cruz concludes, "it's only a matter of time before you get the issues that we have now."

This article was reported as part of a joint project in covering the Philadelphia schools with the Public School Notebook.

A version of this story appears in the current print edition of the Notebook.

Support provided by

Saliyah Cruz while serving as West Philadelphia High School Principal. (Harvey Finkle / for the Public School Notebook)

What to do with PSSA scores tainted by cheating?

Howard Wainer poses a question:

If you measured the height of a group of kids and later found out some had been standing on a stool, what would you do?

"The statistical approach would be to try to estimate the height of the stool and make [an] adjustment," says Wainer, answering himself. "If the stool is a foot [tall], and everybody is suddenly a foot shorter than they measured, then you have to do something."

A leading national expert on testing and statistics, Wainer spent 21 years as the principal research scientist at the Educational Testing Service. He says the stool analogy is useful when considering the dilemma facing the Pennsylvania Department of Education (PDE) and the School District of Philadelphia.

A state-commissioned analysis found strong circumstantial evidence of adult cheating on state standardized tests in 2009, 2010, and 2011 at dozens of Pennsylvania schools, including 53 District-run schools and four area charters.

After scores dropped precipitously last year, Pa. Secretary of Education Ronald Tomalis suggested that PDE did not have confidence in the validity of the past results.

So what has been done?

A source familiar with the cheating investigation says officials from both the state and the District have quietly attempted to calculate how big an impact the cheating had on scores during the years in question. But so far, it appears that no one has used their results. The questionable, unadjusted scores continue to be used to hold schools accountable and make a wide range of high-stakes policy decisions.

Through their spokespersons, both PDE and the District suggested they are waiting for investigations to conclude before taking action. Both declined interview requests.

The Ohio Department of Education recently took a different approach. After finding evidence that attendance records in the Lockland School District near Cincinnati had been manipulated in order to boost test results, the department downgraded the ratings of the district and some of its schools.

"We want to make sure we give the taxpayers, the parents, and the students an accurate picture of how well their school is doing," said John Charlton, the associate director of communications for the department.

Wainer says it's not difficult to understand why an agency might move slowly to correct inflated test scores that created the false impression of improvement.

"Trust me, [a test score] wouldn't be used if it was 20 percent lower than it should have been," he says. "They would fix it."

Your browser is out-of-date!

Some features of this website (and others) may not work correctly with Internet Explorer 8 and below. Click below and we'll show you your upgrade options (they're free). -your friends at NewsWorks. Update my browser now