- Home
- Commentaries and Reports
- An On–Going Discussion with Reid Lyon
An On–Going Discussion with Reid Lyon
- By Michael F. Shaughnessy Senior Columnist EducationNews.org
- Published 06/3/2008
- Commentaries and Reports
-
Rating:




Michael F. Shaughnessy Senior Columnist EducationNews.org
Dr. Shaughnessy is currently Professor in Educational Studies and is a Consulting Editor for Gifted Education International and Educational Psychology Review. In addition, he writes for www.EdNews.org and the International Journal of Theory and Research in Education. He has taught students with mental retardation, learning disabilities and gifted. He is on the Governor's Traumatic Brain Injury Advisory Council and the Gifted Education Advisory Board in New Mexico. He is also a school psychologist and conducts in-services and workshops on various topics.
View all articles by Michael F. Shaughnessy Senior Columnist EducationNews.orgAn On–Going Discussion with Reid Lyon
Senior Columnist Columnist EducationNews.org
Eastern New Mexico University
QUESTION :
Reid, since I have interviewed you recently about Reading First, the "comments " section has been a veritable " hot bed of activity" with a number of questions, comments and concerns.
Since not all of our readers read each and every comment, could you briefly summarize what you see in your mind, as the TOP issues that the comments section seems to have brought up?
ANSWER
One top issue that not only seems to come up in the comments to the interview is that some folks may not have a clear understanding of what the Reading First Implementation Study (RFIS) is evaluating.This can be seen in comments both in EdNews and elsewhere that the program to include the assessment, instructional, and professional development components were not effective.But the RFIS did not evaluate the effect of any particular program(s), assessments, or professional development strategies, alone or in combination, on reading comprehension.The RFIS was designed to measure the extent to which a specific funding stream in the form of Reading First (RF) money impacted reading comprehension.
The impact of Reading First funding was addressed by comparing eligible RF schools who received Reading First money with eligible schools that did not receive R F money.The RFIS IS NOT an experiment to test the efficacy of the intervention packet defined by RF (e.g., instructional programs, assessment and professional development strategies, etc.). It's an impact evaluation of a treatment (THE GIVING OF MONEY) in the setting of an effectiveness trial. In an effectiveness study, the "control" is not controlled, nor is the treatment. The study team was not able to prescribe any behaviors on the part of the comparison schools other than compliance with testing of students, and observation of instruction. For this type of question – funding versus no funding, the regression-discontinuity design the evaluators used was entirely appropriate. But it is possible, if not probable, that the funding of Reading First eligible schools caused changes in non-reading first schools (the comparison group) that were not anticipated. For example, we know from state Reading First evaluation reports, that some eligible RF schools not receiving funding implemented similar professional development and instruction programs as did the funded schools. They may – and many did – receive additional state/district funding to do so (more on this later). So the assumption that the eligible non- funded RF schools would continue doing what they were always doing is not valid in many cases.
Again, it is critical to understand that the RFIS did not examine the specific effects of programs, materials, or the impact of professional development, etc., on reading outcomes. Answers to these questions would have been more informative in an impact study that was designed to look at variance in treatment effects.The RFIS was supposed to do this among many other analyses, but it did not.It is possible that some data on program specific effectiveness with better comparisons will be produced in the final report, but the current design and scope of the study makes this doubtful.
As has been seen, neither the education press reporting on the study nor several commentators in the reading community had a clear understanding of what the study is evaluating.Toppo from USA Todayled out with "Study: Bush's Reading First program ineffective,without explaining that the study only examined the impact of a funding stream and not the specific programs being purchased by the funding stream.He goes on to write, "Advocates of Reading First, an integral part of the 2002 No Child Left Behind law, have long maintained that its emphasis on phonics, scripted instruction by teachers and regular, detailed analyses of children's skills would raise reading achievement, especially among the low-income kids it targets. But the new study by the U.S. Education Department's Institute of Education Sciences (IES) shows that children in schools receiving Reading First funding had virtually no better reading skills than those in schools that didn't get the funding".Unfortunately, the RFIS, as designed, is not capable of examining whether "scripted" phonics instruction had a differential impact on reading comprehension.Somehow, he forgets this critical feature while at the same time, reverting back to his obsession with phonics as synonymous with Reading First (Greg, please read the darn legislation).Sam Dilllon, reporting in the N.Y., Times leads out with "An Initiative on Reading Is Rated Ineffective"without explaining that that the RFIS examined the impact of funding rather than the impact of instructional programs, assessments, professional development programs and the like.Why is this a problem?Because he goes on to associate the null findings reported in the Interim Report with statements from Higgins, Kennedy, and Miller that allude to publishers and programs.But specific programs, no matter who published them, were not evaluated for effectiveness.
So why did the RFIS not evaluate the impact of what was transpiring in schools and classrooms that received Reading First funding (other than the amount of time spent in instruction by reading component).I can only guess at this point.First, it does not appear that IES or the contractors actually examined the legislative language that required the evaluation of the Reading First program.Had they done so, this is what they would have seen:
the evaluation Shall (meaning must) conduct:
1) An analysis of the relationship between each of the essential components of reading instruction and overall reading proficiency.
(2) An analysis of whether assessment tools used by State educational agencies and local educational agencies measure the essential components of reading.
(3) An analysis of how State reading standards correlate with the essential components of reading instruction.
(4) An analysis of whether the receipt of a targeted assistance grant under section 1204 results in an increase in the number of children who read proficiently.
(5) A measurement of the extent to which specific instructional materials improve reading proficiency.
(6) A measurement of the extent to which specific screening, diagnostic, and classroom-based instructional reading assessments assist teachers in identifying specific reading deficiencies.
(7) A measurement of the extent to which professional development programs implemented by State educational agencies using funds received under this subpart improve reading instruction.
(8) A measurement of how well students preparing to enter the teaching profession are prepared to teach the essential components of reading instruction.
(9) An analysis of changes in students' interest in reading and time spent reading outside of school
(10) Any other analysis or measurement pertinent to this subpart that is determined to be appropriate by the Secretary.
Second, given that the recruitment of contractors and their planning of the evaluation was delayed for unknown reasons, the amount of time requiredto carry out the tasks required in the evaluation (above) were probably not possible.Apparently then, the narrow questions addressing the impact of Reading First funding, while an important part of the evaluation, were addressed in isolation.Note that the delay in starting the evaluation was a concern expressed early on by staff from the House Education and Work Force Committee – a concern expressed in documents sent to the Secretary of Education (Paige) and in face to face meetings with IES and the contractors.
Third, in discussing the RFIS with people working on the evaluation, some were under the impression that the current study was the best that could be done given the resources at hand.One advisor stated that he was literally shocked to learn (1) that the congressional intent was to address tasks 1 through 10 above, and (2) that the Department had been allocated $150 million dollars ($25 million per year) to address the evaluation tasks in detail.
Not only did the education press not report these issues, they were remarkably silent on what appears to be a significant problem with the study – at least as reported in the Interim Report.While there are several confounds that limit interpretation of the data presented in the RFIS Interim Report, a hefty one is a lack of control over what is taking place in eligible Reading First Schools that were funded and Eligible Reading First schools that were not funded.A major problem is that the funded schools and the non-funded schools were doing the same thing in many cases.Tim Shanahan, an advisor to the RFIS, and one deeply familiar with not only the current study but previous implementation studies has explained this clearly in a Q and A with Eduflack.Rather than summarize, it is important to look at the details.With Tim's and Eduflack's permission, here is the interview:
EDUFLACK: What does the IES study really say? How strong are the findings?
SHANAHAN: THE IMPLEMENTATION STUDIES INDICATE THAT THE DIFFERENCES BETWEEN RF AND NON-RF SCHOOLS WERE PRETTY MODEST (ABOUT 50 MINUTES OF INSTRUCTIONAL DIFFERENCE PER YEAR IN AMOUNT OF INSTRUCTION), MEANING THAT RF KIDS PROBABLY RECEIVED FEWER THAN 30 HOURS OF ADDITIONAL READING INSTRUCTION EACH YEAR DUE TO THE INTERVENTION. CLEARLY A MODEST INTERVENTION, ESPECIALLY GIVEN THE SIMILARITIES IN CURRICULUM, INSTRUCTIONAL MATERIALS, PROFESSIONAL DEVELOPMENT, AND ASSESSMENTS.
Q: How valid are the findings, knowing there may be contamination across groups (that both the RF and non-RF groups may have been doing the same things in the classroom)?
A: MOST SCHOOLS EMPLOY SOME KIND OF COMMERCIAL CORE PROGRAM. WHEN READING FIRST EMPHASIZED THE ADOPTION OF PROGRAMS WITH CERTAIN DESIGNS ALL MAJOR PUBLISHERS CHANGED THEIR DESIGNS TO MATCH THE REQUIREMENTS.
READING FIRST SCHOOLS ALL BOUGHT NEW PROGRAMS IN YEAR 1; ALMOST ALL OTHER TITLE I SCHOOLS ADOPT NEW CORE PROGRAMS EVERY FOUR OR FIVE YEARS. THAT MEANS IN YEAR 1, 100% OF THE RF SCHOOLS GOT A NEW PROGRAM, AND 25% OF THE OTHER SCHOOLS DID. IN YEAR 2, THAT NUMBER WENT TO 50%, IN YEAR THREE 75%. ALL RF SCHOOLS HIRED COACHES IN YEAR 1, SO DID MORE THAN 80% OF THE OTHER SCHOOLS. ETC.
THIS ISN'T A CASE OF SPOT CONTAMINATION, IT WAS INTENTIONAL AND PERVASIVE (IN FACT, IT WAS PART OF THE RF LAW ITSELF—20% OF THE STATE MONEY, THAT MEANS $1 BILLION TOTAL WAS DEVOTED TO GETTING NON-READING FIRST SCHOOLS TO ADOPT THESE REFORMS).
Q: Given that contamination, are there contamination rates that can be tolerated in the design? For example, let's say 15 percent of the RF and comparison groups received identical programs/PD. Is this level of contamination tolerable? What if there is a 30 percent overlap – is this level tolerable? Are there ways to estimate the degree to which percent contamination will indicate a need to increase sample size?
A: THE PERCENTAGES OF OVERLAP WERE 75-100% DEPENDING ON THE VARIABLE. THE ONLY ONE WHERE WE HAVE ANY KIND OF IDEA ABOUT WHAT IS TOLERABLE IS WITH TIME.
FROM PAST RESEARCH, ONE SUSPECTS THAT 100 HOURS OF ADDITIONAL INSTRUCTION WOULD HAVE A HIGH LIKELIHOOD OF GENERATING A LEARNING DIFFERENCE, A 50-60 HOUR DIFFERENCE WOULD STILL HAVE A REASONABLE CHANCE OF RESULTING IN A DIFFERENCE. AT 25-30 HOURS A SMALL DIFFERENCE IN LEARNING MIGHT BE OBTAINED, BUT IT IS MUCH LESS LIKELY (ESPECIALLY IF THE CURRICULA WERE THE SAME).
Q: Did the evaluation design include procedures/strategies to avoid contamination between RF and the comparison group?
A: IT [THE IES STUDY] NOT ONLY DID NOT TRY TO AVOID CONTAMINATION, IT COULDN'T POSSIBLY DO IT SINCE THE SOURCES OF THE CONTAMINATION WERE SO PERVASIVE. FIRST, THE FEDERAL POLICY EXPLICITLY CALLED FOR SUCH CONTAMINATION TO BE PUSHED. SECOND, STATES AND LOCAL DISTRICTS MADE THEIR OWN CHOICES (AND THEY FELT ENTICED OR PRESSURED TO MATCH RF).
FOR EXAMPLE, SYRACUSE, NY RECEIVED READING FIRST MONEY FOR SOME SCHOOLS, BUT MANDATED THAT ALL OF ITS SCHOOLS ADOPT THE SAME POLICIES AND PROGRAMS. THERE SHOULD HAVE BEEN NO DIFFERENCES BETWEEN RF AND NON-RF SCHOOLS IN SYRACUSE, THE ONLY DIFFERENCE WOULD BE IN FUNDING STREAM—HOW THE CHANGES WERE PAID FOR, AS THE NON-RF SCHOOLS ATTENDED THE SAME MEETINGS AND TRAININGS, ADOPTED THE SAME BOOKS AND ASSESSMENTS, RECEIVED THE SAME COACHING, PUT IN PLACE THE SAME POLICIES, ETC.
Q: Did the evaluation design describe practices in the comparison groups?
A: YES, THE IMPLEMENTATION STUDIES SHOW THE SIMILARITIES IN PRACTICES AND HOW, OVER TIME, THE PRACTICES THAT WERE SIMILAR AT THE BEGINNING BECAME INCREASINGLY SIMILAR EACH YEAR. THAT WILL BE CLEARER IN THE NEXT STUDY OUT
Q: Did the evaluation design account in any way for contamination, crossover, compensatory rivalry, etc.?
A: NO. THE FEDERAL LAW CALLED FOR THE EVALUATION OF READING FIRST IN TERMS OF THE EFFECTIVENESS OF THE INSTRUCTIONAL MODEL, BUT DID NOT CALL FOR A STUDY OF THE IMPACT OF READING FIRST UPON THE ENTIRE EDUCATIONAL SYSTEM.
EVEN THOUGH I HAD PERSONALLY MADE A BIG DEAL OUT OF THE PROBLEM FROM THE VERY FIRST STUDY DESIGN MEETING, THE METHODOLOGISTS THOUGHT THEY COULD HANDLE MY PROBLEM SIMPLY BY ACCOUNTING FOR THE RF ROLLOUT EACH YEAR. THEIR ASSUMPTION WAS THAT RF WOULD IMPLEMENT SOME CHANGES IN YEAR 1, OTHERS IN YEAR 2, AND STILL OTHERS IN YEAR 3 AND THAT THIS PATTERN OF IMPLEMENTATION WOULD ALLOW THEM TO EXAMINE A CONTINUING LAG BETWEEN THE RF AND NON-RF SCHOOLS.
I DIDN'T UNDERSTAND THAT THEY WERE THINKING THAT AND THEY NEVER ASKED DIRECTLY ABOUT THAT. LAST YEAR, I FIGURED OUT WHAT THEY WERE THINKING AND I HAD TO EXPLAIN SEVERAL TIMES THAT RF PUT ALL OF ITS REFORMS IN PLACE DURING YEAR 1, WITH NOTHING NEW IN YEARS 2 AND 3, SO IT WOULD BE IMPOSSIBLE TO TEST THE EFFECTS OF DIFFERENT PARTS OF THE IMPLEMENTATION, ETC. USING THEIR APPROACH. I MIGHT HAVE BEEN ABLE TO GET THIS FIXED IF I HAD UNDERSTOOD THAT THEY WERE ASSUMING THAT KIND OF DESIGN (OR IF THEY HAD ASKED ME ABOUT THAT SPECIFICALLY).
Q: Can we assume that the RF group is just like the comparison group except for exposure to RF funding? I the counterfactual valid?
A: READ THE IMPLEMENTATION PART OF THE REPORT (AND THERE IS ANOTHER STUDY COMING LATER THAT WILL MAKE THIS CLEARER) AND YOU'LL SEE THE DEGREE OF SIMILARITY IN THE KEY FACTORS BETWEEN THE TWO SETS OF SCHOOLS. I RAISED THIS AS A THEORETICAL PROBLEM ORIGINALLY, BUT THE IMPLEMENTATION STUDY CLEARLY SHOWS THAT CONTAMINATION WAS A BIG PROBLEM (IT CANNOT TELL US WHETHER THE CONTAMINATION CAME FROM THE $1 BILLION FEDERAL EXPENDITURE ON THIS, BECAUSE THE STATES AND LOCAL DISTRICTS OFTEN SIMPLY ADOPTED THE SAME IDEAS.
AS ONE ILLINOIS DISTRICT TOLD ME, "IF THIS IS THE RIGHT STUFF TO DO, THEN WE ARE GOING TO DO IT WITH EVERYONE."
All this makes you want to scratch your head.Somehow, an opportunity to design and conduct one of the most comprehensive evaluations of an educational program was squandered despite having the resources to carry it out.It boggles the mind.
It is interesting that the public was not presented more accurate information about what the RFIS was designed to do and what it did not do.It examined the impact of funding on an educational outcome, not the impact of what the funding paid for and its impact on reading outcomes.It is also interesting that the risk and the effects of contamination across Reading First and comparison schools were not more fully addressed in the Interim Report and in the press. Shanahan's analysis indicates clearly that the evaluators knew that this was a problem.The fact that the evaluation that was carried out was not the one that Congress intended is, for lack of a better word, strange.
Published June 4, 2008
Spread The Word
Comments

















