NIH Regional Consultation Meeting on Peer ReviewMeeting SummaryOctober 22, 2007 – Washington, D.C. Overview All of us at the agency thank you for taking time to help us address the important issue of peer review. You are here because of a community effort, a partnership is the way our agency works. We're asking you to think hard about this topic and share your ideas with us. The two-tiered NIH peer review system is the cornerstone of the enterprise of biomedical and behavioral research in this country and is emulated and respected worldwide. Let me just briefly describe how it works; if you want to know more, I encourage you to go to the web site for NIH's Office of Extramural Research. The first level of peer review takes place in scientific review groups (SRGs) composed mainly of scientists from the extramural community. These SRGs, also known as study sections, are managed by scientific review administrators (SRAs), who are located at the Center for Scientific Review (CSR), but also at each institute and center. The study sections in CSR help evaluate investigator-initiated applications. In contrast, the review branches of the rewarding institutes and centers have their own review staffs that manage study sections to help evaluate applications submitted in response primarily to RFAs or other unique programs targeted to a specific institute or center. The second-level review is carried out by the NIH advisory councils. These councils are composed of extramural scientists and public representatives. They ensure that NIH receives advice from a cross-section of the U.S. population geographically and demographically, and from interested constituencies. Over 30,000 scientists and members of the public advise us via study sections and advisory committees every year. This extraordinary guidance helps NIH choose the best scientists and best science to address the most important and compelling public health problems. The funds go to over 300,000 scientists at various stages and levels of the system, over 3,000 research institutions in every state, and many countries. According to A Half Century of Peer Review, 1946-1996, the first study section was devoted entirely to research related to syphilis. Over the next few months, other review panels were set up to look at the important scientific and medical issues of the day. The topics were a reflection both of the public health challenges the country faced in the 1940s and of the scientific opportunities most likely to advance broad biomedical research in this country. In 1949, Dr. Eleanor Darby was appointed as an executive secretary of the study sections, the first female scientist to serve on a study section. It took nearly 20 years for an African-American – Dr. Frank Johnson, an Army pathologist – to be appointed to a study section. Today we devote an extraordinary amount of time to ensuring our study sections include a diverse group of scientists, in terms of demographics and perspectives. We need to act to ensure that our peer review system keeps up with the ever-changing scientific and public health world before us. How can we find experienced and able reviewers to supply needed expertise? Are we encouraging more applications by offering too many grant mechanisms? What about technical issues? NIH grant applications are thought to be among the longest in the world in scientific review. Questions have been raised about success rates for first applications. And these are just a few of the questions we keep hearing. Although there have been periodic reviews of the system, the last one was about 7 years ago. The NIH leadership has decided to enlarge the conversation to determine how our peer review system can best work in today's context. We really care about your opinions, and we'll be listening to you. I want to thank Drs. Larry Tabak and Keith Yamamoto, who have devoted extraordinary time, effort, and thought to this process. Review of Ongoing Activities Dr. Lawrence Tabak We are involved in this self-study – in partnership with members of the scientific community and advocates – to strengthen peer review in these times when science has increased in its breadth and complexity. NIH must continue to adapt to rapidly changing science and ever-growing public health challenges. However we accomplish this, we have to ensure that the processes we employ are both efficient and effective for applicants and reviewers alike. The approach is straightforward: We're seeking input from a broad range of stakeholders, and today we're focusing on voluntary health organizations. We've also sought input from our own staff. Two committees were created to help oversee this program. The first is the Working Group of the Advisory Committee to the Director (ACD) of NIH, which Dr. Yamamoto and I co-chair. A number of people on this committee have served on the previous formal report of peer review, the so-called Boundaries Report. The second, equally strong group of individuals, comprise the internal NIH Steering Committee Working Group on Peer Review, which I co-chair with Dr. Jeremy Berg, director of the National Institute of General Medical Sciences. Here is a quick review of how peer review is accomplished at NIH. An investigator comes up with an idea for a proposal. He or she submits this to what we refer to as the first level of scientific review: a group of the individual's scientific peers, who gather to evaluate the merit of the proposal. About 70% of that review is conducted centrally through the CSR at NIH, and the remaining 30% is distributed across the institutes and centers at NIH. The information related to the review is then simultaneously transmitted to the applicant and the program officer in the appropriate NIH institute or center. Together, they review and evaluate the feedback from the first level of peer review. That information is then transmitted to the respective national advisory council of the institute or center in question. This group serves as our Board of Directors, and it conducts the second level of peer review by assessing not only the applications alone, but how they stand in the context of the overall mission of the institute and the balance of the scientific portfolio the institute and center support. Then they make their recommendation to the institute leadership, which makes final decisions related to the allocation of funds. Of course, these allocations need to be justified each year to Congress, which appropriates funds for our use in supporting research and training activities around the country. In our decision making process, every voice counts. We seek and listen to diverse opinion from many quarters, including the general public, scientists, patients and their advocacy groups, voluntary organizations, and the review committees I just outlined for you. Congress is not shy about letting us know how they feel about certain things, and then we have various advisory boards, health professional organizations, professional societies, industrial societies, and so forth. We are now in the diagnostic phase of this assessment. NIH solicited a request for information (RFI), where we asked for opinion from stakeholders on six specific questions. Respondents were asked to describe what they saw as the challenges to the NIH support system, specific challenges related to peer review, and solutions to these challenges. We then asked questions about the core values of the peer review process, the criteria we use to evaluate grant applications, and the scoring system we use. The final question related to whether the current process is appropriate for investigators at different stages of the career pipeline. Although the timeline for the RFI has ended, we're still accepting feedback at peerreviewrfi@mail.nih.gov. Additional activities include the following:
What will we do with all this information? NIH leadership will begin to determine next steps, and these likely will include pilot experiments. Armed with the results of the pilots and the evaluations, we will develop an implementation plan. Presumably the subset of pilots that prove most successful will be expanded and extended, ultimately leading to the development of a new peer review policy for NIH. I’d like to share with you some emerging ideas that might be of particular interest. This is a subset of the many ideas we have heard. These are not prioritized in any way, and they are presented only to facilitate discussion.
Goals for the Meeting The title of my presentation comes from the charge that Dr. Zerhouni gave the Working Group: “Fund the best science, by the best scientists, with the least administrative burden.” Another quote – "The only possible source for adequate support of our medical schools and medical research is the taxing power of the federal government. . . Such a program must assure complete freedom for the institutions and the individual scientists in developing and conducting their research work" – came from the Surgeon General in December 1945, virtually the month before the peer review system was established at NIH. It was under that framework and that hope that NIH undertook this task of assessing the kind of science that was being done and the ways it would be supported by the federal government. Against that backdrop, here is a 2005 quote from Tom Cech, Nobel Laureate and head of the Howard Hughes Medical Institute: "Discovery and innovation are to some extent taking place in spite of, rather than because of, the current policies and practices of major biomedical funding agencies.” Many would agree that peer review is the only system for funding the best science. At the same time, we need to acknowledge that the system has intrinsic conflicts that are not going to go away. The first is reviewer self-interest, in that the very people who make the assessments are active researchers drawing from the same resource pool as the applicants. The second intrinsic conflict is reviewer conservatism. To have the best system, we need to have the best scientists participate in the review process. These scientists created the prevailing paradigms, and so they will defend them. The doing and reviewing of science have changed dramatically, not just in this last 60 years, but in the last few years. And it continues to be a very strong dynamic that is affecting the way these processes are carried out. The nature of research itself is changing. This means that every investigator can undertake research with a much broader scope than before. Technology continues to be a major driver in research, much more than in times past. This creates more complexity, requiring more expertise on the part of the investigator and more people to get the job done, often in multidisciplinary teams. It's actually become relatively rare that science is carried out as a single-investigator endeavor without other very important collaborators coming into play. Over recent years, the doubling and then flattening of the budget has driven an explosion in applications. More people were hired during the doubling, buildings were built, new institutes were established, and many people flooded the system. When the budget flattened, everyone had to scramble to find more money, and this meant an increase in applications. Currently, NIH processes and reviews 80,000 applications a year. With these increased applications has come a vast increase in the number of reviewers. Twenty years ago, NIH used 1,800 reviewers; last year, the total was 18,000. A relatively small number of these were senior scientists. The study sections, which are mandated by Congress to have 15 to 20 chartered members each, began to fill up with ad hoc reviewers who might be able to address one point in one grant, but were otherwise unable to bring the kind of perspective that's required. Finally, with this perception of concern about resources, reviewers increasingly have taken an adversarial stance in the process. The peer review system clearly needs to evolve and adapt to these changes. We are looking for bold thinking in all areas. All of the following, and more, are open for discussion:
To fund the best science by the best scientists with the least administrative burden means we can't do it piecemeal. We need to be rigorous, tough, and fair, and we need to:
Statements/Proposals from External Scientific Community Offering Specific Strategies or Tactics for Enhancing NIH Peer Review and Research Support Dr. Marc C. Hochberg, The Arthritis Foundation My comments represent the views of the research advisory council of the Arthritis Foundation. They were prepared by Dr. John A. Hardin, the chief scientific officer of the Foundation, who regrets he was unable to be here today.
Dr. F. Owen Black, Director, Neurotology Research, Legacy Clinical Research and Technology Center, Legacy Health System, Portland, Oregon; Vestibular Disorders Association Medical Advisory Board The NIH peer review process has not benefited vestibular patients. According to NIH, about 30 million people in the United States have hearing losses. Although at least twice as many as those patients have vestibular disorders, the vestibular community receives less than 10% of the NIDCD budget. From our perspective, the peer review process has deteriorated into an opinion-swapping event, in the sense that many, if not most, of the opinions expressed cannot be factually substantiated. If one reviews the summary statements, most are not backed by peer- reviewed references to the literature, nor can one find a rationale for those comments. We recommend that a process be developed to qualify reviewers via training in all the areas listed by Dr. Yamamoto. One possible basic model is the airline industry methods for pilot qualification and re-qualification. These principles have been applied to monitoring anesthesia procedures in the operating room, resulting in a drop in morbidity and mortality from approximately 3% to 5% percent to less than 1/10 of 1%. Another good example is the Center for Certification of Rehabilitation Institutes, in which the system reviews the reviewers, and the reviewers anonymously review each other. I'd like to read the statement we agreed upon: It is herewith proposed that the implementation of an objective – that is, a fact- and performance-based process – to qualify NIH reviewers be considered. The simultaneous development of objective measures for acceptable peer review performance, with respect to the goals outlined, for example, in the “NIH Roadmap,” would likely yield a much higher return on the public investment toward improving accomplishments of the NIH mandate to conduct health care research would be better served in the public interest, in our opinion. Dennis Coleman, Director, NIH Liaison Office I based my comments on the six issues defined on the web site:
Amy Comstock Rick, Chief Executive Officer, Parkinson’s Action Network Our comments address whether there is a sufficient connection between the current peer review system and NIH's mission of promoting research that will help prevent, diagnose, and treat disease. We do not disagree in the least that it is scientists who should be assessing the scientific merit of grant applications, but we do think there are ways to increase efficiencies. Study sections may have a bias toward basic research, which would then fundamentally interfere with NIH's mission of better diagnosing and treating diseases. For example, more innovative ideas in translational or clinical research that may not have the data to support the hypotheses may have a lower score, simply because of the nature of the research. I recently served 2 years on a working group for the NINDS Council that assessed 12 NINDS centers. In the course of this review, we encountered some problems with the peer review process and how it funds the centers. There was unanimous agreement in the working group that the study section reviewer didn't have the opportunity to learn about the program and its unique aspects. In the report we presented to the Council for NINDS in August, we recommended that only one study section handle all the grant applications for this particular center program, and that they be trained to understand the applications they review. Given the limited number of hours that each institute’s council meets each year, it is unrealistic to think the council could adequately assess each grant application in terms of public health relevance. Yet, this is of critical importance to American families facing Parkinson’s and other diseases. We recommend that the NIH peer review process be revamped to ensure an assessment of relevance to the agency mission, importance to the translational and critical research portfolio, and context of the application in terms of disease portfolio and gaps in knowledge. We recommend a three-tier model, whereby council continues to assess program priorities and gaps, and a separate body assesses the grant applications in terms of significance and relevance to public health, ensures there isn't redundancy in areas where validation is not needed at this time, and determines whether this is a good use of taxpayer dollars. The Department of Defense’s Congressionally Directed Medical Research Program (CDMRP) has a system we think works pretty well. The CDMRP panel determines annual priorities by identifying gaps in knowledge of disease, areas that will accelerate the field of research, among other criteria. The annual priorities are then incorporated into program announcements. After grant applications are received, the peer review panel evaluates the science, based on the program announcement’s established evaluation factors as well as the budget of each submission. The programmatic review panel evaluates the proposals on a comparison basis and identifies those with the greatest programmatic relevance, as well as disease relevance, innovation, and other factors. We also recommend that study sections be targeted by basic vs. translational and clinical research. There are different levels of supporting data, and that level of expertise is important for the study sections to understand. M. Carolina Hinestrosa, Executive Vice President, Planning and Programs, National Breast Cancer Coalition Advocates must be knowledgeable and confident in order to participate in the decision-making process of science and medicine. NBCC's Project Live is an intensive science course that teaches breast cancer advocacy, biostatistics, and epidemiology. Graduates also participate in continuing education workshops and in an online community. NBCC believes that to have a meaningful impact on the research process, advocates involved must be individuals who have been personally affected by breast cancer; in some cases, as in the peer review process, they must be individuals who have had the disease. Nevertheless, the peer review process has traditionally excluded those most affected by breast cancer research. Additionally, activists must be involved in the programmatic review process, determining priorities for funding and mechanisms. The Department of Defense Peer Review Breast Cancer Research Program has proven that this is an effective and valuable model of scientist/activist collaboration. NBCC has been closely involved throughout the evolution of this program. This collaboration of leading scientists and consumers has brought about a mindset that fosters the generation of new ideas and risk-taking in research and the training of scholars in nontraditional disciplines that could potentially offer new insights into this complex disease. Innovative funding mechanisms that fill important gaps in the landscape of breast cancer research – such as concept awards, idea awards, synergistic idea awards, and innovator awards – are examples of many products of this collaboration. Many of these mechanisms were followed a couple of years later by the NIH; these really filled a gap and have been replicated by other funding agencies. However, we find that the peer review panels often struggle when reviewing proposals and the new mechanisms, particularly in the assessment of unusual criteria such as innovation, synergy, multidisciplinary training, and even impact. While some criteria may be subjective, as is the case with innovation, it is disconcerting how risk-averse scientists can be. The opposite extreme is how often applications that establish techniques in a different cell line, for example, are considered innovative; or how just bringing a biostatistician to a clinical research project is considered multidisciplinary research by some. It is also disconcerting when we look at peer review scores and find that a low-scoring proposal, with more weaknesses than strengths listed, still gets a rating of excellent. When consumer advocates have been meaningfully involved in the research process, they have changed the culture of research and research programs for the better. This is corroborated by the Institute of Medicine's assessment of the DOD Breast Cancer Research Program, and it is a model we urge this panel to consider. Jane Holt, President, National Pancreas Foundation I have chronic pancreatitis, and I'm here as a patient to ask for your help. The National Pancreas Foundation addresses all diseases of the exocrine pancreas. Although our area of interest is specialized, it shares something important with a number of other research areas: It is vastly underfunded in proportion to the burden of the disease. We believe that with the proper level of funding, it is possible to adjust this imbalance to benefit patients and their families, clinicians, hospitals, and research labs. NIH selects its currently funded researchers for its study sections. These reviewers have already proven they can meet the rigorous standards of excellence NIH demands of its principal investigators. But this becomes a problem for diseases that are underfunded. The ranks from which NIH can choose its reviewers have become smaller. Fewer reviewers with this expertise makes for fewer funded applications, and it becomes a vicious cycle. There are excellent doctors out there that want to do this research and who also deserve consideration as reviewers on these study sections. Our organization will be more than happy to help funnel these doctors to NIH for consideration as reviewers. It's especially important for pancreatic applications to be submitted to reviewers with direct pancreatic expertise. Pancreatic science is notoriously difficult to do. When reviewers from other disciplines are asked to review pancreatic applications, they hold them to the same standards set for their own more sophisticated fields. As a result, the applications get graded down, which hurts our field tremendously. For our field, any advance, no matter how small or seemingly inelegant, can make a huge difference for our patients. The situation has other repercussions. Doctors who have dedicated their lives to unlocking the mysteries of pancreatic disease now find that in order to procure funding, they may need to alter their ideas in order to even be considered. The science suffers, but most of all, the field suffers. Young fellows see what is going on and turn away from the field, fearing for their future livelihood. The use of special-interest panels and RFAs helps, but it doesn't do enough. We propose that NIH establish new study sections devoted to specific underfunded areas of research. Fund the good science, advance it to the next level of sophistication, and then review whether the new study sections need to continue to exist. Susan W. Kayne, Director, Marketing and Communications, National Eating Disorders Association It is clear to our constituency that while the number of diagnosed cases of eating disorders is rising sharply, support for research to find a cure is sorely limited. We come here today to support the Academy of Eating Disorders' recommendations to improve the NIH peer review process. Our concern, like the Academy's, is that NIH reviews are often conducted by scientists who are not familiar with the state of the science in eating disorders. Review groups often have no one or only one person with eating disorder expertise, which places inordinate value on the opinion of non-experts or a single expert in the field. We suspect this occurs in other small fields as well. We support two Academy recommendations to improve the peer review process:
Sue Levi-Pearl, Vice President, Medical & Scientific Programs, Tourette Syndrome Association, Inc. For over 2 decades, the Tourette Syndrome Association has provided annual seed grants to individual investigators interested in both clinical and basic science projects relevant to Tourette syndrome (TS). From the outset, we have relied on funding recommendations from an expert, conflict-free, multidisciplinary scientific advisory board that uses the NIH peer review system as a model. Our objective has always been to support creative, methodologically sound studies. We have succeeded extraordinarily well, with almost all NIH-funded TS investigators having first received grant awards from our association. The following are our observations concerning experiences with NIH peer review procedures:
Compilation of Key Points Made during Open Discussion Sessions Applicant Feedback/Interaction
Applications
Criteria for Funding
Expertise
Funding
Grant Mechanisms
Junior Investigators
Miscellaneous
Reviewer Issues
Scoring/Ranking
Setting Priorities
Staffing Panels
Study Section and Review Models
Training
Closing Remarks Dr. Kington We want to thank all of you again for coming here today and giving us the opportunity to listen to you. I'm always impressed by the breadth of comments from advocacy groups who really care about what we do because you want us to be effective.
You served two roles here today: to deliver ideas, information, feedback, suggestions, and criticisms; and to listen to each other. As you go back into your constituent communities, I hope you will keep this lively debate going. I'm going to make a few comments on your comments. These are not meant as criticisms or invalidations, but just to point out how complex and multifaceted these issues are. Several words have continued to recur in the discussion today. One is the “third-tier” idea, with a particular eye toward being able to asses early on the relevance of a particular area of investigation, focus of an individual application, and so forth. We do have a wise body looking across the entire spectrum of the 80,000 grant applications that come in. Is there a group that would be able to oversee the whole show in a way in which you would be happy to entrust them? It may be a tough challenge. Secondly, relevance is very tough to assess. It is not intrinsic, nor does one know what will be relevant tomorrow to your disease area of research. The bold and innovative research that we're all reaching for depends on giving scientists the freedom the Surgeon General’s quote was about in 1945. Providing that freedom is what attracts the kind of people you want to have working in this field. So there is a certain fragility to this in terms of saying, "No, we'll tell you what to work on, and you just work on it." “Milestones” and “deliverables” were words that came up early on. These reflect efforts to try to make research more responsible to the initial goals – or, as one speaker said, to give study sections a little more assurance that they can go for a bold idea because there will be milestones in 5 years to show the money won't be wasted. But let me remind you that the other side to this coin is that it will make investigators more conservative in their grant applications. The need for separate reviews of basic and clinical research was voiced by many of you, and I totally understand that the expertise for your particular focus just isn't there. We all want that. The peer review system depends upon that kind of expertise in the room or established by some other mechanism, like the editorial board system described today. Having the right expertise in the room is critical. But keep in mind that an increasing number of grant applications will come forward because of this new breadth we're enjoying in the field, and these will include basic, clinical, and translational components. Should all grantees be required to serve on study sections? Should we give them a “free pass" of extended support on their current applications? Maybe. I think wanting to incentivize people to be involved is important. It's not quite clear to me that we want every single NIH grantee to be a study section member. We certainly want the best ones. And if we inserted incentives such as extending the grant period, wouldn't we suddenly have transformed the peer review system into an entitlement system? Is that the kind of incentive we're looking for? Separate study sections for new investigators is an interesting idea. NIH has been looking for ways to better recognize and support new investigators to help them traverse the difficult transition from post-doc. The average age for a new investigator establishing a laboratory at the Ph.D. level is now 38, and the average age of getting the first independent grant support is 42. For M.D.s, add 2 years onto each of those numbers. So all the numbers are scary, but the scariest to me is the 4 years between getting the job and getting the grant. During that time, a lot of corrosive things can happen to somebody's psyche. So how do we support new investigators? One idea is separate study sections, but the other side of the coin is that you want the right expertise in the room. If you just segregate out new investigators, they're going to cover all the research NIH does. Are we going to be able to have the right expertise in the room for those who might, arguably, need it the most? Are there ways to shorten grants, and what are the costs of doing that? It’s been said this could strongly disadvantage young/new investigators, but I'd need to be convinced of that. All of us in academia are used to making bets on young people. That's how we hire our new faculty and choose the post-docs that come to our laboratories. We don't make them write 25-page grant applications to do that. How have they climbed up the ladder to succeed to the point that they have carried out all their training and competed with hundreds of people to get their new jobs? They've done it by writing 2- and 3-page applications for fellowships and support, abstracts for going to meetings and giving presentations, and so forth. It's not completely clear to me that 25 pages is a minimum for a young person to be able to write applications. It would literally be the first time they will have done that. Some innovative suggestions include the idea of formalizing a risk-benefit ratio to assess the kind of difficult-to-grasp notion of higher risk or transformative research; the notion of some sort of qualifying or certifying requirement for reviewers to ensure they have the right expertise; and the third tier. Actually, two third-tier proposals were put forth: One would assess significance, relevance, and so forth; the other would send a letter of intent to all the experts in the field and see what they think before moving on to the formal study section review. Many good ideas were put forth today, and you should feel very good about the service you have done here on behalf of NIH and of the system as a whole. We are very grateful for your participation and want to thank you again for sharing your energy and ideas with us. This page was last reviewed on November 13, 2007.
|

