In May, the OpenTrialsFDA team (a collaboration between Erick Turner, Dr. Ben Goldacre and the OpenTrials team at Open Knowledge) was selected as a finalist for the Open Science Prize. Working towards a first prototype in early December, OpenTrialsFDA will make the Drug Approval Packages (DAPs) from the FDA website easily accessible and searchable and link these to documents and data related to clinical trials. Other interested parties will also be able to access, search and present this information through the application programming interfaces (APIs) the team will produce.
The Food and Drug Administration (FDA) publishes DAPs as part of the general information on drugs via its data portal known as [email protected]. These documents contain detailed information about the methods and results of clinical trials, and are unbiased, compared to reports of clinical trials in academic journals. This is because FDA reviewers require adherence to the outcomes and analytic methods prespecified in the original trial protocols, so, in contrast to most journal editors, they are unforgiving of practices such as post hoc switching of outcomes and changes to the planned statistical analyses. These review packages also often report on clinical trials that have never been published.
However, despite their high value, these FDA documents are notoriously difficult to access, aggregate, and search. The website itself is not easy to navigate, and much of the information is stored in PDFs or non-searchable image files for older drugs. As a consequence, they are rarely used by clinicians and researchers. OpenTrialsFDA will work on improving this situation, so that valuable information that is currently hidden away can be discovered, presented, and used to properly inform evidence-based treatment decisions.
The team has started to scrape the FDA website, extracting the relevant information from the PDFs through a process of OCR (optical character recognition). A new OpenTrialsFDA interface will be developed to explore and discover the FDA data. In addition, the information will be integrated into the OpenTrials database, so that for any trial for which a match exists, users can see the corresponding FDA data.
We will be sharing future progress through this blog as the work develops: the final prototype will be presented in early December at the Open Science Prize Showcase.
More information about the Open Science Prize: https://www.openscienceprize.org/res/p/finalists/