Abstract

Introduction

It is important to monitor the safety profile of drugs, and mining for strong associations between drugs and adverse events is an effective and inexpensive method of post-marketing safety surveillance.

Objective

The objective of our work was to compare the accuracy of both common and innovative methods of data mining for pharmacovigilance purposes.

Methods

We used the reference standard provided by the Observational Medical Outcomes Partnership, which contains 398 drug–adverse event pairs (165 positive controls, 233 negative controls). Ten methods and algorithms were applied to the US FDA Adverse Event Reporting System data to investigate the 398 pairs. The ten methods include popular methods in the pharmacovigilance literature, newly developed pharmacovigilance methods as at 2018, and popular methods in the genome-wide association study literature. We compared their performance using the receiver operating characteristic (ROC) plot, area under the curve (AUC), and Youden’s index.

Results

The Bayesian confidence propagation neural network had the highest AUC overall. Monte Carlo expectation maximization, a method developed in 2018, had the second highest AUC and the highest Youden’s index, and performed very well in terms of high specificity. The regression-adjusted gamma Poisson shrinkage model performed best under high-sensitivity requirements.

Conclusion

Our results will be useful to help choose a method for a given desired level of specificity. Methods popular in the genome-wide association study literature did not perform well because of the sparsity of data and will need modification before their properties can be used in the drug–adverse event association problem.

Notes

Compliance with Ethical Standards

Funding

This project was funded by the Florida Department of Health Ed and Ethel Moore Alzheimer’s Disease Research Program (Grant number 7AZ23), and the University of South Florida Proposal Enhancement Grant to Feng Cheng. These grants enabled all the FAERS data submissions to be combined and stored in a local database.

Conflicts of interest

Minh Pham, Feng Cheng, and Kandethody Ramachandran have no conflicts of interest that are directly relevant to the content of this study.