Overview of Funding

We also looked at the total NIH funding for each year from 2013-2018. We see that funding increased until it leveled off in 2016 and there is a projected decrease for 2018.

final_NIH_funding %>%
  group_by(year) %>%
  summarize(total_funding = sum(funding, na.rm = TRUE)) %>%
  plot_ly(x = ~ year, y = ~ total_funding, mode = "lines", type = "scatter") %>% 
  layout(title = "Change in Total NIH Funding (millions USD) over Time")

Does Funding Correlate with Drug Prices?

Then, we looked to determine whether funding correlated with drug prices. This does not really seem to be the case from the scatterplot. Along the entire range of funding amounts, most drugs still have relatively low prices. It is interesting that different drugs that treat the same disease seem to be largely clustered together.

We can see one breast cancer drug, Afinitor, which actually can be used to treat various types of cancers including kidney, pancreas, breast, and brain cancer, has an unusually high price. We think these other applications are what are influencing the price. Another unusual drug is Belviq, used to treat obesity.

This drug has a very low price, which we think is because of the extremely high prevalence of obesity or other factors such as competitors.

#does funding correlate with drug prices?

final_dataset %>%
  group_by(disease, trade_name) %>%
  summarize(avg_funding = mean(funding, na.rm = TRUE), 
            avg_price_disease = mean(avg_price, na.rm = TRUE)) %>%
  plot_ly(x = ~ avg_funding, y = ~ avg_price_disease, type = "scatter", color = ~disease, text = ~paste('Drug: ', trade_name, '<br> Disease:', disease)) %>%
  layout(title = "Average Funding vs Average Price", showlegend = FALSE)

Is there a relationship between NIH Funding and Prevalence (demand for drug)?

We also created a scatter plot of the mean NIH funding per drug (assumed to be the same as the NIH funding for disease research) against the demand of the drug (defined as the 2015 disease prevalence). Obesity seems to have received the most funding on average and to have the highest 2015 prevalence. This may be due to Michelle Obama’s Let’s Move initiative to fight childhood obesity.

We looked to explore whether there was a relationship between prevalence and funding. While the number of diseases with the required complete data was relatively small, it appears from the scatterplot that when you have a low prevalence, there is a wide range of average funding. As the prevalence increases the funding amount generally seems to be on the lower side. However, if you look more closely at which diseases are included, some interesting observations can be made.

Many of the diseases with high funding are cancers, which have a low prevalence. Most cancers have large subgroups with no good treatment options, so it makes sense that there is a lot of research focused on cancer and new drug development. Some diseases with higher prevalence but lower funding are hypertension, migraines, ADD, and COPD, which do already have treatment options available on the market.

final_dataset %>% 
  filter(pricing_unit == "EA" & !is.na(funding) & !is.na(prevalence)) %>% 
  group_by(trade_name, disease, prevalence) %>% 
  summarize(mean_fund = mean(funding)) %>% 
  plot_ly(x = ~prevalence, y = ~mean_fund, type = "scatter", alpha = 1, text = ~paste("Drug: ", trade_name, '<br>Disease:', disease), color = ~disease) %>%
  layout(title = "Funding (millions USD) vs Prevalence", showlegend = FALSE)

Disease & Funding

  • Does funding differ by disease?

  • There are 27 unique diseases in our dataset. Obesity has the highest mean funding of $906.2 million a year. Breast cancer has the second highest mean funding of $675.98 million per year. Alzheimer’s Disease has the third highest mean funding of $646 million per year. Paget’s Disease, a disease that disrupts the replacement of old bone tissue with new bone tissue, received the lowest mean funding of $1 million per year. Other diseases that received the lowest mean funding are Psoriasis ($16.7 million/year) and Migraines ($19.2 million/year). Obesity, Paget’s Disease, Psoriasis and Migraines are acute conditions, while Breast cancer and Alzheimer’s Disease are chronic conditions.

# final_dataset = read.csv("./data/final_dataset.csv")
#shiny application: funding by disease
final_dataset %>%
  filter(!is.na(disease) & !is.na(funding)) %>%
  mutate(disease = factor(disease)) %>%
  group_by(disease) %>%
  summarize(mean_funding = mean(funding)) %>%
  mutate(disease = fct_reorder(disease, mean_funding)) %>%
  plot_ly(x = ~disease, y = ~mean_funding, color = ~disease, type = "bar", colors = "Set3") %>% 
  layout(title = "NIH Funding per Disease (millions USD)", xaxis = ax)