The growing challenges and opportunities of Big Data for Public Health have revealed the potential to improve the efficiency and cost-effectiveness of public policy, for example through better targeting of… Click to show full abstract
The growing challenges and opportunities of Big Data for Public Health have revealed the potential to improve the efficiency and cost-effectiveness of public policy, for example through better targeting of resources with regard to General Practice (GP) prescribing. Open data has an important role due to its easy access and potential to complement proprietary data sources from, e.g., regional hospitals, and also itself be complemented with social data acquired by specialized approaches. MIDAS pipeline of open source tools aiming integrating, analysing and visualising Open Data enabling health professionals and decision-makers to: (i) improve the usability of open data in combination with proprietary data through combining multiple visualisation tools in an integrated dashboard (ii) to explore the meaning of data in a global/local context based on new information using tone analysis and natural language techniques; and (iii) to have better informed decision-making based on evidence from trusted knowledge-bases. Specific data sources used have included information extracted from the biomedical database MEDLINE, worldwide news and government open data. Social media sources have also been used to gather information from the general public. Results include a strong correlation between antidepressant prescribing and economic deprivation, and a wide variation in how individual GP practices respond to demographic conditions. Automated anomaly detection based on the Local Outlier Probability has also been shown to be an easily understood and controllable approach to identifying prescribing outliers. MIDAS demonstrates the significant value of open data from heterogeneous sources as basis decision-making in public health and healthcare, particularly when it is combined with proprietary or closed datasets. A key challenge in this regard is the ability to integrate and utilize data from diverse sources in a variety of formats and standards. MIDAS is exemplar on tackling the need for improved standards of open data, and new software architectures, tools and platforms addressing a complex ecosystem of heterogenous data sources and formats. MIDAS demonstrates the significant value of open data from heterogeneous sources as basis decision-making in public health and healthcare, particularly when combined with proprietary datasets.
               
Click one of the above tabs to view related content.