1
|
Yang K, Fan S, Deng J, Xia J, Hu X, Yu L, Wang B, Yu W. Public concerns analysis and early warning of Mpox based on network data platforms-taking Baidu and WeChat as example. Front Public Health 2025; 13:1523408. [PMID: 40109434 PMCID: PMC11919868 DOI: 10.3389/fpubh.2025.1523408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2024] [Accepted: 02/17/2025] [Indexed: 03/22/2025] Open
Abstract
With the outbreak of Mpox in non-endemic countries in May 2022, which has captured international attention. In response, this study leveraged the real-time, predictive, and wide coverage advantages of big data to reflect the public's needs and interests regarding the Mpox epidemic, and explore its potential early warning role. We carried out a systematic data search weekly on two major network data platforms-Baidu Search Index (BDI) and WeChat Search Index (WCI) in China, and the index data overview, main concern information, hotspot regional distribution were analyzed. Besides, the correlation between the search index and the number of new cases of Mpox globally and within China were also investigated. Our results showed that both BDI and WCI mirrored the trends of the Mpox epidemic, with peaks in interest aligning with the release of relevant policies and events. The public's interest evolved from basic knowledge of the disease to a focus on treatment and prevention, with attentiveness centrally placed in economically developed areas such as Guangdong, Beijing, and Shanghai. A positive correlation was observed between the Chinese epidemic and the BDI (r = 0.372, p = 0.047) and WCI (r = 0.398, p = 0.044), whereas non-correlation was noted globally. Notably, when the search time was delayed by 1 week, both BDI and WCI showed a positive correlation with the epidemic in China and globally. Overall, the integrated use of big data offers a platform for rapid understanding public concerns and early warning signs of emerging infectious diseases such as Mpox.
Collapse
Affiliation(s)
- Kai Yang
- Chengdu Workstation for Emerging Infectious Disease Control and Prevention, Chinese Academy of Medical Sciences, Chengdu Center for Disease Control and Prevention, Chengdu, China
| | - Shuangfeng Fan
- Chengdu Workstation for Emerging Infectious Disease Control and Prevention, Chinese Academy of Medical Sciences, Chengdu Center for Disease Control and Prevention, Chengdu, China
| | - Jiali Deng
- Department of Orthopaedics, The First Affiliated Hospital of Chengdu Medical College, Sichuan, China
| | - Jinjie Xia
- Chengdu Workstation for Emerging Infectious Disease Control and Prevention, Chinese Academy of Medical Sciences, Chengdu Center for Disease Control and Prevention, Chengdu, China
| | - Xiaoyuan Hu
- Emergency Response Office, Xinjiang Uighur Autonomous Region Center for Disease Control and Prevention, Urumqi, China
| | - Linlin Yu
- Chengdu Workstation for Emerging Infectious Disease Control and Prevention, Chinese Academy of Medical Sciences, Chengdu Center for Disease Control and Prevention, Chengdu, China
| | - Bin Wang
- Comprehensive Emergency Office, Center for Disease Control and Prevention of Qingbaijiang District, Chengdu, China
| | - Wei Yu
- Comprehensive Emergency Office, Center for Disease Control and Prevention of Qingbaijiang District, Chengdu, China
| |
Collapse
|
2
|
Deng JL, Yang K, Zhang S, Wang B, Zhang L, Zhao X. Discussion of the public interest in arthroscopy based on the Baidu index and its implications for nursing care. World J Orthop 2025; 16:101895. [PMID: 40027958 PMCID: PMC11866108 DOI: 10.5312/wjo.v16.i2.101895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Revised: 01/06/2025] [Accepted: 01/21/2025] [Indexed: 02/12/2025] Open
Abstract
BACKGROUND Despite the widespread application of big data in topic analysis, the public's attention and nursing requirements for arthroscopy remain inadequate. AIM To understand netizens' concerns and spatial distributions regarding arthroscopy and to provide customized nursing strategies. METHODS The Baidu index was employed to gather and analyze the search index, demand graph, keyword popularity, and regional distribution data for the keywords "arthroscopy," "knee arthroscopy," and "arthroscopy surgery" from 2018 to 2023. RESULTS A total of 254692 items of information were searched for these keywords, with 59.86% from mobile terminals. Netizens' interest in arthroscopy showed a fluctuating pattern, which was consistent with fluctuations in the elasticity coefficient, and was primarily concentrated in the provinces of Guangdong, Jiangsu, and Shandong. CONCLUSION The Baidu index provides new avenues for exploring public demand for arthroscopy. Nursing personnel can utilize these data to develop more precise health education plans and guidance, enhancing the quality and satisfaction of patient care.
Collapse
Affiliation(s)
- Jia-Li Deng
- Department of Orthopaedics, The First Affiliated Hospital of Chengdu Medical College, Chengdu 610500, Sichuan Province, China
| | - Kai Yang
- Emergency and Business Management Office, Chengdu Center for Disease Control and Prevention, Chengdu 610041, Sichuan Province, China
| | - Shuai Zhang
- Department of Anesthesiology, The First Affiliated Hospital of Chengdu Medical College, Chengdu 610500, Sichuan Province, China
| | - Bin Wang
- Comprehensive Emergency Office, Qingbaijiang District Center for Disease Control and Prevention of Chengdu, Chengdu 610300, Sichuan Province, China
| | - Li Zhang
- Department of Orthopaedics, The First Affiliated Hospital of Chengdu Medical College, Chengdu 610500, Sichuan Province, China
| | - Xia Zhao
- Department of Orthopaedics, The First Affiliated Hospital of Chengdu Medical College, Chengdu 610500, Sichuan Province, China
| |
Collapse
|
3
|
Albrecht S, Broderick D, Dost K, Cheung I, Nghiem N, Wu M, Zhu J, Poonawala-Lohani N, Jamison S, Rasanathan D, Huang S, Trenholme A, Stanley A, Lawrence S, Marsh S, Castelino L, Paynter J, Turner N, McIntyre P, Riddle P, Grant C, Dobbie G, Wicker JS. Forecasting severe respiratory disease hospitalizations using machine learning algorithms. BMC Med Inform Decis Mak 2024; 24:293. [PMID: 39379946 PMCID: PMC11462891 DOI: 10.1186/s12911-024-02702-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Accepted: 09/30/2024] [Indexed: 10/10/2024] Open
Abstract
BACKGROUND Forecasting models predicting trends in hospitalization rates have the potential to inform hospital management during seasonal epidemics of respiratory diseases and the associated surges caused by acute hospital admissions. Hospital bed requirements for elective surgery could be better planned if it were possible to foresee upcoming peaks in severe respiratory illness admissions. Forecasting models can also guide the use of intervention strategies to decrease the spread of respiratory pathogens and thus prevent local health system overload. In this study, we explore the capability of forecasting models to predict the number of hospital admissions in Auckland, New Zealand, within a three-week time horizon. Furthermore, we evaluate probabilistic forecasts and the impact on model performance when integrating laboratory data describing the circulation of respiratory viruses. METHODS The dataset used for this exploration results from active hospital surveillance, in which the World Health Organization Severe Acute Respiratory Infection (SARI) case definition was consistently used. This research nurse-led surveillance has been implemented in two public hospitals in Auckland and provides a systematic laboratory testing of SARI patients for nine respiratory viruses, including influenza, respiratory syncytial virus, and rhinovirus. The forecasting strategies used comprise automatic machine learning, one of the most recent generative pre-trained transformers, and established artificial neural network algorithms capable of univariate and multivariate forecasting. RESULTS We found that machine learning models compute more accurate forecasts in comparison to naïve seasonal models. Furthermore, we analyzed the impact of reducing the temporal resolution of forecasts, which decreased the model error of point forecasts and made probabilistic forecasting more reliable. An additional analysis that used the laboratory data revealed strong season-to-season variations in the incidence of respiratory viruses and how this correlates with total hospitalization cases. These variations could explain why it was not possible to improve forecasts by integrating this data. CONCLUSIONS Active SARI surveillance and consistent data collection over time enable these data to be used to predict hospital bed utilization. These findings show the potential of machine learning as support for informing systems for proactive hospital management.
Collapse
Affiliation(s)
- Steffen Albrecht
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand.
| | - David Broderick
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | - Katharina Dost
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | - Isabella Cheung
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | - Nhung Nghiem
- Australian National University, 131 Garran Rd, Acton, Canberra ACT, 2601, Australia
| | - Milton Wu
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | - Johnny Zhu
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | | | - Sarah Jamison
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | | | - Sue Huang
- Institute of Environmental Science and Research, 34 Kenepuru Drive, Kenepuru, Porirua, 5022, New Zealand
| | - Adrian Trenholme
- Health New Zealand Counties Manukau, Middlemore Hospital, 100 Hospital Road, Auckland, 2025, New Zealand
| | - Alicia Stanley
- Health New Zealand Te Toka Tumai Auckland, Auckland City Hospital, 2 Park Road, Auckland, 1023, New Zealand
| | - Shirley Lawrence
- Health New Zealand Counties Manukau, Middlemore Hospital, 100 Hospital Road, Auckland, 2025, New Zealand
| | - Samantha Marsh
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | | | - Janine Paynter
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | - Nikki Turner
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | - Peter McIntyre
- University of Otago, 362 Leith Street, Dunedin, 9016, New Zealand
| | - Patricia Riddle
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand
| | - Cameron Grant
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand.
| | - Gillian Dobbie
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand.
| | - Jörg Simon Wicker
- University of Auckland, 20 Symonds Street, Auckland, 1010, New Zealand.
| |
Collapse
|
4
|
Wang Y, Zhou H, Zheng L, Li M, Hu B. Using the Baidu index to predict trends in the incidence of tuberculosis in Jiangsu Province, China. Front Public Health 2023; 11:1203628. [PMID: 37533520 PMCID: PMC10390734 DOI: 10.3389/fpubh.2023.1203628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 07/05/2023] [Indexed: 08/04/2023] Open
Abstract
Objective To analyze the time series in the correlation between search terms related to tuberculosis (TB) and actual incidence data in China. To screen out the "leading" terms and construct a timely and efficient TB prediction model that can predict the next wave of TB epidemic trend in advance. Methods Monthly incidence data of tuberculosis in Jiangsu Province, China, were collected from January 2011 to December 2020. A scoping approach was used to identify TB search terms around common TB terms, prevention, symptoms and treatment. Search terms for Jiangsu Province, China, from January 2011 to December 2020 were collected from the Baidu index database. Correlation coefficients between search terms and actual incidence were calculated using Python 3.6 software. The multiple linear regression model was constructed using SPSS 26.0 software, which also calculated the goodness of fit and prediction error of the model predictions. Results A total of 16 keywords with correlation coefficients greater than 0.6 were screened, of which 11 were the leading terms. The R2 of the prediction model was 0.67 and the MAPE was 10.23%. Conclusion The TB prediction model based on Baidu Index data was able to predict the next wave of TB epidemic trends and intensity 2 months in advance. This forecasting model is currently only available for Jiangsu Province.
Collapse
|
5
|
Wang H, Ye H, Liu L. Constructing big data prevention and control model for public health emergencies in China: A grounded theory study. Front Public Health 2023; 11:1112547. [PMID: 37006539 PMCID: PMC10060899 DOI: 10.3389/fpubh.2023.1112547] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 02/27/2023] [Indexed: 03/18/2023] Open
Abstract
Big data technology plays an important role in the prevention and control of public health emergencies such as the COVID-19 pandemic. Current studies on model construction, such as SIR infectious disease model, 4R crisis management model, etc., have put forward decision-making suggestions from different perspectives, which also provide a reference basis for the research in this paper. This paper conducts an exploratory study on the construction of a big data prevention and control model for public health emergencies by using the grounded theory, a qualitative research method, with literature, policies, and regulations as research samples, and makes a grounded analysis through three-level coding and saturation test. Main results are as follows: (1) The three elements of data layer, subject layer, and application layer play a prominent role in the digital prevention and control practice of epidemic in China and constitute the basic framework of the “DSA” model. (2) The “DSA” model integrates cross-industry, cross-region, and cross-domain epidemic data into one system framework, effectively solving the disadvantages of fragmentation caused by “information island”. (3) The “DSA” model analyzes the differences in information needs of different subjects during an outbreak and summarizes several collaborative approaches to promote resource sharing and cooperative governance. (4) The “DSA” model analyzes the specific application scenarios of big data technology in different stages of epidemic development, effectively responding to the disconnection between current technological development and realistic needs.
Collapse
Affiliation(s)
- Huiquan Wang
- School of Politics and Public Administration, China University of Political Science and Law, Beijing, China
| | - Hong Ye
- School of Foreign Studies, China University of Political Science and Law, Beijing, China
- *Correspondence: Hong Ye
| | - Lu Liu
- School of Engineering and Technology, China University of Geosciences, Beijing, China
| |
Collapse
|
6
|
Hammond A, Kim JJ, Sadler H, Vandemaele K. Influenza surveillance systems using traditional and alternative sources of data: A scoping review. Influenza Other Respir Viruses 2022; 16:965-974. [PMID: 36073312 PMCID: PMC9530542 DOI: 10.1111/irv.13037] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 08/03/2022] [Accepted: 08/04/2022] [Indexed: 11/28/2022] Open
Abstract
OBJECTIVE While the World Health Organization's recommendation of syndromic sentinel surveillance for influenza is an efficient method to collect high-quality data, limitations exist. Aligned with the Research Recommendation 1.1.2 of the WHO Public Health Research Agenda for Influenza-to identify reliable complementary influenza surveillance systems which provide real-time estimates of influenza activity-we performed a scoping review to map the extent and nature of published literature on the use of non-traditional sources of syndromic surveillance data for influenza. METHODS We searched three electronic databases (PubMed, Web of Science, and Scopus) for articles in English, French, and Spanish, published between January 1 2007 and January 28 2022. Studies were included if they directly compared at least one non-traditional with a traditional influenza surveillance system in terms of correlation in activity or timeliness. FINDINGS We retrieved 823 articles of which 57 were included for analysis. Fifteen articles considered electronic health records (EHR), 11 participatory surveillance, 10 online searches and webpage traffic, seven Twitter, five absenteeism, four telephone health lines, three medication sales, two media reporting, and five looked at other miscellaneous sources of data. Several articles considered more than one non-traditional surveillance method. CONCLUSION We identified eight categories and a miscellaneous group of non-traditional influenza surveillance systems with varying levels of evidence on timeliness and correlation to traditional surveillance systems. Analyses of EHR and participatory surveillance systems appeared to have the most agreement on timeliness and correlation to traditional systems. Studies suggested non-traditional surveillance systems as complements rather than replacements to traditional systems.
Collapse
Affiliation(s)
- Aspen Hammond
- Global Influenza Programme, World Health OrganizationGenevaSwitzerland
| | - John J. Kim
- Global Influenza Programme, World Health OrganizationGenevaSwitzerland
- School of PharmacyUniversity of WaterlooKitchenerOntarioCanada
| | - Holly Sadler
- Global Influenza Programme, World Health OrganizationGenevaSwitzerland
| | | |
Collapse
|
7
|
Uda K, Hagiya H, Yorifuji T, Koyama T, Tsuge M, Yashiro M, Tsukahara H. Correlation between national surveillance and search engine query data on respiratory syncytial virus infections in Japan. BMC Public Health 2022; 22:1517. [PMID: 35945532 PMCID: PMC9363139 DOI: 10.1186/s12889-022-13899-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Accepted: 07/25/2022] [Indexed: 11/10/2022] Open
Abstract
Background The respiratory syncytial virus (RSV) disease burden is significant, especially in infants and children with an underlying disease. Prophylaxis with palivizumab is recommended for these high-risk groups. Early recognition of a RSV epidemic is important for timely administration of palivizumab. We herein aimed to assess the correlation between national surveillance and Google Trends data pertaining to RSV infections in Japan. Methods The present, retrospective survey was performed between January 1, 2018 and November 14, 2021 and evaluated the correlation between national surveillance data and Google Trends data. Joinpoint regression was used to identify the points at which changes in trends occurred. Results A strong correlation was observed every study year (2018 [r = 0.87, p < 0.01], 2019 [r = 0.83, p < 0.01], 2020 [r = 0.83, p < 0.01], and 2021 [r = 0.96, p < 0.01]). The change-points in the Google Trends data indicating the start of the RSV epidemic were observed earlier than by sentinel surveillance in 2018 and 2021 and simultaneously with sentinel surveillance in 2019. No epidemic surge was observed in either the Google Trends or the surveillance data from 2020. Conclusions Our data suggested that Google Trends has the potential to enable the early identification of RSV epidemics. In countries without a national surveillance system, Google Trends may serve as an alternative early warning system. Supplementary Information The online version contains supplementary material available at 10.1186/s12889-022-13899-y.
Collapse
Affiliation(s)
- Kazuhiro Uda
- Department of Pediatrics, Okayama University Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences, 2-5-1 Shikata, Okayama, 700-8558, Japan. .,Department of Pediatrics, Okayama University Hospital, 2-5-1 Shikata, Okayama, 700-8558, Japan.
| | - Hideharu Hagiya
- Department of General Medicine, Okayama University Graduate School of Medicine, Dentistry, and Pharmaceutical Science, 2-5-1 Shikata, Okayama, 700-8558, Japan
| | - Takashi Yorifuji
- Department of Epidemiology, Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences, Okayama University, 2-5-1 Shikata, Okayama, 700-8558, Japan
| | - Toshihiro Koyama
- Department of Health Data Science, Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences, Okayama University, 2-5-1 Shikata, Okayama, 700-8558, Japan
| | - Mitsuru Tsuge
- Department of Pediatrics Acute Diseases, Okayama University Academic Field of Medicine, Dentistry, and Pharmaceutical Science, 2-5-1 Shikata, Okayama, 700-8558, Japan
| | - Masato Yashiro
- Department of Pediatrics, Okayama University Hospital, 2-5-1 Shikata, Okayama, 700-8558, Japan
| | - Hirokazu Tsukahara
- Department of Pediatrics, Okayama University Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences, 2-5-1 Shikata, Okayama, 700-8558, Japan
| |
Collapse
|
8
|
Katayama Y, Kiyohara K, Hirose T, Ishida K, Tachino J, Nakao S, Noda T, Ojima M, Kiguchi T, Matsuyama T, Kitamura T. An Association of Influenza Epidemics in Children With Mobile App Data: Population-Based Observational Study in Osaka, Japan. JMIR Form Res 2022; 6:e31131. [PMID: 35142628 PMCID: PMC8874815 DOI: 10.2196/31131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 08/28/2021] [Accepted: 12/22/2021] [Indexed: 11/30/2022] Open
Abstract
Background Early surveillance to prevent the spread of influenza is a major public health concern. If there is an association of influenza epidemics with mobile app data, it may be possible to forecast influenza earlier and more easily. Objective We aimed to assess the relationship between seasonal influenza and the frequency of mobile app use among children in Osaka Prefecture, Japan. Methods This was a retrospective observational study that was performed over a three-year period from January 2017 to December 2019. Using a linear regression model, we calculated the R2 value of the regression model to evaluate the relationship between the number of “fever” events selected in the mobile app and the number of influenza patients ≤14 years of age. We conducted three-fold cross-validation using data from two years as the training data set and the data of the remaining year as the test data set to evaluate the validity of the regression model. And we calculated Spearman correlation coefficients between the calculated number of influenza patients estimated using the regression model and the number of influenza patients, limited to the period from December to April when influenza is prevalent in Japan. Results We included 29,392 mobile app users. The R2 value for the linear regression model was 0.944, and the adjusted R2 value was 0.915. The mean Spearman correlation coefficient for the three regression models was 0.804. During the influenza season (December–April), the Spearman correlation coefficient between the number of influenza patients and the calculated number estimated using the linear regression model was 0.946 (P<.001). Conclusions In this study, the number of times that mobile apps were used was positively associated with the number of influenza patients. In particular, there was a good association of the number of influenza patients with the number of “fever” events selected in the mobile app during the influenza epidemic season.
Collapse
Affiliation(s)
- Yusuke Katayama
- Department of Traumatology and Acute Critical Medicine, Osaka University Graduate School of Medicine, Suita, Japan
| | - Kosuke Kiyohara
- Department of Food Science, Faculty of Home Economics, Otsuma Women's University, Tokyo, Japan
| | - Tomoya Hirose
- Department of Traumatology and Acute Critical Medicine, Osaka University Graduate School of Medicine, Suita, Japan
| | - Kenichiro Ishida
- Department of Acute Medicine and Critical Care Center, Osaka National Hospital, National Hospital Organization, Osaka, Japan
| | - Jotaro Tachino
- Department of Traumatology and Acute Critical Medicine, Osaka University Graduate School of Medicine, Suita, Japan
| | - Shunichiro Nakao
- Department of Traumatology and Acute Critical Medicine, Osaka University Graduate School of Medicine, Suita, Japan
| | - Tomohiro Noda
- Department of Traumatology and Critical Care Medicine, Osaka City University Graduate School of Medicine, Osaka, Japan
| | - Masahiro Ojima
- Department of Acute Medicine and Critical Care Center, Osaka National Hospital, National Hospital Organization, Osaka, Japan
| | - Takeyuki Kiguchi
- Division of Trauma and Surgical Critical Care, Osaka General Medical Center, Osaka, Japan
| | - Tasuku Matsuyama
- Department of Emergency Medicine, Kyoto Prefectural University of Medicine, Kyoto, Japan
| | - Tetsuhisa Kitamura
- Division of Environmental Medicine and Population Sciences, Department of Social and Environmental Medicine, Osaka University Graduate School of Medicine, Suita, Japan
| |
Collapse
|
9
|
Jia S, She W, Pi Z, Niu B, Zhang J, Lin X, Xu M, She W, Liao J. Effectiveness of cascading time series models based on meteorological factors in improving health risk prediction. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:9944-9956. [PMID: 34510340 DOI: 10.1007/s11356-021-16372-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 09/02/2021] [Indexed: 06/13/2023]
Abstract
Meteorological factors, which are periodic and regular in a long run, have an unignorable impact on human health. Accurate health risk prediction based on meteorological factors is essential for optimal allocation of resource in healthcare units. However, due to the non-stationary and non-linear nature of the original hospitalization sequence, traditional methods are less robust in predicting it. This study aims to investigate hospital admission prediction models using time series pre-processing algorithms and deep learning approach based on meteorological factors. Using the electronic medical record data from Panyu Central Hospital and meteorological data of Panyu district from 2003 to 2019, 46,089 eligible patients with lower respiratory tract infections (LRTIs) and four meteorological factors were identified to build and evaluate the prediction models. A novel hybrid model, Cascade GAM-CEEMDAN-LSTM Model (CGCLM), was established in combination with generalized additive model (GAM), complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), and long-short term memory (LSTM) networks for predicting daily admissions of patients with LRTIs. The experimental results show that CGCLM multistep method proposed in this paper outperforms single LSTM model in the prediction of health risk time series at different time window sizes. Moreover, our results also indicate that CGCLM has the best prediction performance when the time window is set to 61 days (RMSE = 1.12, MAE = 0.87, R2 = 0.93). Adequate extraction of exposure-response relationships between meteorological factors and diseases and suitable handling of sequence pre-processing have an important role in time series prediction. This hybrid climate-based model for predicting LRTIs disease can also be extended to time series prediction of other epidemic disease.
Collapse
Affiliation(s)
- Shuopeng Jia
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, #639 Longmian Avenue, Jiangning District, 211198, Nanjing, Jiangsu Province, China
| | - Weibin She
- Medical Affairs, Science and Education Department, Foshan Fosun Chancheng Hospital, #3 Sanyou South Road, Chancheng District, Foshan, Guangdong Province, 52800, China
| | - Zhipeng Pi
- School of Pharmacy, China Pharmaceutical University, #639 Longmian Avenue, Jiangning District, Nanjing, Jiangsu Province, 211198, China
| | - Buying Niu
- School of Science, China Pharmaceutical University, #639 Longmian Avenue, Jiangning District, Nanjing, Jiangsu Province, 211198, China
| | - Jinhua Zhang
- Meteorological Bureau of Panyu District, #5 Landscape Avenue, Hengjiang village, Shatou Street, Panyu District, 511400, Guangzhou, Guangdong Province, China
| | - Xihan Lin
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, #639 Longmian Avenue, Jiangning District, 211198, Nanjing, Jiangsu Province, China
| | - Mingjun Xu
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, #639 Longmian Avenue, Jiangning District, 211198, Nanjing, Jiangsu Province, China
| | - Weiya She
- Meteorological Bureau of Panyu District, #5 Landscape Avenue, Hengjiang village, Shatou Street, Panyu District, 511400, Guangzhou, Guangdong Province, China
| | - Jun Liao
- School of Science, China Pharmaceutical University, #639 Longmian Avenue, Jiangning District, Nanjing, Jiangsu Province, 211198, China.
| |
Collapse
|
10
|
Lee K, Ray J, Safta C. The predictive skill of convolutional neural networks models for disease forecasting. PLoS One 2021; 16:e0254319. [PMID: 34242349 PMCID: PMC8270135 DOI: 10.1371/journal.pone.0254319] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 06/24/2021] [Indexed: 11/18/2022] Open
Abstract
In this paper we investigate the utility of one-dimensional convolutional neural network (CNN) models in epidemiological forecasting. Deep learning models, in particular variants of recurrent neural networks (RNNs) have been studied for ILI (Influenza-Like Illness) forecasting, and have achieved a higher forecasting skill compared to conventional models such as ARIMA. In this study, we adapt two neural networks that employ one-dimensional temporal convolutional layers as a primary building block-temporal convolutional networks and simple neural attentive meta-learners-for epidemiological forecasting. We then test them with influenza data from the US collected over 2010-2019. We find that epidemiological forecasting with CNNs is feasible, and their forecasting skill is comparable to, and at times, superior to, plain RNNs. Thus CNNs and RNNs bring the power of nonlinear transformations to purely data-driven epidemiological models, a capability that heretofore has been limited to more elaborate mechanistic/compartmental disease models.
Collapse
Affiliation(s)
- Kookjin Lee
- Computing, Informatics and Decision Systems Engineering, Arizona State University, Tempe, AZ, United States of America
- Extreme-Scale Data Science and Analytics, Sandia National Laboratories, Livermore, CA, United States of America
| | - Jaideep Ray
- Extreme-Scale Data Science and Analytics, Sandia National Laboratories, Livermore, CA, United States of America
| | - Cosmin Safta
- Quantitative Modeling and Analysis, Sandia National Laboratories, Livermore, CA, United States of America
| |
Collapse
|
11
|
Abstract
Influenza forecasting in the United States (US) is complex and challenging due to spatial and temporal variability, nested geographic scales of interest, and heterogeneous surveillance participation. Here we present Dante, a multiscale influenza forecasting model that learns rather than prescribes spatial, temporal, and surveillance data structure and generates coherent forecasts across state, regional, and national scales. We retrospectively compare Dante's short-term and seasonal forecasts for previous flu seasons to the Dynamic Bayesian Model (DBM), a leading competitor. Dante outperformed DBM for nearly all spatial units, flu seasons, geographic scales, and forecasting targets. Dante's sharper and more accurate forecasts also suggest greater public health utility. Dante placed 1st in the Centers for Disease Control and Prevention's prospective 2018/19 FluSight challenge in both the national and regional competition and the state competition. The methodology underpinning Dante can be used in other seasonal disease forecasting contexts having nested geographic scales of interest.
Collapse
Affiliation(s)
- Dave Osthus
- Los Alamos National Laboratory, Statistical Sciences Group, Los Alamos, NM, USA.
| | - Kelly R Moran
- Los Alamos National Laboratory, Statistical Sciences Group, Los Alamos, NM, USA.,Department of Statistical Science, Duke University, Durham, NC, USA
| |
Collapse
|
12
|
Improving influenza surveillance based on multi-granularity deep spatiotemporal neural network. Comput Biol Med 2021; 134:104482. [PMID: 34051452 DOI: 10.1016/j.compbiomed.2021.104482] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Revised: 04/16/2021] [Accepted: 05/06/2021] [Indexed: 11/23/2022]
Abstract
Influenza is a common respiratory disease that can cause human illness and death. Timely and accurate prediction of disease risk is of great importance for public health management and prevention. The influenza data belong to typical spatiotemporal data in that influenza transmission is influenced by regional and temporal interactions. Many existing methods only use the historical time series information for prediction, which ignores the effect of spatial correlations of neighboring regions and temporal correlations of different time periods. Mining spatiotemporal information for risk prediction is a significant and challenging issue. In this paper, we propose a new end-to-end spatiotemporal deep neural network structure for influenza risk prediction. The proposed model mainly consists of two parts. The first stage is the spatiotemporal feature extraction stage where two-stream convolutional and recurrent neural networks are constructed to extract the different regions and time granularity information. Then, a dynamically parametric-based fusion method is adopted to integrate the two-stream features and making predictions. In our work, we demonstrate that our method, tested on two influenza-like illness (ILI) datasets (US-HHS and SZ-HIC), achieved the best performance across all evaluation metrics. The results imply that our method has outstanding performance for spatiotemporal feature extraction and enables accurate predictions compared to other well-known influenza forecasting models.
Collapse
|
13
|
Use Internet search data to accurately track state level influenza epidemics. Sci Rep 2021; 11:4023. [PMID: 33597556 PMCID: PMC7889878 DOI: 10.1038/s41598-021-83084-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 01/28/2021] [Indexed: 11/22/2022] Open
Abstract
For epidemics control and prevention, timely insights of potential hot spots are invaluable. Alternative to traditional epidemic surveillance, which often lags behind real time by weeks, big data from the Internet provide important information of the current epidemic trends. Here we present a methodology, ARGOX (Augmented Regression with GOogle data CROSS space), for accurate real-time tracking of state-level influenza epidemics in the United States. ARGOX combines Internet search data at the national, regional and state levels with traditional influenza surveillance data from the Centers for Disease Control and Prevention, and accounts for both the spatial correlation structure of state-level influenza activities and the evolution of people’s Internet search pattern. ARGOX achieves on average 28% error reduction over the best alternative for real-time state-level influenza estimation for 2014 to 2020. ARGOX is robust and reliable and can be potentially applied to track county- and city-level influenza activity and other infectious diseases.
Collapse
|
14
|
Shan S, Yan Q, Wei Y. Infectious or Recovered? Optimizing the Infectious Disease Detection Process for Epidemic Control and Prevention Based on Social Media. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E6853. [PMID: 32961734 PMCID: PMC7559250 DOI: 10.3390/ijerph17186853] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 09/11/2020] [Accepted: 09/16/2020] [Indexed: 01/02/2023]
Abstract
Detecting the period of a disease is of great importance to building information management capacity in disease control and prevention. This paper aims to optimize the disease surveillance process by further identifying the infectious or recovered period of flu cases through social media. Specifically, this paper explores the potential of using public sentiment to detect flu periods at word level. At text level, we constructed a deep learning method to classify the flu period and improve the classification result with sentiment polarity. Three important findings are revealed. Firstly, bloggers in different periods express significantly different sentiments. Blogger sentiments in the recovered period are more positive than in the infectious period when measured by the interclass distance. Secondly, the optimized disease detection process can substantially improve the classification accuracy of flu periods from 0.876 to 0.926. Thirdly, our experimental results confirm that sentiment classification plays a crucial role in accuracy improvement. Precise identification of disease periods enhances the channels for the disease surveillance processes. Therefore, a disease outbreak can be predicted credibly when a larger population is monitored. The research method proposed in our work also provides decision making reference for proactive and effective epidemic control and prevention in real time.
Collapse
Affiliation(s)
- Siqing Shan
- School of Economics and Management, Beihang University, Beijing 100191, China; (S.S.); (Y.W.)
- Beijing Key Laboratory of Emergency Support Simulation Technologies for City Operation, Beijing 100191, China
| | - Qi Yan
- School of Economics and Management, Beihang University, Beijing 100191, China; (S.S.); (Y.W.)
- Beijing Key Laboratory of Emergency Support Simulation Technologies for City Operation, Beijing 100191, China
| | - Yigang Wei
- School of Economics and Management, Beihang University, Beijing 100191, China; (S.S.); (Y.W.)
- Beijing Key Laboratory of Emergency Support Simulation Technologies for City Operation, Beijing 100191, China
| |
Collapse
|
15
|
Chen SH, Tzeng IS, Lan CC, Chen JY, Ng CY, Wang YC, Su WL, Yiang GT, Chen TY, Wu CW, Hsieh PC, Kuo CY, Wu MY. Age, Period and Cohort Analysis of Rates of Emergency Department Visits Due to Pneumonia in Taiwan, 1998-2012. Risk Manag Healthc Policy 2020; 13:1459-1466. [PMID: 32943963 PMCID: PMC7481296 DOI: 10.2147/rmhp.s255031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Accepted: 07/25/2020] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Emergency room (ER) physicians need to face clinically suspected pneumonia patients in the front line of medical care and must do to give major medical interventions if patients show severity in pneumonia. METHODS The data of pneumonia-related ER visit rates were categorized based on the International Classification of Disease (ICD) Codes (480-486) between 1998 and 2012. We use an age-period-cohort (APC) model to separate the pneumonia-related ER visit rates to identify the effects of age, time period, and cohort for a total of 1,813,588 patients. RESULTS The age effect showed high risk for pediatric and elder populations. There is a significant increasing period effect, which increased from 1998 to 2012. The cohort effect tended to show an oscillation from 1913 to 1988 and the reverse in a recent cohort. Furthermore, the visit rate of pneumonia showed an increase from 1998 to 2012 for both genders. CONCLUSION Age is a risk factor for pneumonia-related ER visits, especially for children and adolescents and older patients. Period and cohort effects were also found to increase the pneumonia visit rates. An APC model used to provide an advance clue for trend of pneumonia-related ER visit rates diversified.
Collapse
Affiliation(s)
- Shin-Hong Chen
- Department of Education and Research, Taiwan Adventist Hospital, Taipei10556, Taiwan
| | - I-Shiang Tzeng
- Department of Statistics, National Taipei University, Taipei10478, Taiwan
- Department of Applied Mathematics; Department of Exercise and Health Promotion, Chinese Culture University, Taipei11114, Taiwan
- Department of Research, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| | - Chou-Chin Lan
- Division of Pulmonary Medicine, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
- School of Medicine, Tzu-Chi University, Hualien97004, Taiwan
| | - Jau-Yuan Chen
- Department of Family Medicine, Chang-Gung Memorial Hospital at Linkou, Taoyuan33305, Taiwan
| | - Chau Yee Ng
- Department of Dermatology, Drug Hypersensitivity Clinical and Research Center, Chang Gung Memorial Hospital, Taipei, Linkou and Keelung10507, Taiwan
| | - Yao-Chin Wang
- Department of Emergency Medicine, Min-Sheng General Hospital, Taoyuan33044, Taiwan
| | - Wen-Lin Su
- Division of Pulmonary Medicine, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| | - Giou-Teng Yiang
- Department of Emergency Medicine, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| | - Tsu-Yi Chen
- Department of Emergency Medicine, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| | - Chih-Wei Wu
- Division of Pulmonary Medicine, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| | - Po-Chun Hsieh
- Department of Chinese Medicine, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| | - Chan-Yen Kuo
- Department of Research, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| | - Meng-Yu Wu
- Department of Emergency Medicine, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, New Taipei City23142, Taiwan
| |
Collapse
|
16
|
The relationship between seasonal influenza and telephone triage for fever: A population-based study in Osaka, Japan. PLoS One 2020; 15:e0236560. [PMID: 32760164 PMCID: PMC7410252 DOI: 10.1371/journal.pone.0236560] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 07/08/2020] [Indexed: 11/19/2022] Open
Abstract
Background Replacing traditional surveillance with syndromic surveillance is one of the major interests in public health. However, it is unclear whether the number of influenza patients is associated with the number of telephone triages in Japan. Methods This retrospective, observational study was conducted over the six-year period between January 2012 to December 2017. We used the dataset of a telephone triage service in Osaka, Japan and the data on influenza patients published from the Information Center of Infectious Disease in Osaka prefecture. Using a linear regression model, we calculated Spearman’s rank-order coefficient and R2 of the regression model to assess the relationship between the number of telephone triages for fever and the number of influenza patients in Osaka. Furthermore, we calculated Spearman’s rank-order coefficient and R2 between the predicted weekly number of influenza patients from the linear regression model and the actual weekly number of influenza patients for influenza outbreak season (December-April). Results There were 465,971 patients with influenza, and the number of telephone triages for fever was 420,928 among 1,065,628 total telephone triages during the study period. Our analysis showed that the Spearman rank-order coefficient was 0.932, and R2 and adjusted R2 were 0.869 and 0.842, respectively. The Spearman rank-order coefficient was 0.923 (P<0.001) and R2 was 0.832 in December-April (P<0.001). Conclusion We revealed a positive relationship in this population between the number of influenza patients and the number of telephone triages for fever.
Collapse
|
17
|
Huang R, Luo G, Duan Q, Zhang L, Zhang Q, Tang W, Smith MK, Li J, Zou H. Using Baidu search index to monitor and predict newly diagnosed cases of HIV/AIDS, syphilis and gonorrhea in China: estimates from a vector autoregressive (VAR) model. BMJ Open 2020; 10:e036098. [PMID: 32209633 PMCID: PMC7202716 DOI: 10.1136/bmjopen-2019-036098] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
OBJECTIVES Internet search engine data have been widely used to monitor and predict infectious diseases. Existing studies have found correlations between search data and HIV/AIDS epidemics. We aimed to extend the literature through exploring the feasibility of using search data to monitor and predict the number of newly diagnosed cases of HIV/AIDS, syphilis and gonorrhoea in China. METHODS This paper used vector autoregressive model to combine the number of newly diagnosed cases with Baidu search index to predict monthly newly diagnosed cases of HIV/AIDS, syphilis and gonorrhoea in China. The procedures included: (1) keywords selection and filtering; (2) construction of composite search index; (3) modelling with training data from January 2011 to October 2016 and calculating the prediction performance with validation data from November 2016 to October 2017. RESULTS The analysis showed that there was a close correlation between the monthly number of newly diagnosed cases and the composite search index (the Spearman's rank correlation coefficients were 0.777 for HIV/AIDS, 0.590 for syphilis and 0.633 for gonorrhoea, p<0.05 for all). The R2 were all more than 85% and the mean absolute percentage errors were less than 11%, showing the good fitting effect and prediction performance of vector autoregressive model in this field. CONCLUSIONS Our study indicated the potential feasibility of using Baidu search data to monitor and predict the number of newly diagnosed cases of HIV/AIDS, syphilis and gonorrhoea in China.
Collapse
Affiliation(s)
- Ruonan Huang
- School of Public Health, Sun Yat-Sen University, Guangzhou, China
| | - Ganfeng Luo
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, China
| | - Qibin Duan
- The Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Lei Zhang
- China-Australia Joint Research Center for Infectious Diseass, School of Public Health, Xi'an Jiaotong University, Xi'an, China
- Melbourne Sexual Health Centre, Alfred Health, Melbourne, Victoria, Australia
- Central Clinical School, Faculty of Medicine Nursing and Health Sciences, Monash University, Melbourne, Victoria, Australia
- Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Qingpeng Zhang
- School of Data Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Weiming Tang
- University of North Carolina Project China, Guangzhou, China
- Southern Medical University Dermatology Hospital, Guangzhou, China
| | - M Kumi Smith
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota Twin Cities, Minneapolis, Minnesota, USA
| | - Jinghua Li
- School of Public Health, Sun Yat-Sen University, Guangzhou, China
- Sun Yat-sen Global Health Institute, Sun Yat-Sen University, Guangzhou, China
| | - Huachun Zou
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, China
- The Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
18
|
Barros JM, Duggan J, Rebholz-Schuhmann D. The Application of Internet-Based Sources for Public Health Surveillance (Infoveillance): Systematic Review. J Med Internet Res 2020; 22:e13680. [PMID: 32167477 PMCID: PMC7101503 DOI: 10.2196/13680] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2019] [Revised: 09/18/2019] [Accepted: 11/26/2019] [Indexed: 12/30/2022] Open
Abstract
Background Public health surveillance is based on the continuous and systematic collection, analysis, and interpretation of data. This informs the development of early warning systems to monitor epidemics and documents the impact of intervention measures. The introduction of digital data sources, and specifically sources available on the internet, has impacted the field of public health surveillance. New opportunities enabled by the underlying availability and scale of internet-based sources (IBSs) have paved the way for novel approaches for disease surveillance, exploration of health communities, and the study of epidemic dynamics. This field and approach is also known as infodemiology or infoveillance. Objective This review aimed to assess research findings regarding the application of IBSs for public health surveillance (infodemiology or infoveillance). To achieve this, we have presented a comprehensive systematic literature review with a focus on these sources and their limitations, the diseases targeted, and commonly applied methods. Methods A systematic literature review was conducted targeting publications between 2012 and 2018 that leveraged IBSs for public health surveillance, outbreak forecasting, disease characterization, diagnosis prediction, content analysis, and health-topic identification. The search results were filtered according to previously defined inclusion and exclusion criteria. Results Spanning a total of 162 publications, we determined infectious diseases to be the preferred case study (108/162, 66.7%). Of the eight categories of IBSs (search queries, social media, news, discussion forums, websites, web encyclopedia, and online obituaries), search queries and social media were applied in 95.1% (154/162) of the reviewed publications. We also identified limitations in representativeness and biased user age groups, as well as high susceptibility to media events by search queries, social media, and web encyclopedias. Conclusions IBSs are a valuable proxy to study illnesses affecting the general population; however, it is important to characterize which diseases are best suited for the available sources; the literature shows that the level of engagement among online platforms can be a potential indicator. There is a necessity to understand the population’s online behavior; in addition, the exploration of health information dissemination and its content is significantly unexplored. With this information, we can understand how the population communicates about illnesses online and, in the process, benefit public health.
Collapse
Affiliation(s)
- Joana M Barros
- Insight Centre for Data Analytics, National University of Ireland Galway, Galway, Ireland.,School of Computer Science, National University of Ireland Galway, Galway, Ireland
| | - Jim Duggan
- School of Computer Science, National University of Ireland Galway, Galway, Ireland
| | | |
Collapse
|
19
|
Galetsi P, Katsaliaki K. Big data analytics in health: an overview and bibliometric study of research activity. Health Info Libr J 2019; 37:5-25. [DOI: 10.1111/hir.12286] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Accepted: 10/23/2019] [Indexed: 12/16/2022]
Affiliation(s)
- Panagiota Galetsi
- School of Economics, Business Administration & Legal Studies International Hellenic University Thessaloniki Greece
| | - Korina Katsaliaki
- School of Economics, Business Administration & Legal Studies International Hellenic University Thessaloniki Greece
| |
Collapse
|
20
|
Tideman S, Santillana M, Bickel J, Reis B. Internet search query data improve forecasts of daily emergency department volume. J Am Med Inform Assoc 2019; 26:1574-1583. [PMID: 31730701 PMCID: PMC7647136 DOI: 10.1093/jamia/ocz154] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 07/25/2019] [Accepted: 08/06/2019] [Indexed: 11/15/2022] Open
Abstract
OBJECTIVE Emergency departments (EDs) are increasingly overcrowded. Forecasting patient visit volume is challenging. Reliable and accurate forecasting strategies may help improve resource allocation and mitigate the effects of overcrowding. Patterns related to weather, day of the week, season, and holidays have been previously used to forecast ED visits. Internet search activity has proven useful for predicting disease trends and offers a new opportunity to improve ED visit forecasting. This study tests whether Google search data and relevant statistical methods can improve the accuracy of ED volume forecasting compared with traditional data sources. MATERIALS AND METHODS Seven years of historical daily ED arrivals were collected from Boston Children's Hospital. We used data from the public school calendar, National Oceanic and Atmospheric Administration, and Google Trends. Multiple linear models using LASSO (least absolute shrinkage and selection operator) for variable selection were created. The models were trained on 5 years of data and out-of-sample accuracy was judged using multiple error metrics on the final 2 years. RESULTS All data sources added complementary predictive power. Our baseline day-of-the-week model recorded average percent errors of 10.99%. Autoregressive terms, calendar and weather data reduced errors to 7.71%. Search volume data reduced errors to 7.58% theoretically preventing 4 improperly staffed days. DISCUSSION The predictive power provided by the search volume data may stem from the ability to capture population-level interaction with events, such as winter storms and infectious diseases, that traditional data sources alone miss. CONCLUSIONS This study demonstrates that search volume data can meaningfully improve forecasting of ED visit volume and could help improve quality and reduce cost.
Collapse
Affiliation(s)
- Sam Tideman
- Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
| | - Mauricio Santillana
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, Massachusetts, USA
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, USA
| | - Jonathan Bickel
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, Massachusetts, USA
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, USA
| | - Ben Reis
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, Massachusetts, USA
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts, USA
- Predictive Medicine Group, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
21
|
Hosseini S, Karami M, Farhadian M, Mohammadi Y. Seasonal Activity of Influenza in Iran: Application of Influenza-like Illness Data from Sentinel Sites of Healthcare Centers during 2010 to 2015. J Epidemiol Glob Health 2019; 8:29-33. [PMID: 30859784 PMCID: PMC7325813 DOI: 10.2991/j.jegh.2018.08.100] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Accepted: 06/21/2018] [Indexed: 11/26/2022] Open
Abstract
This study aimed to predict seasonal influenza activity and detection of influenza outbreaks. Data of all registered cases (n = 53,526) of influenza-like illnesses (ILIs) from sentinel sites of healthcare centers in Iran were obtained from the FluNet web-based tool, World Health Organization (WHO), from 2010 to 2015. The status of the ILI activity was obtained from the FluNet and considered as the gold standard of the seasonal activity of influenza during the study period. The cumulative sum (CUSUM) as an outbreak detection method was used to predict the seasonal activity of influenza. Also, time series similarity between the ILI trend and CUSUM was assessed using the cross-correlogram. Of 7684 (14%) positive cases of influenza, about 71% were type A virus and 28% were type B virus. The majority of the outbreaks occurred in winter and autumn. Results of the cross-correlogram showed that there was a considerable similarity between time series graphs of the ILI cases and CUSUM values. However, the CUSUM algorithm did not have a good performance in the timely detection of influenza activity. Despite a considerable similarity between time series of the ILI cases and CUSUM algorithm in weekly lag, the seasonal activity of influenza in Iran could not be predicted by the CUSUM algorithm.
Collapse
Affiliation(s)
- Seyedhadi Hosseini
- Department of Epidemiology, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Manoochehr Karami
- Department of Epidemiology, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran.,Research Center for Health Sciences, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Maryam Farhadian
- Modeling of Non-communicable Diseases Research Center, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Younes Mohammadi
- Social Determinants of Health Research Center, Hamadan University of Medical Sciences, Hamadan, Iran
| |
Collapse
|
22
|
Alessa A, Faezipour M. Flu Outbreak Prediction Using Twitter Posts Classification and Linear Regression With Historical Centers for Disease Control and Prevention Reports: Prediction Framework Study. JMIR Public Health Surveill 2019; 5:e12383. [PMID: 31237567 PMCID: PMC6615001 DOI: 10.2196/12383] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2018] [Revised: 05/22/2019] [Accepted: 05/25/2019] [Indexed: 01/26/2023] Open
Abstract
Background Social networking sites (SNSs) such as Twitter are widely used by diverse demographic populations. The amount of data within SNSs has created an efficient resource for real-time analysis. Thus, data from SNSs can be used effectively to track disease outbreaks and provide necessary warnings. Current SNS-based flu detection and prediction frameworks apply conventional machine learning approaches that require lengthy training and testing, which is not the optimal solution for new outbreaks with new signs and symptoms. Objective The objective of this study was to propose an efficient and accurate framework that uses data from SNSs to track disease outbreaks and provide early warnings, even for newest outbreaks, accurately. Methods We presented a framework of outbreak prediction that included 3 main modules: text classification, mapping, and linear regression for weekly flu rate predictions. The text classification module used the features of sentiment analysis and predefined keyword occurrences. Various classifiers, including FastText (FT) and 6 conventional machine learning algorithms, were evaluated to identify the most efficient and accurate one for the proposed framework. The text classifiers were trained and tested using a prelabeled dataset of flu-related and unrelated Twitter postings. The selected text classifier was then used to classify over 8,400,000 tweet documents. The flu-related documents were then mapped on a weekly basis using a mapping module. Finally, the mapped results were passed together with historical Centers for Disease Control and Prevention (CDC) data to a linear regression module for weekly flu rate predictions. Results The evaluation of flu tweet classification showed that FT, together with the extracted features, achieved accurate results with an F-measure value of 89.9% in addition to its efficiency. Therefore, FT was chosen to be the classification module to work together with the other modules in the proposed framework, including a regression-based estimator, for flu trend predictions. The estimator was evaluated using several regression models. Regression results show that the linear regression–based estimator achieved the highest accuracy results using the measure of Pearson correlation. Thus, the linear regression model was used for the module of weekly flu rate estimation. The prediction results were compared with the available recent data from CDC as the ground truth and showed a strong correlation of 96.29% . Conclusions The results demonstrated the efficiency and the accuracy of the proposed framework that can be used even for new outbreaks with new signs and symptoms. The classification results demonstrated that the FT-based framework improves the accuracy and the efficiency of flu disease surveillance systems that use unstructured data such as data from SNSs.
Collapse
Affiliation(s)
- Ali Alessa
- Department of Computer Science and Engineering, University of Bridgeport, Bridgeport, CT, United States.,Institute of Public Administration, Riyadh, Saudi Arabia
| | - Miad Faezipour
- Department of Computer Science and Engineering, University of Bridgeport, Bridgeport, CT, United States.,Department of Biomedical Engineering, University of Bridgeport, Bridgeport, CT, United States
| |
Collapse
|
23
|
Abstract
As the United Kingdom (UK) negotiates its separation from the European Union (EU), it is important to remember the public health mechanisms that are directly facilitated via our relationship with the EU. One such mechanism is the UK’s role within the European Centre for Disease Prevention and Control (ECDC). Global health protection is an area that is currently experiencing an unprecedented wave of innovation, both technologically and ideologically, and we must therefore ensure that our future relationship with ECDC is one that facilitates full involvement with the global health security systems of the future.
Collapse
|
24
|
Xue H, Bai Y, Hu H, Liang H. Regional level influenza study based on Twitter and machine learning method. PLoS One 2019; 14:e0215600. [PMID: 31013324 PMCID: PMC6478375 DOI: 10.1371/journal.pone.0215600] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Accepted: 04/04/2019] [Indexed: 11/28/2022] Open
Abstract
The significance of flu prediction is that the appropriate preventive and control measures can be taken by relevant departments after assessing predicted data; thus, morbidity and mortality can be reduced. In this paper, three flu prediction models, based on twitter and US Centers for Disease Control's (CDC's) Influenza-Like Illness (ILI) data, are proposed (models 1-3) to verify the factors that affect the spread of the flu. In this work, an Improved Particle Swarm Optimization algorithm to optimize the parameters of Support Vector Regression (IPSO-SVR) was proposed. The IPSO-SVR was trained by the independent and dependent variables of the three models (models 1-3) as input and output. The trained IPSO-SVR method was used to predict the regional unweighted percentage ILI (%ILI) events in the US. The prediction results of each model are analyzed and compared. The results show that the IPSO-SVR method (model 3) demonstrates excellent performance in real-time prediction of ILIs, and further highlights the benefits of using real-time twitter data, thus providing an effective means for the prevention and control of flu.
Collapse
Affiliation(s)
- Hongxin Xue
- School of Information and Communication Engineering, North University of China, Taiyuan, Shanxi, 030051, People’s Republic of China
- Department of Mathematics, School of Science, North University of China, Taiyuan, Shanxi, 030051, People’s Republic of China
| | - Yanping Bai
- Department of Mathematics, School of Science, North University of China, Taiyuan, Shanxi, 030051, People’s Republic of China
| | - Hongping Hu
- Department of Mathematics, School of Science, North University of China, Taiyuan, Shanxi, 030051, People’s Republic of China
| | - Haijian Liang
- National Key Laboratory for Electronic Measurement Technology, Key Laboratory of Instrumentation Science & Dynamic Measurement Ministry of Educations, School of Information and Communication Engineering, North University of China, Taiyuan, Shanxi, 030051, People’s Republic of China
| |
Collapse
|
25
|
Social network analysis for better understanding of influenza. J Biomed Inform 2019; 93:103161. [PMID: 30940598 DOI: 10.1016/j.jbi.2019.103161] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Revised: 03/17/2019] [Accepted: 03/29/2019] [Indexed: 11/22/2022]
Abstract
INTRODUCTION The objective of this study is to improve the understanding of spatial spreading of complicated cases of influenza that required hospitalizations, by creating heatmaps and social networks. They will allow to identify critical hubs and routes of spreading of Influenza, in specific geographic locations, in order to contain infections and prevent complications, that require hospitalizations. MATERIAL AND METHODS Data were downloaded from the Healthcare Cost and Utilization Project (HCUP) - SID, New York State database. Patients hospitalized with flu complications, between 2003 and 2012 were included in the research (30,380 cases). A novel approach was designed, by constructing heatmaps for specific geographic regions in New York state and power law networks, in order to analyze distribution of hospitalized flu cases. RESULTS Heatmaps revealed that distributions of patients follow urban areas and big roads, indicating that flu spreads along routes, that people use to travel. A scale-free network, created from correlations among zip codes, discovered that, the highest populated zip codes didn't have the largest number of patients with flu complications. Among the top five most affected zip codes, four were in Bronx. Demographics of top affected zip codes were presented in results. Normalized numbers of cases per population revealed that, none of zip codes from Bronx were in the top 20. All zip codes with the highest node degrees were in New York City area. DISCUSSION Heatmaps identified geographic distribution of hospitalized flu patients and network analysis identified hubs of the infection. Our results will enable better estimation of resources for prevention and treatment of hospitalized patients with complications of Influenza. CONCLUSION Analyses of geographic distribution of hospitalized patients with Influenza and demographic characteristics of populations, help us to make better planning and management of resources for Influenza patients, that require hospitalization. Obtained results could potentially help to save many lives and improve the health of the population.
Collapse
|
26
|
Ning S, Yang S, Kou SC. Accurate regional influenza epidemics tracking using Internet search data. Sci Rep 2019; 9:5238. [PMID: 30918276 PMCID: PMC6437143 DOI: 10.1038/s41598-019-41559-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 03/12/2019] [Indexed: 12/12/2022] Open
Abstract
Accurate, high-resolution tracking of influenza epidemics at the regional level helps public health agencies make informed and proactive decisions, especially in the face of outbreaks. Internet users' online searches offer great potential for the regional tracking of influenza. However, due to the complex data structure and reduced quality of Internet data at the regional level, few established methods provide satisfactory performance. In this article, we propose a novel method named ARGO2 (2-step Augmented Regression with GOogle data) that efficiently combines publicly available Google search data at different resolutions (national and regional) with traditional influenza surveillance data from the Centers for Disease Control and Prevention (CDC) for accurate, real-time regional tracking of influenza. ARGO2 gives very competitive performance across all US regions compared with available Internet-data-based regional influenza tracking methods, and it has achieved 30% error reduction over the best alternative method that we numerically tested for the period of March 2009 to March 2018. ARGO2 is reliable and robust, with the flexibility to incorporate additional information from other sources and resolutions, making it a powerful tool for regional influenza tracking, and potentially for tracking other social, economic, or public health events at the regional or local level.
Collapse
Affiliation(s)
- Shaoyang Ning
- Department of Statistics, Harvard University, 1 Oxford Street, Cambridge, 02138, MA, USA
| | - Shihao Yang
- Department of Statistics, Harvard University, 1 Oxford Street, Cambridge, 02138, MA, USA
| | - S C Kou
- Department of Statistics, Harvard University, 1 Oxford Street, Cambridge, 02138, MA, USA.
| |
Collapse
|
27
|
Lu FS, Hattab MW, Clemente CL, Biggerstaff M, Santillana M. Improved state-level influenza nowcasting in the United States leveraging Internet-based data and network approaches. Nat Commun 2019; 10:147. [PMID: 30635558 PMCID: PMC6329822 DOI: 10.1038/s41467-018-08082-0] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2018] [Accepted: 12/12/2018] [Indexed: 12/01/2022] Open
Abstract
In the presence of health threats, precision public health approaches aim to provide targeted, timely, and population-specific interventions. Accurate surveillance methodologies that can estimate infectious disease activity ahead of official healthcare-based reports, at relevant spatial resolutions, are important for achieving this goal. Here we introduce a methodological framework which dynamically combines two distinct influenza tracking techniques, using an ensemble machine learning approach, to achieve improved state-level influenza activity estimates in the United States. The two predictive techniques behind the ensemble utilize (1) a self-correcting statistical method combining influenza-related Google search frequencies, information from electronic health records, and historical flu trends within each state, and (2) a network-based approach leveraging spatio-temporal synchronicities observed in historical influenza activity across states. The ensemble considerably outperforms each component method in addition to previously proposed state-specific methods for influenza tracking, with higher correlations and lower prediction errors.
Collapse
Affiliation(s)
- Fred S Lu
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, 02115, USA.
| | - Mohammad W Hattab
- Wyss Institute for Biologically Inspired Engineering, Harvard Medical School, Boston, MA, 02115, USA
| | | | - Matthew Biggerstaff
- Influenza Division, National Center for Immunization and Respiratory Disease, Centers for Disease Control and Prevention, Atlanta, GA, 30333, USA
| | - Mauricio Santillana
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, 02115, USA.
- Department of Pediatrics, Harvard Medical School, Boston, MA, 02115, USA.
| |
Collapse
|
28
|
Fujibayashi K, Takahashi H, Tanei M, Uehara Y, Yokokawa H, Naito T. A New Influenza-Tracking Smartphone App (Flu-Report) Based on a Self-Administered Questionnaire: Cross-Sectional Study. JMIR Mhealth Uhealth 2018; 6:e136. [PMID: 29875082 PMCID: PMC6010834 DOI: 10.2196/mhealth.9834] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 04/19/2018] [Accepted: 04/22/2018] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Influenza infections can spread rapidly, and influenza outbreaks are a major public health concern worldwide. Early detection of signs of an influenza pandemic is important to prevent global outbreaks. Development of information and communications technologies for influenza surveillance, including participatory surveillance systems involving lay users, has recently increased. Many of these systems can estimate influenza activity faster than the conventional influenza surveillance systems. Unfortunately, few of these influenza-tracking systems are available in Japan. OBJECTIVE This study aimed to evaluate the flu-tracking ability of Flu-Report, a new influenza-tracking mobile phone app that uses a self-administered questionnaire for the early detection of influenza activity. METHODS Flu-Report was used to collect influenza-related information (ie, dates on which influenza infections were diagnosed) from November 2016 to March 2017. Participants were adult volunteers from throughout Japan, who also provided information about their cohabiting family members. The utility of Flu-Report was evaluated by comparison with the conventional influenza surveillance information and basic information from an existing large-scale influenza-tracking system (an automatic surveillance system based on electronic records of prescription drug purchases). RESULTS Information was obtained through Flu-Report for approximately 10,094 volunteers. In total, 2134 participants were aged <20 years, 6958 were aged 20-59 years, and 1002 were aged ≥60 years. Between November 2016 and March 2017, 347 participants reported they had influenza or an influenza-like illness in the 2016 season. Flu-Report-derived influenza infection time series data displayed a good correlation with basic information obtained from the existing influenza surveillance system (rho, ρ=.65, P=.001). However, the influenza morbidity ratio for our participants was approximately 25% of the mean influenza morbidity ratio for the Japanese population. The Flu-Report influenza morbidity ratio was 5.06% (108/2134) among those aged <20 years, 3.16% (220/6958) among those aged 20-59 years, and 0.59% (6/1002) among those aged ≥60 years. In contrast, influenza morbidity ratios for Japanese individuals aged <20 years, 20-59 years, and ≥60 years were recently estimated at 31.97% to 37.90%, 8.16% to 9.07%, and 2.71% to 4.39%, respectively. CONCLUSIONS Flu-Report supports easy access to near real-time information about influenza activity via the accumulation of self-administered questionnaires. However, Flu-Report users may be influenced by selection bias, which is a common issue associated with surveillance using information and communications technologies. Despite this, Flu-Report has the potential to provide basic data that could help detect influenza outbreaks.
Collapse
Affiliation(s)
- Kazutoshi Fujibayashi
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| | - Hiromizu Takahashi
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| | - Mika Tanei
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| | - Yuki Uehara
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| | - Hirohide Yokokawa
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| | - Toshio Naito
- Department of General Medicine, School of Medicine, Juntendo University, Tokyo, Japan
| |
Collapse
|
29
|
Yeung D. Social Media as a Catalyst for Policy Action and Social Change for Health and Well-Being: Viewpoint. J Med Internet Res 2018; 20:e94. [PMID: 29555624 PMCID: PMC5881041 DOI: 10.2196/jmir.8508] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 01/17/2018] [Accepted: 01/23/2018] [Indexed: 12/26/2022] Open
Abstract
This viewpoint paper argues that policy interventions can benefit from the continued use of social media analytics, which can serve as an important complement to traditional social science data collection and analysis. Efforts to improve well-being should provide an opportunity to explore these areas more deeply, and encourage the efforts of those conducting national and local data collection on health to incorporate more of these emerging data sources. Social media remains a relatively untapped source of information to catalyze policy action and social change. However, the diversity of social media platforms and available analysis techniques provides multiple ways to offer insight for policy making and decision making. For instance, social media content can provide timely information about the impact of policy interventions. Social media location information can inform where to deploy resources or disseminate public messaging. Network analysis of social media connections can reveal underserved populations who may be disconnected from public services. Machine learning can help recognize important patterns for disease surveillance or to model population sentiment. To fully realize these potential policy uses, limitations to social media data will need to be overcome, including data reliability and validity, and potential privacy risks. Traditional data collection may not fully capture the upstream factors and systemic relationships that influence health and well-being. Policy actions and social change efforts, such as the Robert Wood Johnson Foundation’s effort to advance a culture of health, which are intended to drive change in a network of upstream health drivers, will need to incorporate a broad range of behavioral information, such as health attitudes or physical activity levels. Applying innovative techniques to emerging data has the potential to extract insight from unstructured data or fuse disparate sources of data, such as linking health attitudes that are expressed to health behaviors or broader health and well-being outcomes.
Collapse
|
30
|
Mavragani A, Sampri A, Sypsa K, Tsagarakis KP. Integrating Smart Health in the US Health Care System: Infodemiology Study of Asthma Monitoring in the Google Era. JMIR Public Health Surveill 2018; 4:e24. [PMID: 29530839 PMCID: PMC5869181 DOI: 10.2196/publichealth.8726] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 10/15/2017] [Accepted: 01/13/2018] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND With the internet's penetration and use constantly expanding, this vast amount of information can be employed in order to better assess issues in the US health care system. Google Trends, a popular tool in big data analytics, has been widely used in the past to examine interest in various medical and health-related topics and has shown great potential in forecastings, predictions, and nowcastings. As empirical relationships between online queries and human behavior have been shown to exist, a new opportunity to explore the behavior toward asthma-a common respiratory disease-is present. OBJECTIVE This study aimed at forecasting the online behavior toward asthma and examined the correlations between queries and reported cases in order to explore the possibility of nowcasting asthma prevalence in the United States using online search traffic data. METHODS Applying Holt-Winters exponential smoothing to Google Trends time series from 2004 to 2015 for the term "asthma," forecasts for online queries at state and national levels are estimated from 2016 to 2020 and validated against available Google query data from January 2016 to June 2017. Correlations among yearly Google queries and between Google queries and reported asthma cases are examined. RESULTS Our analysis shows that search queries exhibit seasonality within each year and the relationships between each 2 years' queries are statistically significant (P<.05). Estimated forecasting models for a 5-year period (2016 through 2020) for Google queries are robust and validated against available data from January 2016 to June 2017. Significant correlations were found between (1) online queries and National Health Interview Survey lifetime asthma (r=-.82, P=.001) and current asthma (r=-.77, P=.004) rates from 2004 to 2015 and (2) between online queries and Behavioral Risk Factor Surveillance System lifetime (r=-.78, P=.003) and current asthma (r=-.79, P=.002) rates from 2004 to 2014. The correlations are negative, but lag analysis to identify the period of response cannot be employed until short-interval data on asthma prevalence are made available. CONCLUSIONS Online behavior toward asthma can be accurately predicted, and significant correlations between online queries and reported cases exist. This method of forecasting Google queries can be used by health care officials to nowcast asthma prevalence by city, state, or nationally, subject to future availability of daily, weekly, or monthly data on reported cases. This method could therefore be used for improved monitoring and assessment of the needs surrounding the current population of patients with asthma.
Collapse
Affiliation(s)
- Amaryllis Mavragani
- Department of Computing Science and Mathematics, Faculty of Natural Sciences, University of Stirling, Stirling, United Kingdom
| | - Alexia Sampri
- Department of Computing Science and Mathematics, Faculty of Natural Sciences, University of Stirling, Stirling, United Kingdom
| | - Karla Sypsa
- Department of Pharmacy and Forensic Science, King's College London, University of London, London, United Kingdom
| | - Konstantinos P Tsagarakis
- Business and Environmental Technology Economics Lab, Department of Environmental Engineering, Democritus University of Thrace, Xanthi, Greece
| |
Collapse
|
31
|
Alessa A, Faezipour M. A review of influenza detection and prediction through social networking sites. Theor Biol Med Model 2018; 15:2. [PMID: 29386017 PMCID: PMC5793414 DOI: 10.1186/s12976-017-0074-5] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 12/06/2017] [Indexed: 02/02/2023] Open
Abstract
Early prediction of seasonal epidemics such as influenza may reduce their impact in daily lives. Nowadays, the web can be used for surveillance of diseases. Search engines and social networking sites can be used to track trends of different diseases seven to ten days faster than government agencies such as Center of Disease Control and Prevention (CDC). CDC uses the Illness-Like Influenza Surveillance Network (ILINet), which is a program used to monitor Influenza-Like Illness (ILI) sent by thousands of health care providers in order to detect influenza outbreaks. It is a reliable tool, however, it is slow and expensive. For that reason, many studies aim to develop methods that do real time analysis to track ILI using social networking sites. Social media data such as Twitter can be used to predict the spread of flu in the population and can help in getting early warnings. Today, social networking sites (SNS) are used widely by many people to share thoughts and even health status. Therefore, SNS provides an efficient resource for disease surveillance and a good way to communicate to prevent disease outbreaks. The goal of this study is to review existing alternative solutions that track flu outbreak in real time using social networking sites and web blogs. Many studies have shown that social networking sites can be used to conduct real time analysis for better predictions.
Collapse
Affiliation(s)
- Ali Alessa
- Department of Computer Science and Engineering, School of Engineering, University of Bridgeport, 221 University Avenue, Bridgeport, 06604 CT USA
| | - Miad Faezipour
- Department of Computer Science and Engineering, School of Engineering, University of Bridgeport, 221 University Avenue, Bridgeport, 06604 CT USA
- Department of Biomedical Engineering, School of Engineering, University of Bridgeport, 221 University Avenue, Bridgeport, 06604 CT USA
| |
Collapse
|
32
|
Huang DC, Wang JF. Monitoring hand, foot and mouth disease by combining search engine query data and meteorological factors. THE SCIENCE OF THE TOTAL ENVIRONMENT 2018; 612:1293-1299. [PMID: 28898935 DOI: 10.1016/j.scitotenv.2017.09.017] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Revised: 09/03/2017] [Accepted: 09/03/2017] [Indexed: 05/12/2023]
Abstract
Hand, foot and mouth disease (HFMD) has been recognized as a significant public health threat and poses a tremendous challenge to disease control departments. To date, the relationship between meteorological factors and HFMD has been documented, and public interest of disease has been proven to be trackable from the Internet. However, no study has explored the combination of these two factors in the monitoring of HFMD. Therefore, the main aim of this study was to develop an effective monitoring model of HFMD in Guangzhou, China by utilizing historical HFMD cases, Internet-based search engine query data and meteorological factors. To this end, a case study was conducted in Guangzhou, using a network-based generalized additive model (GAM) including all factors related to HFMD. Three other models were also constructed using some of the variables for comparison. The results suggested that the model showed the best estimating ability when considering all of the related factors.
Collapse
Affiliation(s)
- Da-Cang Huang
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Science and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; Key Laboratory of Surveillance and Early Warning on Infectious Disease, Chinese Center for Disease Control and Prevention, Beijing 102206, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jin-Feng Wang
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Science and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China.
| |
Collapse
|
33
|
Priedhorsky R, Osthus D, Daughton AR, Moran KR, Generous N, Fairchild G, Deshpande A, Del Valle SY. Measuring Global Disease with Wikipedia: Success, Failure, and a Research Agenda. CSCW : PROCEEDINGS OF THE CONFERENCE ON COMPUTER-SUPPORTED COOPERATIVE WORK. CONFERENCE ON COMPUTER-SUPPORTED COOPERATIVE WORK 2017; 2017:1812-1834. [PMID: 28782059 PMCID: PMC5542563 DOI: 10.1145/2998181.2998183] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
Effective disease monitoring provides a foundation for effective public health systems. This has historically been accomplished with patient contact and bureaucratic aggregation, which tends to be slow and expensive. Recent internet-based approaches promise to be real-time and cheap, with few parameters. However, the question of when and how these approaches work remains open. We addressed this question using Wikipedia access logs and category links. Our experiments, replicable and extensible using our open source code and data, test the effect of semantic article filtering, amount of training data, forecast horizon, and model staleness by comparing across 6 diseases and 4 countries using thousands of individual models. We found that our minimal-configuration, language-agnostic article selection process based on semantic relatedness is effective for improving predictions, and that our approach is relatively insensitive to the amount and age of training data. We also found, in contrast to prior work, very little forecasting value, and we argue that this is consistent with theoretical considerations about the nature of forecasting. These mixed results lead us to propose that the currently observational field of internet-based disease surveillance must pivot to include theoretical models of information flow as well as controlled experiments based on simulations of disease.
Collapse
Affiliation(s)
| | - Dave Osthus
- Computer, Computational, and Statistical Sciences (CCS) Division
| | - Ashlynn R Daughton
- Analytics, Intelligence, and Technology (A) Division Los Alamos National Laboratory Los Alamos, NM
| | - Kelly R Moran
- Analytics, Intelligence, and Technology (A) Division Los Alamos National Laboratory Los Alamos, NM
| | - Nicholas Generous
- Analytics, Intelligence, and Technology (A) Division Los Alamos National Laboratory Los Alamos, NM
| | - Geoffrey Fairchild
- Analytics, Intelligence, and Technology (A) Division Los Alamos National Laboratory Los Alamos, NM
| | - Alina Deshpande
- Analytics, Intelligence, and Technology (A) Division Los Alamos National Laboratory Los Alamos, NM
| | - Sara Y Del Valle
- Analytics, Intelligence, and Technology (A) Division Los Alamos National Laboratory Los Alamos, NM
| |
Collapse
|
34
|
Using Baidu Search Index to Predict Dengue Outbreak in China. Sci Rep 2016; 6:38040. [PMID: 27905501 PMCID: PMC5131307 DOI: 10.1038/srep38040] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Accepted: 11/04/2016] [Indexed: 12/23/2022] Open
Abstract
This study identified the possible threshold to predict dengue fever (DF) outbreaks using Baidu Search Index (BSI). Time-series classification and regression tree models based on BSI were used to develop a predictive model for DF outbreak in Guangzhou and Zhongshan, China. In the regression tree models, the mean autochthonous DF incidence rate increased approximately 30-fold in Guangzhou when the weekly BSI for DF at the lagged moving average of 1-3 weeks was more than 382. When the weekly BSI for DF at the lagged moving average of 1-5 weeks was more than 91.8, there was approximately 9-fold increase of the mean autochthonous DF incidence rate in Zhongshan. In the classification tree models, the results showed that when the weekly BSI for DF at the lagged moving average of 1-3 weeks was more than 99.3, there was 89.28% chance of DF outbreak in Guangzhou, while, in Zhongshan, when the weekly BSI for DF at the lagged moving average of 1-5 weeks was more than 68.1, the chance of DF outbreak rose up to 100%. The study indicated that less cost internet-based surveillance systems can be the valuable complement to traditional DF surveillance in China.
Collapse
|
35
|
Moran KR, Fairchild G, Generous N, Hickmann K, Osthus D, Priedhorsky R, Hyman J, Del Valle SY. Epidemic Forecasting is Messier Than Weather Forecasting: The Role of Human Behavior and Internet Data Streams in Epidemic Forecast. J Infect Dis 2016; 214:S404-S408. [PMID: 28830111 PMCID: PMC5181546 DOI: 10.1093/infdis/jiw375] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Mathematical models, such as those that forecast the spread of epidemics or predict the weather, must overcome the challenges of integrating incomplete and inaccurate data in computer simulations, estimating the probability of multiple possible scenarios, incorporating changes in human behavior and/or the pathogen, and environmental factors. In the past 3 decades, the weather forecasting community has made significant advances in data collection, assimilating heterogeneous data steams into models and communicating the uncertainty of their predictions to the general public. Epidemic modelers are struggling with these same issues in forecasting the spread of emerging diseases, such as Zika virus infection and Ebola virus disease. While weather models rely on physical systems, data from satellites, and weather stations, epidemic models rely on human interactions, multiple data sources such as clinical surveillance and Internet data, and environmental or biological factors that can change the pathogen dynamics. We describe some of similarities and differences between these 2 fields and how the epidemic modeling community is rising to the challenges posed by forecasting to help anticipate and guide the mitigation of epidemics. We conclude that some of the fundamental differences between these 2 fields, such as human behavior, make disease forecasting more challenging than weather forecasting.
Collapse
Affiliation(s)
| | | | | | | | - Dave Osthus
- Computer, Computational & Statistical Sciences Division
| | - Reid Priedhorsky
- High Performance Computing Division, Los Alamos National Laboratory, New Mexico
| | - James Hyman
- Theoretical Division
- Department of Mathematics, Tulane University, New Orleans, Louisiana
| | | |
Collapse
|
36
|
Quantifying the UK Online Interest in Substances of the EU Watchlist for Water Monitoring: Diclofenac, Estradiol, and the Macrolide Antibiotics. WATER 2016. [DOI: 10.3390/w8110542] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
|
37
|
Huang DC, Wang JF, Huang JX, Sui DZ, Zhang HY, Hu MG, Xu CD. Towards Identifying and Reducing the Bias of Disease Information Extracted from Search Engine Data. PLoS Comput Biol 2016; 12:e1004876. [PMID: 27271698 PMCID: PMC4894584 DOI: 10.1371/journal.pcbi.1004876] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Accepted: 03/17/2016] [Indexed: 11/19/2022] Open
Abstract
The estimation of disease prevalence in online search engine data (e.g., Google Flu Trends (GFT)) has received a considerable amount of scholarly and public attention in recent years. While the utility of search engine data for disease surveillance has been demonstrated, the scientific community still seeks ways to identify and reduce biases that are embedded in search engine data. The primary goal of this study is to explore new ways of improving the accuracy of disease prevalence estimations by combining traditional disease data with search engine data. A novel method, Biased Sentinel Hospital-based Area Disease Estimation (B-SHADE), is introduced to reduce search engine data bias from a geographical perspective. To monitor search trends on Hand, Foot and Mouth Disease (HFMD) in Guangdong Province, China, we tested our approach by selecting 11 keywords from the Baidu index platform, a Chinese big data analyst similar to GFT. The correlation between the number of real cases and the composite index was 0.8. After decomposing the composite index at the city level, we found that only 10 cities presented a correlation of close to 0.8 or higher. These cities were found to be more stable with respect to search volume, and they were selected as sample cities in order to estimate the search volume of the entire province. After the estimation, the correlation improved from 0.8 to 0.864. After fitting the revised search volume with historical cases, the mean absolute error was 11.19% lower than it was when the original search volume and historical cases were combined. To our knowledge, this is the first study to reduce search engine data bias levels through the use of rigorous spatial sampling strategies.
Collapse
Affiliation(s)
- Da-Cang Huang
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Science and Natural Resource Research, Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Surveillance and Early Warning on Infectious Disease, Chinese Center for Disease Control and Prevention, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jin-Feng Wang
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Science and Natural Resource Research, Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Surveillance and Early Warning on Infectious Disease, Chinese Center for Disease Control and Prevention, Beijing, China
- * E-mail:
| | - Ji-Xia Huang
- College of Forestry, Beijing Forestry University, Beijing, China
| | - Daniel Z. Sui
- Department of Geography, The Ohio State University, Columbus, Ohio, United States of America
| | - Hong-Yan Zhang
- School of Geographical Science, Northeast Normal University, Changchun, China
| | - Mao-Gui Hu
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Science and Natural Resource Research, Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Surveillance and Early Warning on Infectious Disease, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Cheng-Dong Xu
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Science and Natural Resource Research, Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Surveillance and Early Warning on Infectious Disease, Chinese Center for Disease Control and Prevention, Beijing, China
| |
Collapse
|
38
|
DEFENDER: Detecting and Forecasting Epidemics Using Novel Data-Analytics for Enhanced Response. PLoS One 2016; 11:e0155417. [PMID: 27192059 PMCID: PMC4871418 DOI: 10.1371/journal.pone.0155417] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Accepted: 04/28/2016] [Indexed: 11/19/2022] Open
Abstract
In recent years social and news media have increasingly been used to explain patterns in disease activity and progression. Social media data, principally from the Twitter network, has been shown to correlate well with official disease case counts. This fact has been exploited to provide advance warning of outbreak detection, forecasting of disease levels and the ability to predict the likelihood of individuals developing symptoms. In this paper we introduce DEFENDER, a software system that integrates data from social and news media and incorporates algorithms for outbreak detection, situational awareness and forecasting. As part of this system we have developed a technique for creating a location network for any country or region based purely on Twitter data. We also present a disease nowcasting (forecasting the current but still unknown level) approach which leverages counts from multiple symptoms, which was found to improve the nowcasting accuracy by 37 percent over a model that used only previous case data. Finally we attempt to forecast future levels of symptom activity based on observed user movement on Twitter, finding a moderate gain of 5 percent over a time series forecasting model.
Collapse
|
39
|
Santillana M, Nguyen AT, Louie T, Zink A, Gray J, Sung I, Brownstein JS. Cloud-based Electronic Health Records for Real-time, Region-specific Influenza Surveillance. Sci Rep 2016; 6:25732. [PMID: 27165494 PMCID: PMC4863169 DOI: 10.1038/srep25732] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 04/20/2016] [Indexed: 11/30/2022] Open
Abstract
Accurate real-time monitoring systems of influenza outbreaks help public health officials make informed decisions that may help save lives. We show that information extracted from cloud-based electronic health records databases, in combination with machine learning techniques and historical epidemiological information, have the potential to accurately and reliably provide near real-time regional estimates of flu outbreaks in the United States.
Collapse
Affiliation(s)
- M. Santillana
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Harvard School of Engineering and Applied Sciences, Cambridge, MA, USA
| | - A. T. Nguyen
- Harvard School of Engineering and Applied Sciences, Cambridge, MA, USA
| | - T. Louie
- Harvard School of Public Health, Boston, MA, USA
| | - A. Zink
- athenaResearch at athenahealth, Watertown, MA, USA
| | - J. Gray
- athenaResearch at athenahealth, Watertown, MA, USA
| | - I. Sung
- athenaResearch at athenahealth, Watertown, MA, USA
| | - J. S. Brownstein
- Computational Health Informatics Program, Boston Children’s Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| |
Collapse
|
40
|
Kidd BA. Decoding the immune response to successful influenza vaccination. Nat Immunol 2016; 17:113-4. [DOI: 10.1038/ni.3372] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
41
|
Accurate estimation of influenza epidemics using Google search data via ARGO. Proc Natl Acad Sci U S A 2015; 112:14473-8. [PMID: 26553980 DOI: 10.1073/pnas.1515373112] [Citation(s) in RCA: 179] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Accurate real-time tracking of influenza outbreaks helps public health officials make timely and meaningful decisions that could save lives. We propose an influenza tracking model, ARGO (AutoRegression with GOogle search data), that uses publicly available online search data. In addition to having a rigorous statistical foundation, ARGO outperforms all previously available Google-search-based tracking models, including the latest version of Google Flu Trends, even though it uses only low-quality search data as input from publicly available Google Trends and Google Correlate websites. ARGO not only incorporates the seasonality in influenza epidemics but also captures changes in people's online search behavior over time. ARGO is also flexible, self-correcting, robust, and scalable, making it a potentially powerful tool that can be used for real-time tracking of other social events at multiple temporal and spatial resolutions.
Collapse
|
42
|
Santillana M, Nguyen AT, Dredze M, Paul MJ, Nsoesie EO, Brownstein JS. Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance. PLoS Comput Biol 2015; 11:e1004513. [PMID: 26513245 PMCID: PMC4626021 DOI: 10.1371/journal.pcbi.1004513] [Citation(s) in RCA: 208] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Accepted: 08/24/2015] [Indexed: 11/19/2022] Open
Abstract
We present a machine learning-based methodology capable of providing real-time ("nowcast") and forecast estimates of influenza activity in the US by leveraging data from multiple data sources including: Google searches, Twitter microblogs, nearly real-time hospital visit records, and data from a participatory surveillance system. Our main contribution consists of combining multiple influenza-like illnesses (ILI) activity estimates, generated independently with each data source, into a single prediction of ILI utilizing machine learning ensemble approaches. Our methodology exploits the information in each data source and produces accurate weekly ILI predictions for up to four weeks ahead of the release of CDC's ILI reports. We evaluate the predictive ability of our ensemble approach during the 2013-2014 (retrospective) and 2014-2015 (live) flu seasons for each of the four weekly time horizons. Our ensemble approach demonstrates several advantages: (1) our ensemble method's predictions outperform every prediction using each data source independently, (2) our methodology can produce predictions one week ahead of GFT's real-time estimates with comparable accuracy, and (3) our two and three week forecast estimates have comparable accuracy to real-time predictions using an autoregressive model. Moreover, our results show that considerable insight is gained from incorporating disparate data streams, in the form of social media and crowd sourced data, into influenza predictions in all time horizons.
Collapse
Affiliation(s)
- Mauricio Santillana
- Harvard School of Engineering and Applied Sciences, Cambridge, Massachusetts, United States of America
- Boston Children’s Hospital Informatics Program, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - André T. Nguyen
- Harvard School of Engineering and Applied Sciences, Cambridge, Massachusetts, United States of America
| | - Mark Dredze
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Michael J. Paul
- Department of Information Science, University of Colorado, Boulder, Colorado, United States of America
| | - Elaine O. Nsoesie
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
- Institute for Health Metrics and Evaluation, Seattle, Washington, United States of America
| | - John S. Brownstein
- Boston Children’s Hospital Informatics Program, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
43
|
Abstract
Background Globalization and the potential for rapid spread of emerging infectious diseases have heightened the need for ongoing surveillance and early detection. The Global Public Health Intelligence Network (GPHIN) was established to increase situational awareness and capacity for the early detection of emerging public health events. Objective To describe how the GPHIN has used Big Data as an effective early detection technique for infectious disease outbreaks worldwide and to identify potential future directions for the GPHIN. Findings Every day the GPHIN analyzes over more than 20,000 online news reports (over 30,000 sources) in nine languages worldwide. A web-based program aggregates data based on an algorithm that provides potential signals of emerging public health events which are then reviewed by a multilingual, multidisciplinary team. An alert is sent out if a potential risk is identified. This process proved useful during the Severe Acute Respiratory Syndrome (SARS) outbreak and was adopted shortly after by a number of countries to meet new International Health Regulations that require each country to have the capacity for early detection and reporting. The GPHIN identified the early SARS outbreak in China, was credited with the first alert on MERS-CoV and has played a significant role in the monitoring of the Ebola outbreak in West Africa. Future developments are being considered to advance the GPHIN's capacity in light of other Big Data sources such as social media and its analytical capacity in terms of algorithm development. Conclusion The GPHIN's early adoption of Big Data has increased global capacity to detect international infectious disease outbreaks and other public health events. Integration of additional Big Data sources and advances in analytical capacity could further strengthen the GPHIN's capability for timely detection and early warning.
Collapse
|
44
|
Ma C, Smith HW, Chu C, Juarez DT. Big data in pharmacy practice: current use, challenges, and the future. INTEGRATED PHARMACY RESEARCH AND PRACTICE 2015; 4:91-99. [PMID: 29354523 PMCID: PMC5741030 DOI: 10.2147/iprp.s55862] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Pharmacy informatics is defined as the use and integration of data, information, knowledge, technology, and automation in the medication-use process for the purpose of improving health outcomes. The term "big data" has been coined and is often defined in three V's: volume, velocity, and variety. This paper describes three major areas in which pharmacy utilizes big data, including: 1) informed decision making (clinical pathways and clinical practice guidelines); 2) improved care delivery in health care settings such as hospitals and community pharmacy practice settings; and 3) quality performance measurement for the Centers for Medicare and Medicaid and medication management activities such as tracking medication adherence and medication reconciliation.
Collapse
Affiliation(s)
- Carolyn Ma
- Department of Pharmacy Practice, The Daniel K Inouye College of Pharmacy, University of Hawai‘i at Hilo, Hilo, HI, USA
| | - Helen Wong Smith
- Department of Pharmacy Practice, The Daniel K Inouye College of Pharmacy, University of Hawai‘i at Hilo, Hilo, HI, USA
| | - Cherie Chu
- Department of Pharmacy Practice, The Daniel K Inouye College of Pharmacy, University of Hawai‘i at Hilo, Hilo, HI, USA
| | - Deborah T Juarez
- Department of Pharmacy Practice, The Daniel K Inouye College of Pharmacy, University of Hawai‘i at Hilo, Hilo, HI, USA
| |
Collapse
|
45
|
Affiliation(s)
- Viroj Wiwanitkit
- Surin Rajabhat University, Thailand; Wiwanitkit House, Thailand; Hainan Medical College, China
| |
Collapse
|
46
|
Stensrud MJ. Om å tenke stort. TIDSSKRIFT FOR DEN NORSKE LEGEFORENING 2015; 135:869-70. [DOI: 10.4045/tidsskr.15.0171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022] Open
|