Lyme disease incidence has been increasing among U.S. Department of Defense (DOD) service members over the past 20 years, threatening the health and readiness of the force.1 Syndromic surveillance of tick bite visits can provide timely information that might predict changes in tick-borne disease incidence and geographic spread.2 Tick and tick-borne illness surveillance programs in the U.S. face many barriers, however, including inconsistent funding and limited capacities.3
Epidemic forecasting models can be used to enhance routine surveillance and inform public health policy.4 Previous research on Lyme disease forecasting has focused on general time series and machine-learning models; however, these models have demonstrated high percentage errors and limited predictive accuracy.5,6 Further research is needed to explore alternative modeling approaches that may improve forecasting performance.
Since 2019, the Armed Forces Health Surveillance Division, Integrated Biosurveillance (AFHSD-IB) Branch, within the Defense Health Agency’s Public Health Directorate, has been conducting respiratory disease forecasting using syndromic surveillance data.7 Since 2024, AFHSD-IB has integrated vector-borne disease forecasting to provide situational awareness and inform public health responses from DOD senior leaders. This report aims to predict the number of future Lyme disease and tick bite encounters among U.S Military Health System (MHS) beneficiaries using outpatient encounter data and multiple forecasting methods.
Methods
Tick bite encounters were used as a proxy for tick exposure, while Lyme disease encounters served as a proxy for Lyme disease diagnoses in the absence of trusted case reporting. Encounter definitions were developed using internal criteria.
A single instance of a Lyme disease encounter was defined using the International Classification of Diseases, 10th Revision (ICD-10), discharge diagnosis code ‘A69.2’ and chief complaint terms “Lyme disease,” “erythema migrans,” or “bulls-eye rash,” or their misspellings, in the chief complaint field; any mention of a history of Lyme disease was excluded. Similarly, tick bite encounters were defined as health records mentioning “tick bite” and associated misspellings (including “tic” or “tik” for tick and “bit” for bite) in the chief complaint field.
Monthly direct-sourced outpatient encounter data for each military treatment facility were obtained from the DOD’s Electronic Surveillance System for the Early Notification of Community-based Epidemics (ESSENCE). Data were then aggregated into 4 U.S. surveillance regions, selected based on their high volume of Lyme disease encounters in 2024: National Capital Region, New England, Tidewater, and West Point. Surveillance regions are collections of neighboring military installations, clinics, and hospitals used for disease surveillance and reporting. (Information about each surveillance region is shown in Supplementary Table 1.) Monthly encounter data from January 2021 through October 2024 were collected and used to generate 1- through 3-month forecasts for each outcome metric and surveillance region for the period January through October 2024. Models were trained each month, using data from January 2021 through the most recent month.
Several time series and machine-learning models were used for forecasting. The seasonal autoregressive integrated moving average (SARIMA) model is based on the widely used Box-Jenkins method for univariate time series forecasting.8 The error, trend, seasonal (ETS) model is another time series model that belongs to a special class of exponential smoothing models known as state-space models.8 The exponentially weighted moving average (EWMA) model is a highly sensitive model that effectively identifies subtle data pattern changes due to its weighting scheme, which is of particular importance when assessing rare outcomes.9 The vector autoregressive (VAR) model is a forecasting algorithm that can be used when 2 or more time series influence each other.10 The neural network (NNET) model is a machine-learning model that can adapt to changing inputs, generating the best possible results without needing to redesign the output criteria.11 Prophet is open-source forecasting model created by Facebook that allows users to easily customize and produce high quality forecasts.12 Finally, a baseline naïve model was created using data from the previous tick-borne season. An ensemble (ENS) model was also calculated as the average of all individual model forecasts.
A log-transformed weighted interval score (WIS) was used to measure accuracy of the model forecasts. WIS was previously established as a scoring method for respiratory disease forecasts,13 with lower values indicating more accurate forecasts. The median absolute percentage error (MAPE), another common metric for forecasting error, was also calculated. All analyses were performed in R software 4.4 (R Foundation for Statistical Computing, Vienna, Austria), including the fable14 and fabletools15 packages.
Results
Our analyses indicated that observed Lyme disease encounters increased from late winter to early spring, peaking during the summer months, except for West Point, which peaked in April. West Point also had the highest peak (34.5 per 100,000 MHS beneficiaries), followed by New England (31.4 in July), National Capital Region (13.9, June), and Tidewater (4.2, June) (Figure 1). Observed tick bite encounters increased from February until spring for each surveillance region. The National Capital Region had the highest peak (50.0 per 100,000 MHS beneficiaries in June), followed by West Point (39.5, April), New England (29.1, April), and Tidewater (18.9, May). Figure 1 also shows 1- through 3-month ahead horizon forecasts for the ENS model.

For Lyme disease encounter forecasts, the NNET model had the lowest median log-WIS across the 2-month (0.6) and 3-month (0.8) ahead horizons, while the ENS model had the lowest median log-WIS for the 1-month ahead horizon (0.7) (Figure 2). The ETS model had the lowest MAPE for Lyme disease encounters for the 1-month ahead horizon (33%), while the NNET model had the lowest MAPE for the 2-month ahead horizon (37%), and the ENS model had the lowest MAPE for the 3-month ahead horizon (28%) ahead horizon (Supplementary Figure 2). During the months with the highest activity (March–October), MAPE for the ENS model improved to 29% and 27%, respectively, for the 1-month and 3-month ahead horizons, while the MAPE for the NNET model improved to 29% for the 2-month ahead horizon, indicating greater forecasting accuracy during high activity periods when compared to the total surveillance time (Supplementary Table 2).

For tick bite encounter forecasts, the ETS model had the lowest score (1.0) for all horizons, as well as for the 1- (0.8), 2- (1.0), and 3-month (1.2) ahead horizons, followed by the NNET model (1.2, all horizons; 1.2, 1-month; 1.1, 2-month; and 1.2, 3-month) (Figure 2). The ETS model had the lowest MAPE across all 1-month (25%), 2-month (23%), and 3-month (24%) ahead horizons (Supplementary Figure 2). During the months of highest activity, MAPE for the ETS model improved to 21%, 22%, and 24% respectively for the 1-month, 2-month, and 3-month ahead horizons (Supplementary Table 2).
Discussion
This study produced 2 major findings. First, among all the time series and machine-learning models mentioned in our analysis, 3 models—ENS, ETS, NNET—provided the most accurate forecasts of Lyme disease and tick bite activity in MHS beneficiaries, with increased accuracy during peak activity periods. This finding suggests that these 3 models would be valuable as early warning signals for public health surveillance and preparedness. It is important to note, however, that model accuracy decreased at longer horizons. Second, MAPE estimates were higher than those reported in previous studies using SARIMA models, although accuracy was improved by focusing on periods of higher activity. Individual surveillance regions aligned with the findings of the aforementioned study.6
Limitations of this study must be considered. First, data lags and inconsistent reporting of Lyme disease cases within the MHS prevented validation of encounter data against confirmed case data. Consequently, sensitivity analysis of encounter definition robustness could not be performed, and therefore, results may not be representative of overall disease incidence. The training data set was derived from a 3-year period of encounter data, but additional historical data may be needed to properly train some of the models. Due to limited technology capacity, particularly hardware, this study also did not utilize more computationally intensive machine-learning models, such as long short-term memory and random forest models, potentially limiting the accuracy of the forecasts. In addition, these models are susceptible to over-fitting when there is insufficient training data.
Despite these limitations, this study provided the first quantitative evidence of the use of outpatient encounter syndromic surveillance data for forecasting Lyme disease and tick bite encounters in the MHS population. Lyme disease forecasting can provide vital information for anticipating the impact on military health and readiness, as well as informing effective public health responses and mitigation efforts within the DOD. Further research is required to explore additional models, more robust training data, and other covariates, including incident Lyme disease cases and other key predictors, such as host species, geographic factors, climate, and weather.




Authors’ Affiliation
Integrated Biosurveillance Branch, Armed Forces Health Surveillance Division, Public Health Directorate, Defense Health Agency, Silver Spring, MD
References
- Melanson VR, Bateman SL, Hering K, et al. A review of force health protection aspects of Lyme disease in the U.S. military. Mil Med. 2025;190(3-4):e561-e568. doi:10.1093/milmed/usae415
- Marx GE, Spillane M, Beck A, et al. Emergency department visits for tick bites: United States, January 2017–December 2019. MMWR Morb Mortal Wkly Rep. 2021;70(17):612-616. doi:10.15585/mmwr.mm7017a2
- Mader EM, Ganser C, Geiger A, et al. A survey of tick surveillance and control practices in the United States. J Med Entomol. 2021;58(4):1503-1512. doi:10.1093/jme/tjaa094
- Chowell G, Skums P. Investigating and forecasting infectious disease dynamics using epidemiological and molecular surveillance data. Phys Life Rev. 2024;51:294-327. doi:10.1016/j.plrev.2024.10.011
- Sun YQ, Z XY, Fan TC, et al. Projecting Lyme disease risk in the United States: a machine learning approach integrating environmental, socioeconomic and vector factors. One Health. 2025;21:101111. doi:10.1016/j.onehlt.2025.101111
- Kapitány-Fövény M, Ferenci T, Sulyok Z, et al. Can Google Trends data improve forecasting of Lyme disease incidence? Zoonoses Public Health. 2019;66(1):101-107. doi:10.1111/zph.12539
- Bova ML, McGee SA, Elliott KR, Ubiera JI. Predicting COVID-19 and respiratory illness: results of the 2022-2023 Armed Forces Health Surveillance Division forecasting challenge. MSMR. 2024;31(5):24-30. Accessed May 8, 2025. https://www.health.mil/news/articles/2024/05/01/msmr-afhsd-ib-2022-respiratory-forecast
- Khan DM, Ali M, Iqbal N, et al. Short-term prediction of COVID-19 using novel hybrid ensemble empirical mode decomposition and error trend seasonal model. Front Public Health. 2022;10:922795. doi:10.3389/fpubh.2022.922795
- Alfasanah Z, Niam MZH, Wardiani S, et al. Monitoring air quality index with EWMA and individual charts using XGBoost and SVR residuals. MethodsX. 2024;14:103107. doi:10.1016/j.mex.2024.103107
- Vector Autoregression (VAR): Comprehensive Guide with Examples in Python. Accessed Aug. 20, 2025. https://www.machinelearningplus.com/timeseries/vector-autoregression-examples-python
- Kumar GS. Understanding and Building Neural Network (NN) Models. 2025. Accessed Aug. 20, 2025. https://builtin.com/machine-learning/nnmodels
- Taylor SJ, Letham B. Prophet: Forecasting at Scale. 2017. Accessed Aug. 20, 2025. https://research.facebook.com/blog/2017/02/prophet-forecasting-at-scale
- Bracher J, Ray EL, Gneiting T, Reich NG. Evaluating epidemic forecasts in an interval format. [Published correction in PLoS Comput Biol. 2022;18(10):e1010592. doi:10.1371/journal.pcbi.1010592.]. PLoS Comput Biol. 2021;17(2):e1008618. doi:10.1371/journal.pcbi.1008618
- O’Hara-Wild M, Hyndman R, Wang E. fable: Forecasting Models for Tidy Time Series, R Package Version 0.4.0. 2024. Accessed Mar. 12, 2025. https://fable.tidyverts.org
- O’Hara-Wild M, Hyndman R, Wang E. fabletools: Core Tools for Packages in the ‘fable’ Framework. R package version 0.5.0. 2024. Accessed Mar. 12, 2025. https://fabletools.tidyverts.org/index.html