The Link Between Mental Health Provider Density and Poor Mental Health at the County Level
Introduction
In recent years, the mental health epidemic has become a growing public health concern. Factors such as social isolation, economic uncertainty, the stigmatization of mental illness, and limited access to care have all contributed to an increase in reported cases of anxiety, depression, and other mental health conditions. According to the Centers for Disease Control and Prevention (CDC), “more than one in five adults in the United States live with a mental illness” (Disease Control and Prevention (2025)). Poor mental health can significantly affect an individual’s quality of life, impacting emotional well-being, job performance, relationships, and physical health. Yet, according to the National Institutes of Health (NIH), only about “half of those experiencing mental illness receive treatment”(Mental Health (2025)).
This concerning gap between mental health needs and treatment raises an important question: does access to mental health care influence mental health outcomes?
One key component in addressing this issue is ensuring that individuals have access to qualified mental health professionals. However, access to care is not evenly distributed across geographic regions. While some counties benefit from a dense network of providers, others face critical shortages, leaving residents without adequate support or timely treatment.
This study aims to explore the relationship between the availability of mental health professionals and the mental well being of a population. Specifically, we ask: Does the number of mental health professionals per county affect the average number of poor mental health days? Understanding this relationship can help inform policy decisions, guide resource allocation, and identify areas most in need of improved mental health infrastructure, ultimately working toward better mental health outcomes at the community level.
Data
About the Data
We are using the 2025 County Health Rankings Dataset, which is collected by the University of Wisconsin Population Health Institute. The data ranks each county in all 50 states based on their health outcomes and variety of health factors. This dataset is widely used by policymakers and researchers to better understand and address factors influencing community and national health. Each row represents one U.S. county and its county-level metrics, and each column is a particular variable. We looked at 6 specific columns, focusing only on the raw values in that category. These were: mental health providers, poor mental health days, lack of social and emotional support, suicide rate, frequent mental distress, and county FIPS code. To standardize for population, we also engineered a variable of the proportion of Mental Health Providers per 100,000 people and used this to improve our visualizations.
Exploratory Data Analysis (EDA)
To explore the relationships between variables, we created a correlation plot of the numerical variables. The plot revealed a strong positive correlation between feelings of loneliness and lack of social and emotional support. In contrast, population showed little to no correlation with the other variables. Poor mental health days appeared to be moderately correlated with all other variables. Notably, the number of mental health providers showed a slight negative correlation with both poor mental health days and lack of social and emotional support.
To better understand the spatial distribution of mental health challenges and resource availability, we created choropleth maps at the county level across the U.S. The first map shows the number of poor mental health days, revealing clear regional disparities—counties in the South report higher rates, whereas parts of the North Central region experience relatively fewer poor mental health days. The second map visualizes the population-to-provider ratio, an indicator of access to care. A higher ratio suggests fewer mental health providers relative to the population.
These map highlights that Southwestern counties, along with parts of Alaska, tend to have more limited access to providers. Together, these maps underscore regional inequities in mental health outcomes and access to services, suggesting that both geography and infrastructure play key roles in shaping community well-being.
As part of our exploratory data analysis, we examined the relationship between provider access and suicide rates. While the overall trend shows only a slight association between these two variables, a few striking outliers emerge (primarily Alaskan counties) with both high suicide rates and high provider availability. This pattern suggests that some communities may have responded to elevated suicide risks by increasing access to mental health services as a preventative strategy, rather than the provider access itself being a direct deterrent of higher suicide rates.
Alaska
As mentioned earlier, Alaska exhibited high suicide rates despite having relatively high provider availability. To better understand this paradox, we conducted a more focused analysis on Alaskan counties.
In the choropleth maps below, we observe that many Alaskan counties report a high number of poor mental health days, despite having a greater density of mental health providers compared to other regions. This further supports the earlier finding that provider access alone may not be sufficient to mitigate mental health challenges, particularly in geographically isolated areas.
Building on this, we created a scatterplot analyzing poor mental health days in relation to lack of social and emotional support which are two key indicators strongly correlated in our earlier analysis. We filtered specifically for Alaskan boroughs and census areas and color coded them by region type (urban vs. rural).
The scatterplot shows a clear linear trend: as social support decreases, poor mental health days increase. Notably, rural regions tend to fall above the regression line, suggesting that even at similar levels of low social support, rural areas experience more poor mental health days than expected. In contrast, urban boroughs cluster toward the lower end of both axes, implying better outcomes.
This regional disparity points to broader social determinants of health, such as community isolation, economic opportunity, and access to nonclinical support systems, that likely contribute to elevated mental health struggles in rural Alaska. It also raises important questions about the effectiveness and reach of mental health services in remote communities and highlights social and emotional support as a potentially more critical factor in improving mental health outcomes.
Methods
Quasi-Poisson Regression
To test which variables were significant, we used Poisson regression. We chose this method because our outcome variable, poor mental health days, is count-based and non-negative. We checked assumptions for Poisson regression including independence and the equality of mean and variance. Due to observed overdispersion, we used a quasi-Poisson model to account for greater variance. All predictors were standardized before fitting the model. The Shapiro-Wilk test for residual normality yielded a p-value of 0.0009671, which at significance level supports the model’s normality assumptions. The regression revealed that feelings of loneliness is the most statistically significant predictor. We visualized these results using a table of percent change effects, which clearly highlighted the stronger social predictors.
We first fit a Poisson linear model because ‘poor_mental_days’ is a non-negative count variable.
where
= provider–population ratio
= feelings of loneliness
= lack of social support
= suicide rate
Assumption checks
- Independence – counties are administrative units and provide reasonable independence.
- Dispersion – the ratio of residual deviance to degrees of freedom was > 1, indicating over‑dispersion.
- Residual normality – A Shapiro–Wilk test on the model’s residuals gave a p-value of 0.009671, which at significance level supports the model’s normality assumptions.
Because of over‑dispersion we refit the model with a quasi‑Poisson model, which relaxes the dispersion requirement by estimating a dispersion parameter (). All numeric predictors were standardised (z‑scores) to place coefficients on a common scale.
Inference and interpretation
Coefficients from the quasi‑Poisson fit were converted to percent change in poor‑mental‑health days per 1‑SD increase in each predictor. Significance was judged by 95% confidence intervals that excludes 0% change.
| Predictor | % Change | 95% CI Lower | 95% CI Upper | p‑value |
|---|---|---|---|---|
| Provider Ratio | 1.18 | 0.75 | 1.61 | 0 |
| Loneliness (0–1 Scale) | 3.54 | 2.99 | 4.10 | 0 |
| Lack of Social Support (0–1 Scale) | 0.92 | 0.34 | 1.51 | 0 |
| Suicide Rate | 1.78 | 1.38 | 2.19 | 0 |
Machine‑learning models for prediction
Since we had a large sample, we decided to implement some machine learning to create predictive models. We selected Random Forest and XGBoost and evaluated them using RMSE. The lower the RMSE, the more accurate the predictions.
Since we had a large sample, we decided to implement some machine learning to create predictive models. We selected Random Forest and XGBoost and evaluated them use RMSE. The lower the RMSE is the better the model is. We also used % increase in RMSE to evaluate feature importance. The greater the % increase is, the greater the importance it has in the model.
Results
After analyzing findings from the previous plot, which found that Alaskan counties (or boroughs/census areas) seem to have significantly higher suicide rates than other US counties, we did a more in-depth analysis with this scatterplot filtering for only Alaskan boroughs/census areas using two other key variables: poor mental health days and lack of social & emotional support (both raw value counts). We also color coded by region type, differentiating rural from urban regions.
The linear regression line captures the overall trend, where rural areas tend to fall above the trend line, telling us that for a given level of low social support, Alaskans in rural regions may experience more poor mental health days than expected. In contrast, urban areas clustering lower on both the x and y suggests that there may be disparities in mental health outcomes between Alaska’s urban and rural regions. This secondary analysis highlights the social determinants of health and considers social support to be a more critical factor than we previously thought.
| Model | MSE | RMSE |
|---|---|---|
| Random Forest | 0.212 | 0.460 |
| XGBoost | 0.196 | 0.443 |
Recommendations
Our analysis found no significant relationship between the number of mental health providers in a county and the amount of self-reported poor mental health days, suggesting that increasing the amount of providers in U.S. counties will not be enough to meaningfully reduce mental health challenges. We found this to be especially important to consider for rural counties, where many of their alarming trends stem not just from low provider headcounts but from gaps in social factors like isolation and loneliness. Based on these conclusions, we recommend that UHG/Optum consider implementing county-informed mental health care programs with a goal of prioritizing connectedness and accessibility, especially in rural regions. A sample program could include the following components:
Partner with particular state and local governments to improve broadband access (subsidized mobile hotspots, data plans, etc.). Many rural communities have the opportunity to utilize telehealth platforms but lack the internet infrastructure to support it.
Invest in virtual therapy platforms that can connect individuals to licensed mental health providers. In-person care is limited or sometimes unavailable in rural regions for a variety of reasons, making this method of care critical for many in these communities.
Develop community peer-support networks, to be paired with virtual therapy. Our findings show that connectedness and social support are some of the strongest predictors of a good mental state. Community leaders and members could stand to provide social and emotional support in meetings and organized events, virtually or in-person.
Our recommendations shift the focus from solving mental health problems with only clinical interventions to a community-based, social approach. Planning investments and developments in these areas keeps the focus and support on the individual. This gives UHG/Optum the opportunity to better support these notably underserved areas and make a real impact, where many traditional, in-person models have not been enough.
Discussion
Research Summary
Our analysis found no strong correlation between mental health provider density and number of poor mental health days, challenging the common assumption that increasing the number of clinical providers will lead to better mental health outcomes overall. This told us that there are other variables playing a bigger role here. Among the group tested, feelings of loneliness and lack of social and emotional support led as the strongest predictors of poor mental health days. Taking a further step with our findings, we discovered an alarming pattern in Alaskan counties, pointing to disparities in mental health outcomes between their urban and rural regions. This new view reinforced the importance of the social determinants of health and helped direct our discussions and recommendations. When comparing predictive models, we found XGBoost to be a better predictive model, as it captured the non-linear relationships between all our variables better. This suggests that more advanced machine learning methods could offer a better understanding of complex subjects like mental health.
Limitations
A main limitation of our work is our use of aggregated, county-level data, so variation on an individual/personal level is unknown. Population-level associations might not accurately reflect what happens at the individual level, making it harder to design effective care strategies. Our dataset lacked variables on broader social determinants, as we mainly used lack of social and emotional support to draw conclusions on social implications. Variables like housing/food insecurity and trauma exposure were not included and because of that, we were not able to fully interpret the true care accessibility issues going on, on the national and county levels. Lacking mental health provider metrics also made our analysis of clinical variables slightly weaker. We were not able to assess the role of insurance rates in mental health outcomes, which could possibly direct future work into another key area as well.
Next Steps
For future work, we believe using geographical factors, behavioral health indicators, and mental health provider metrics will be useful. Considering social (race, gender) and economic (income, insurance rates) factors could make for an even more valuable and insightful analysis. Exploring longitudinal data trends would also be useful when adopting machine learning for this type of analysis, that way patterns can be localized confidently in particular areas of interest. Future studies should seek individual-level data as well, to interpret the importance of personal/life experience and risk factors in order to account for how perceived support, discrimination, and stigma can affect mental health outcomes. Listening to individual perspectives and narratives allows for more accurate/empathetic understanding of disparities in mental health outcomes across the nation. Taking these future steps would open a door to even stronger clinical/behavioral health recommendations, all with the end goal of improving mental health outcomes for all people.
Appendix
Another way we evaluated feature importance was using a gain based vs. permutation based approach. In the gain based method (shown in green), which measures how much each feature contributes to reducing error in the tree-building process, loneliness, social and emotional support, and crude suicide rate emerged as the most influential features. On the other hand, the permutation based method (shown in blue), which assesses how randomizing each feature affects model performance (% increase in MSE), indicated that social and emotional support and loneliness still had importance, but population and raw suicide counts also showed higher relevance than in the gain based metric.
The consistent presence of social support and loneliness across both metrics reinforces their key role in mental health related outcomes.