Skip Navigation HRSA - U.S Department of Health and Human Services, Health Resources and Service Administration U.S. Department of Health & Human Services
Home
Questions
Order Publications
 
Grants Find Help Service Delivery Data Health Care Concerns About HRSA
Toward a Method for Identifying Facilities and Communities with Shortages of Nurses, Summary Report
 
Models and Analyses Based on Facility Data

All of the analyses using facility data are based on North Carolina (NC) and North Dakota (ND). These datasets included a number of possible measures of shortage that could be used as dependent variables:

Effects of Nursing Shortage on Facility Operations. The surveys asked respondents an open-ended question about how nursing shortages have affected the operations of their facility. Responses were then coded into five broad categories: labor cost increase, reduced services, strain on staff, patient care problems, and organizational disturbance. More detailed codes within categories were also given (e.g., labor cost increase included breakouts for increases in agency use, recruitment costs, overtime, wages, retention expenses, development of float pools, and orientation expenses). This was an interesting variable because of in-depth discussions in the first advisory panel meeting about how the true measure of a nursing shortage should be related to patient care and facility operations. Although subjective, this variable touches on those issues. Caution was warranted, however, because the question asked about nursing shortage generally, and respondents may have answered the question thinking about LPNs as well as RNs, particularly if they were from a setting that relies heavily on LPNs (e.g., long-term care). Nonetheless, this variable was used as the dependent variable in a series of preliminary ordinary least squares (OLS) regressions.

RN Vacancy Rates. Both the NC and ND datasets included RN vacancy rates. Many facilities, however, had vacancy rates of 0, which limited the variation in the variable. Interestingly, there was very little correlation between RN vacancy rates and the number of reported effects of the nursing shortage, which was cause to question the utility of the consequences variable given its subjectivity. Vacancy rates were also used as the dependent variable in OLS regressions.

RN Turnover Rates. Turnover rates were not used in any of the in-depth analyses. In the first set of advisory panel meetings, the panelists pointed out that facilities that had a genuinely limited supply of RNs to draw from should be separated from facilities in which poor management led to large numbers of departures. Turnover can certainly reflect limited supply, but also seems likely to reflect problems of organizational culture, particularly in facilities that had low vacancy rates but high turnover (meaning that they had no trouble recruiting RNs, but had trouble retaining them.)

Time to Recruit RNs. Both datasets contained information on the average number of weeks reported to fill RN vacancies. Although theoretically a good indicator of shortage, the large amount of missing data for this variable ruled it out for practical reasons.

Difficulty Recruiting RNs. This ordinal variable was used in a series of ordered probit models conducted as part of the study. The variable used a five-point Likert scale with categories: Very Difficult, Difficult, Neutral, Easy, and Very Easy. Figure 6, which summarizes the responses for North Carolina, shows that somewhat more facilities reported difficulty than ease in recruiting RNs in 2004.

Figure 7 shows that only 4.6% of hospitals in North Carolina reported that recruiting RNs was either difficult or very difficult. The percentages were higher for home health agencies (15.2%), long-term care facilities (21.1%), and public health agencies (26.4%).

Figure 6. Facilities in North Carolina Reporting Different Levels of Difficulty Recruiting Nurses, 2004

[D]

Figure 7. Percentage of Facilities in NC Reporting That Recruiting Nurses Was Either "Difficult" or "Very Difficult", 2004

[D]

A. Preliminary Ordinary Least Squares (OLS) Regressions

OLS regression equations were estimated to predict and explain the number of adverse consequences and vacancy rates in all four types of facilities in North Carolina. First the models were estimated with both facility- and county-level explanatory variables, which was the ideal model. In recognition of the fact that facility-level variables were not available in most states, an abbreviated model using only county-level data was estimated for each facility type as well. The results for the models in which adverse consequences were the dependent variables are shown in Tables 2 through 6.

The results of these models were not particularly satisfying. Relatively few variables were strongly correlated to adverse consequences, and the explanatory power of the models (as measured by the R2 statistic) was generally low. Although there were some statistically significant explanatory (independent) variables in the models for both predicted consequences and vacancy rates, the models explained only a relatively small percentage of the variation in the dependent variables. The explanatory power was even smaller when the facility-level variables (which would not be available outside of NC and ND without new data collection) were removed from the models, and only community variables were used.

The conclusion based on these models is that the variables collected by North Carolina were not adequate to accurately predict and explain either adverse consequences or vacancy rates.

Table 2. Coefficients for Full and Abbreviated OLS Regression Models to Predict Number of Adverse Effects of Nursing Shortages in Hospitals in NC

Explanatory (Independent) Variable
Unstandardized Coefficient
Full Model
Abbreviated Model
Standardized Coefficient
t
p Value
Unstandardized Coefficient
Standardized Coefficient
t
p Value
B
Std Err
B
Std Err
Constant
-0.683
3.02
-
-0.226
0.822
1.295
2.374
-
0.546
0.587
RNs per 100,000 Adjusted Need
-0.004
0.002
-0.353
-2.002
0.052
-0.001
0.001
-0.132
-0.880
0.382
RN Salary to Average Salary
0.518
0.707
0.132
0.732
0.468
0.281
0.582
0.081
0.482
0.631
# Nursing/Personal Care Facilities
0.032
0.015
0.663
2.176
0.035
0.023
0.012
0.494
1.905
0.061
% Population Below Poverty, 2000
0.078
0.065
0.308
1.202
0.236
0.033
0.053
0.136
0.622
0.536
RNs per Hospital Bed
0.265
0.445
0.082
0.596
0.555
0.044
0.402
0.013
0.108
0.914
Hours of Agency RNs
0.002
0.043
0.008
0.058
0.954
-
-
-
-
-
Hours of RN Overtime
-0.001
0.016
-0.007
-0.052
0.959
-
-
-
-
-
RN Vacancy Rate
0.032
0.032
0.142
0.985
0.330
-
-
-
-
-
RN Turnover Rate
0.011
0.021
0.077
0.505
0.616
-
-
-
-
-
Persons per Square Mile (natural log)
0.156
0.358
0.146
0.436
0.665
-0.158
0.271
-0.159
-0.582
0.563
# Short-term Community Hospitals, '01
-0.359
0.134
-0.610
-2.69
0.010
-0.227
0.109
-0.414
-2.076
0.041
RN Students per 100K Adjusted Need
-0.010
0.004
-0.392
-2.828
0.007
-0.005
0.003
-0.226
-1.967
0.053
% Population White Non-Hispanic, 2004
-0.011
0.012
-0.167
-0.902
0.372
-0.005
0.010
-0.086
-0.511
0.611

Full model R2 = 0.429 Abbreviated model R2 = 0.177

Table 3. Coefficients for Full OLS Regression Model to Predict RN Vacancy Rates in Nursing Homes in NC

Independent Variables
Unstandardized Coefficients
Standardized Coefficients
t
p Value
B
Std. Error
Beta
(Constant)
-15.65
18.185
-
-0.861
0.392
RNs per 100,000 Adjusted Need
0.032
0.022
0.234
1.444
0.152
RN Salary to Average Salary
13.83
6.945
0.316
1.992
0.049
# Nursing/Personal Care Facilities
-0.215
0.127
-0.320
-1.687
0.095
% Population Below Poverty, 2000
-0.939
0.460
-0.276
-2.039
0.044
RNs per Hospital Bed
-9.236
5.976
-0.161
-1.545
0.126
Hours of Agency RNs
-0.281
0.165
-0.182
-1.704
0.092
Hours of RN Overtime
0.138
0.114
0.116
1.214
0.228
RN Turnover Rate
0.027
0.026
0.117
1.063
0.291
Persons per Square Mile (natural log)
1.824
2.768
0.120
0.659
0.512
# Short-Term Community Hospitals, "01
0.840
1.257
0.104
0.669
0.506
LPN Vacancy Rate
0.356
0.083
0.401
4.287
0.000
LPNs per 100,000 Adjusted Need
-0.080
0.108
-0.090
-0.740
0.461
LPNs per RN
1.126
0.402
0.257
2.801
0.006
LPN Turnover Rate
0.050
0.040
0.128
1.274
0.206

R2 = 0.35

Table 4. Coefficients for Full OLS Regression Model to Predict Number of Adverse Effects of Nursing Shortages in Home Health Agencies in NC

Independent Variable
Unstandardized Coefficients
Standardized Coefficients
t
p Value
B
Std Err
Beta
(Constant)
2.270
2.216
-
1.024
0.310
RNs per 100,000 Adjusted Need
0.0022
0.002
0.214
1.412
0.163
RN salary to Average Salary
1.570
0.607
0.480
2.587
0.012
# Nursing/Personal Care Facilities
0.014
0.013
0.255
1.137
0.260
% Population Below Poverty, 2000
-0.118
0.052
-0.519
-2.266
0.027
RNs per Hospital Bed
-0.200
0.337
-0.062
-0.594
0.555
Hours of Agency RNs
0.046
0.022
0.232
2.069
0.043
Hours of RN overtime
-0.011
0.030
-0.041
-0.369
0.713
RN Vacancy Rate
0.024
0.008
0.374
3.078
0.003
RN Turnover Rate
0.0069
0.003
0.265
2.339
0.023
Persons per Square Mile (natural log)
-0.436
0.290
-0.392
-1.502
0.139
# Short-Term Community Hospitals, "01
-0.020
0.116
-0.027
-0.170
0.865
RN Students per 100K Adjusted Need
-0.00088
0.001
-0.202
-1.605
0.114
% Population White Non-Hispanic, 2004
-0.0136
0.010
-0.230
-1.340
0.185

R2 = 0.44

Table 5. Coefficients for Full OLS Regression Model to Predict Number of Adverse Effects of Nursing Shortages in Public Health Agencies in NC

Independent Variable
Unstandardized Coefficients
Standardized Coefficients
t
p Value
B
Std. Error
Beta
(Constant)
2.183
2.839
-
0.769
0.447
RNs per 100,000 Adjusted Need
-0.0013
0.002
-0.123
-0.639
0.527
RN Salary to Average Salary
0.408
0.864
0.088
0.473
0.639
# Nursing/Personal Care Facilities
0.017
0.034
0.118
0.517
0.608
% Population Below Poverty, 2000
-0.066
0.056
-0.276
-1.176
0.247
RNs per Hospital Bed
0.578
0.619
0.159
0.934
0.356
Hours of Agency RNs
0.0386
0.075
0.080
0.516
0.609
Hours of RN Overtime
0.0905
0.057
0.227
1.585
0.121
RN Vacancy Rate
0.0282
0.014
0.353
1.979
0.055
RN Turnover Rate
0.0041
0.007
0.088
0.555
0.582
Persons per Square mile (natural log)
0.190
0.353
0.162
0.537
0.594
# Short-Term Community Hospitals 2001
-0.352
0.287
-0.250
-1.228
0.227
RN Students per 100K Adjusted Need
-0.0015
0.001
-0.409
-2.321
0.026
% Population White Non-Hispanic, "04
-0.024
0.011
-0.404
-2.179
0.036

R2 = 0.34

Table 6. Coefficients for Abbreviated OLS Regression Model to Predict Number of Adverse Effects of Nursing Shortages in Public Health Agencies in NC

Independent Variable
Unstandardized Coefficients
Standardized Coefficients
t
p Value
B
Std. Error
Beta
(Constant)
3.607
2.172
-
1.661
0.102
RNs per 100,000 Adjusted Need
-0.00085
0.002
-0.074
-0.405
0.687
RN Salary to Average Salary
0.571
0.612
0.146
0.932
0.355
# Nursing/Personal Care Facilities 2000
0.037
0.030
0.400
1.236
0.221
Percent of Population Below Poverty, 2000
-0.086
0.051
-0.338
-1.684
0.098
Ratio of RNs to Beds
0.365
0.444
0.116
0.822
0.415
Ln Population Density
-0.084
0.262
-0.072
-0.321
0.750
# Short-Term Community Hospitals "01
-0.430
0.174
-0.441
-2.468
0.017
RN Students per 100,000 Adjusted Need
-0.00087
0.001
-0.203
-1.675
0.099
Number of Hospital Beds
0.00033
0.001
0.124
0.360
0.720
Percent White Non-Hispanic, 2004
-0.0246
0.010
-0.412
-2.525
0.014

R2 = 0.30

B. Ordered Probit Models

The next set of models estimated for North Carolina used the dependent variable of difficulty recruiting RNs. Although this variable was not available for RNs overall, facilities in North Carolina did rate RN recruiting difficulty on a scale of one to five for several types of RNs in several types of units (e.g., staff RNs in ICUs, nurse managers in ob/gyn floors, etc.). To translate this set of ratings into a single summary variable, a median value was calculated for all the positions that each facility had provided. Although few facilities had valid values for all of the different categories of hires because they had not recruited for particular positions in the past year, the median did provide an estimate of the overall difficulty.

A series of ordered probit models were estimated to predict and explain variations in this new median self-reported difficulty in recruiting RNs. Coefficients for the different explanatory and independent variables were estimated for the four facility types both separately and together (to predict recruiting difficulty relative to facilities of their own type and relative to all facilities). The combined model is shown in Table 7 below; the facility-specific models are available in the technical report Methods for Identifying Facilities and Communities with Shortages of Nurses.

These models showed promise in explaining difficulty recruiting RNs. Nonetheless, the models were dependent upon a number of facility-level variables, and it was not clear whether a subjective assessment of the difficult recruiting was an adequate basis for rating nursing shortages in facilities.

Table 7. Coefficient Estimates for the Ordered Probit Nursing Shortage Model Based on All Facilities in NC

Variable
Hospital
Home Health
Long-Term Care
Public Health
Coeff
p
Coeff
p
Coeff
p
Coeff
p
Demographic Variables
Dummy for metropolitan area
-0.343
0.323
 
 
-0.750
0.016
-0.474
0.289
Proportion of population < 5 years
 
 
 
 
-7.032
0.009
 
 
Proportion of population age 20 - 65 years
 
 
25.836
0.001
 
 
 
 
Proportion of population > 65 years
 
 
8.543
0.145
-20.231
0.001
27.654
0.001
Proportion of White population
 
 
 
 
 
 
-59.011
0.005
Proportion of Black population
 
 
2.270
0.121
 
 
-50.752
0.014
Proportion of Hispanic population
 
 
1.207
0.039
-1.844
0.000
-4.511
0.033
Proportion of AIAN population
1.202
0.150
 
 
0.586
0.020
 
 
Income per capital ($10,000)
0.692
0.099
 
 
-0.593
0.296
-2.144
0.066
Percentage of population in poverty
 
 
-0.232
0.004
-0.110
0.099
-0.262
0.014
Proportion of population using Medicare
 
 
 
 
1.5818
0.040
 
 
Proportion of population using Medicaid
 
 
 
 
 
 
2.177
0.052
Nursing Variables
# of RNs per 100 individuals
 
 
 
 
-1.103
0.009
 
 
# of Med Records & Health Info Techs per 1,000 individuals
 
 
 
 
1.942
0.008
 
 
# of hospitals per 10,000 individuals
 
 
2.242
0.039
 
 
-4.656
0.000
# of Hospices per 10,000 individuals
-1.035
0.454
0.696
0.450
 
 
2.457
0.048
Dummy for county having hospital with nursing school
-1.210
0.061
 
 
0.399
0.427
2.457
0.048
# of hospital full time personals per 10 individuals
1.176
0.469
 
 
-2.89
0.101
 
 
# of nursing home full time personals per 1,000 individuals
 
 
-0.550
0.038
 
 
 
 
Ratio of average RN salary to median income
 
 
2.530
0.010
-1.877
0.018
-4.023
0.004
Facility Variables
Facilty type
-5.384
0.078
-22.06
<0.0005
9.801
0.022
63.513
0.001
Total number of budgeted RN positions
-0.130
0.092
-1.946
0.121
1.834
0.438
-2.491
0.012
RN vacancy rate
1.936
0.046
50.736
>0.0005
35.816
0.010
 
 
Total number of budgeted LPN positions
-0.854
0.115
 
 
 
 
 
 
LPN vacancy rate
 
 
 
 
14.321
 
0.114
 
RN turnover rate
1.729
0.322
 
 
0.1987
0.291
6.396
0.005
Recruiting Difficulty Thresholds
Very easy (1) to recruit if score
< -5.494
Easy (2) to recruit if score
< -4.429
Not difficult (3) to recruit if score
< -3.348
Difficult (4) to recruit if score
< -2.159
Very difficult (5) to recruit if score
> -2.159

McKelvey-Zavoina R2 = 0.71

C. Validation of North Carolina Results

To address some of the questions regarding the adequacy of the “difficulty recruiting” variable, project staff attempted to validate the reported difficulty with a series of follow-up calls to those facilities that reported the most and least difficulty recruiting RNs. A list of the ID codes for the top- and bottom-ranked facilities was given to the North Carolina Center for Nursing, which provided contact information for those facilities without linking them to the identifiers in order to preserve the confidentiality of the data provided on the original survey. Consequently, the validation process was partially “blind,” with no one involved in the validation knowing whether the facility had original reported a very high or a very low level of recruiting difficulty. The interviewer asked for a retrospective evaluation of difficulty recruiting RNs in 2004 (the data year used in the analysis), and to control for the possibility that people would provide retrospective data based on the current situation, an assessment of the current difficulty recruiting RNs was also obtained. Results were then sent back to the North Carolina Center for Nursing, where names of facilities were stripped and original survey identifiers were reattached in order to compare original with retrospective responses.

The rank order correlation between the original data reported in 2004 and retrospective data obtained through the validation process was only 0.347, an indication that difficulty recruiting RNs was a less than ideal measure of shortage, even though the correlation was greater than would be expected by random chance (p = 0.016). Not only was the difficulty recruiting in 2004 from the interviews not highly correlated with current difficulty, but it also was not highly correlated with the original assessments made in 2004. Because this process provided only marginal support for the validity of this dependent variable, it was decided that subjective indicators of shortage were likely to be too highly influenced by personal judgments and biases of the person completing the survey (e.g., overall disposition, momentary mood) to justify using them as the basis for a shortage assessment and rating process.

D. Apply North Carolina Ordered Probit Coefficients in North Dakota

Another attempt to validate the models to use the North Carolina data on recruiting difficulty involved an attempt to apply these models to another state. The coefficients from the ordered probit models were applied to comparable data from North Dakota to compare predicted to actual reported recruiting difficulty. The coefficients from the North Carolina models proved to be poor predictors of reported difficulty in North Dakota.

This raised serious questions about the possibility of using coefficients from one state to predict or estimate the extent of shortages in another state. Although further investigation might reveal that coefficients from one state might be used in another state with similar demographic characteristics, interstate variations in health care and labor market environments seem to preclude nationwide use of a model constructed in only one state.

E. OLS Regressions for Vacancy Rates Using Combined Data from NC and ND

It was hypothesized that the relatively small sample size for models based solely on data from North Carolina might have contributed to the limited number of statistically significant coefficients, and that increasing the number of cases might yield better results. This hypothesis led to a final set of models in the study incorporating facility-level data by the study team and models based on a combined data set from both North Carolina and North Dakota. OLS regression models were estimated to predict vacancy rates at facilities in those two states combined.

The hypothesis, in fact, proved to be true. Models based on the combined dataset (shown below in Tables 8 to 11) highlighted a greater number of statistically significant explanatory variables for RN vacancy rates than models for either state alone. The overall explanatory power of these models remained only moderate, however, with much unexplained variation in vacancy rates. The long-term care model, in particular, had very limited explanatory power (R2 = 0.238; Table 10). Furthermore, these models continued to rely heavily on facility-specific data that would be difficult to obtain at the national level.

Table 8. OLS Coefficient Estimates for RN Vacancy Rates in Hospital Settings for Combined NC & ND Model

Independent Variable

Estimate

Std Err

t-stat

p-value

Intercept

-0.7335

0.3863

-1.899

0.061

Dummy for North Dakota

0.0155

0.0203

0.7619

0.448

Dummy for metropolitan area

0.0239

0.0194

1.2323

0.222

Income per capita ($10,000)

0.0327

0.0270

1.2096

0.230

Proportion of Hispanic population *10

-0.0226

0.0358

-0.629

0.531

Total Medicare inpatient days per Pop

-0.0728

0.0173

-4.219

0.0001

Proportion of population < 5 years *10

0.2115

0.1239

1.7076

0.092

Proportion of population >65 years

0.8423

0.3865

2.1792

0.032

Proportion of population age 20 - 65 years

0.5597

0.4809

1.1639

0.248

# Full time RNs per 100 individuals

0.0828

0.0392

2.1113

0.038

Ratio of average RN salary to median income

0.0739

0.0440

1.6775

0.098

Number of budgeted RN positions

-0.0021

0.0021

-1.008

0.317

RN turnover rate

0.2252

0.0655

3.4395

0.001

LPN vacancy rate

0.1661

0.0523

3.1785

0.002

LPN turnover rate

0.0048

0.0159

0.3003

0.765

R2 = 0.400

Table 9. Coefficient Estimates for RN Vacancy Rates in Home Health Setting for Combined NC & ND Model

Independent Variable

Estimate

Std Err

t-stat

p-value

Intercept

-0.5407

0.2488

-2.174

0.032

Dummy for North Dakota

-0.0811

0.0450

-1.801

0.075

Dummy for county w/ hosp w/ prof nursing schl

0.0945

0.0699

1.3522

0.180

Income per capita ($10,000)

0.0789

0.0329

2.3969

0.018

Proportion of Hispanic population x 10

-0.0966

0.0606

-1.593

0.114

# Hospitals per 10,000 individuals

-0.0240

0.0217

-1.105

0.272

# Med records & health info techs per 1,000 Pop

0.0607

0.0579

1.0491

0.297

Proportion of population < 5 years x 10

0.4565

0.2532

1.8031

0.074

Proportion of population >65 years

1.1646

0.5227

2.2279

0.028

Number of budgeted RN positions

-0.2623

0.1473

-1.781

0.078

RN turnover rate

0.1234

0.0360

3.4259

0.001

LPN vacancy rate

0.1937

0.0687

2.8196

0.006

Number of budgeted LPN positions

0.6455

0.5568

1.1593

0.249

R2 = 0.346

Table 10. Coefficient Estimates for Long-Term Care Setting for Combined NC & ND Model

Independent Variable

Estimate

Std Err

t-stat

p-value

Intercept

0.2447

0.2397

1.0207

0.309

Dummy for North Dakota

0.0392

0.0337

1.1655

0.246

Income per capita ($10,000)

-0.0324

0.0380

-0.8517

0.396

Proportion of Hispanic population *10

0.0567

0.0443

1.2798

0.203

Proportion of population < 5 years *10

-0.2825

0.1789

-1.5792

0.116

Proportion of population >65 years

-0.5939

0.3957

-1.5007

0.136

# Full time RNs per 100 individuals

0.0343

0.0459

0.7470

0.456

Ratio of average RN salary to median income

0.0389

0.0569

0.6834

0.495

Number of budgeted RN positions

-0.1725

0.1351

-1.2765

0.204

RN turnover rate

0.0223

0.0199

1.1191

0.265

LPN vacancy rate

0.3508

0.0754

4.6521

0.000

Number of budgeted LPN positions

0.4843

0.1735

2.7920

0.006

R2 = 0.238

Table 11. Coefficient Estimates for Public Health Setting for Combined NC & ND Model

Independent Variable

Estimate

Std Err

t-stat

p-value

Intercept

0.0755

0.1079

0.6999

0.486

Dummy for North Dakota

-0.0848

0.0288

-2.9455

0.004

Dummy for Cnty w/ Hosp w/ Prof Nursing Schl

0.0622

0.0708

0.8792

0.382

Proportion of AIAN population x 10

0.0652

0.0226

2.8856

0.005

Proportion of black population

0.1231

0.0937

1.3133

0.193

# Hospitals per 10,000 individuals

0.0466

0.0239

1.9535

0.054

# Hospices per 10,000 individuals

-0.0541

0.0293

-1.8447

0.069

Total Medicaid inpatient days per Pop

-0.0910

0.0378

-2.4066

0.018

Proportion of population < 5 years x 10

-0.2130

0.1284

-1.6591

0.101

Percentage of population in poverty

-0.0081

0.0041

-1.9653

0.053

Ratio of mean RN salary to median income

0.1357

0.0428

3.1690

0.002

RN turnover rate

0.0710

0.0421

1.6854

0.096

R2 = 0.389