Publication, Part of Dentists' Working Patterns, Motivation and Morale
Dentists' Working Patterns, Motivation and Morale - 2022/23
Annex E – Regression Analysis and Key Assumptions
Multiple Linear Regression Analysis
Linear regression analysis attempts to model the relationship between a dependent variable y (in the case of this report, the ‘motivation index’) and an independent (explanatory) variable denoted x (such as average weekly hours or NHS/HS share). Multiple linear regression considers more than one explanatory variable at a time (x1, x2, x3…) and quantifies the strength of relationships between them (recorded as parameter estimates) and the dependent variable y.
Unlike other results in the report, the multiple linear regression analysis (and logistic regression provided in the accompanying Multivariate Analysis csv file) is not weighted. Normally in regression analysis it is assumed that the standard deviation of the error term is constant over all values of the explanatory variables. Weighting allows certain observations to be assigned more weight in the regression analysis (thereby having more influence on the calculated parameter estimates) because it is believed that they are more accurate. Without weighting all the observations are treated equally, which is the preferred option for this report.
The ‘goodness of fit’ of a multiple linear regression model can be assessed by considering the adjusted R-squared value produced in the analytical output. The adjusted R-squared value measures the proportion of the variation in the dependent variable accounted for by the explanatory variables. The values of adjusted R-squared range from 0 to 1, with a value of 1 indicating that the regression line perfectly fits the data. The results in the report are generally lower than 0.2, which would traditionally be seen as low in many objective prediction models. However, in prediction models used in psychology, the R-squared values tend to be lower. This is because the goal is to determine which variables are statistically significant and how they relate to changes in the response variable, meaning that the adjusted R-squared value becomes less important.
Assumption Testing of Linear Regression Models[1]
There are four principal assumptions supporting the use of linear regression models for purposes of inference:
- Linearity and additivity of the relationship between the dependent and independent variables
- Statistical independence of errors
- Constant variance of the errors
- Normality of the error distribution
Violations of linearity or additivity are extremely serious: if a linear model is fitted to data which are non-linearly or non-additivity related, then the predictions are likely to be in serious error. In multiple regression models, non-linearity or non-additivity may be tested by systematic patterns in plots of the residuals versus individual explanatory variables. The points should be symmetrically distributed around the horizontal line with a roughly constant variance. In terms of statistical independence of errors, the residuals should be randomly and symmetrically distributed around zero under all conditions. To satisfy constant variance of errors, the residuals should not get systematically larger in any one direction of the explanatory variables by a significant amount.
Violations of normality create problems for determining whether model coefficients are significantly different from zero and for calculating confidence intervals for forecasts. Sometimes the error distribution is "skewed" by the presence of a few large outliers. Since parameter estimation is based on the minimization of squared error, a few extreme observations can exert a disproportionate influence on parameter estimates. Calculation of confidence intervals and various significance tests for coefficients are all based on the assumptions of normally distributed errors. If the error distribution is significantly non-normal, confidence intervals may be too wide or too narrow.
One of the best tests for normally distributed errors is a normal quantile plot of the residuals. This is a plot of the fractiles of error distribution versus the fractiles of a normal distribution having the same mean and variance. If the distribution is normal, the points on such a plot should fall close to the diagonal reference line.
[1] The majority of the material used in this section was taken from ‘Statistical forecasting: notes on regression and time series analysis’ website, Faqua School of Business, Duke University: http://people.duke.edu/~rnau/testing.htm
Detailed Regression Results and Assumption Testing
As discussed in Annex D, several analytical assumptions have been made to investigate motivation in dentists. Many of these assumptions relate to the calculation of the average ‘motivation index’ and its modelling by linear regression. This Annex gives the linear regression results for 2022/23 for England in more detail and tests the assumptions around the use of the multivariate analysis for the ‘motivation index’.
England
Table E1 shows multiple linear regression results by dental type for the average ‘motivation index’. This table also shows the parameter estimate results together with the p-value for each and the adjusted R-squared results. The intercept represents value of the ‘motivation index’ when all explanatory variables are zero.
Table E1: Parameter estimates and p-values for ‘motivation index’ by dental type using multiple linear regression, England, 2022/23
Dental Type | Sample | Parameter Estimate | Intercept | Weekly Hours | NHS% | Clinical% | Leave | Age | Adjusted R2 |
Providing-Performer | 481 | Estimate |
43.789 |
-0.126 | -0.244 | 0.049 | 1.825 | 0.277 | 0.190 |
P-value | <.0001 | 0.122 | <.0001 | 0.384 | <.0001 | 0.007 | |||
Associate | 1,203 | Estimate | 57.355 | 0.061 | -0.205 | -0.003 | 0.604 | 0.005 | 0.084 |
P-value | <.0001 | 0.334 | <.0001 | 0.943 | 0.039 | 0.925 |
The positive or negative values of the variable results indicate the relationship between each variable and the average ‘motivation index’. For both sets of dentists an increase in annual leave correlates with an increase in motivation, whereas increases in NHS share (%) have the opposite effect and lower motivation.
This type of analysis allows the individual relationships between each measured variable and the ‘motivation index’ to be assessed. For example, the statistical model predicts that if all other working patterns remained unchanged, but dentists switched from all private to entirely NHS work (from 0% to 100% NHS share) the ‘motivation index’ of Providing-Performers would decrease by 24.4 percentage points and by 20.5 for Associate dentists.
Whilst the weekly hours of work results are not significant for either Providing-Performer or Associate dentists, removing part-time dentists (<35 weekly hours) from the analysis changes the coefficient scores from -0.13 to -0.36 for Providing-Performers and from 0.06 to -0.61 for Associates with both results becoming significant as shown in table E2 below. This suggest that choosing to work part-time limits the negative relationship between motivation and weekly hours of work, which may be one reason that some dentists make this work-life choice.
Finally, it is important to note that whilst regression analysis provides evidence for the existence of relationships between variables, it does not provide measures of causality. In other words, although there may well be a relationship between, for example, weekly hours of work and motivation, it is not possible to determine if longer working hours demotivate staff or whether demotivated staff tend to work longer hours.
Based on the results it is possible to predict the ‘motivation index’ for both dental types. For example, for Providing-Performers this is given as:
43.789 - (0.126 x WH) - (0.244 x NHS%) + (0.049 x Clinical%) + (1.825 x Leave) - (0.277 x Age)
Residual plots for Providing-Performers are shown in Figure E1.
Figure E1: Providing-Performer dentists, plot of residuals for average ‘motivation index’ versus individual independent variables, England, 2022/23
Overall, the residual plots support the first three assumptions of multiple linear regression for average ‘motivation index.
Figure E2 shows the normal quantile plot of the residuals for Providing-Performers.
Figure E2: Providing-Performer dentists, normal quantile plot of residuals for average ‘motivation index’, England, 2022/23
The result is a linear pattern plot with no significant departure from normality.
Similar plots to the above are shown for Associate dentists in Figures E3 and E4.
Figure E3: Associate dentists, plot of residuals for average ‘motivation index’ versus individual independent variables, England, 2022/23
Figure E4: Associate dentists, normal quantile plot of residuals for average ‘motivation index’, England, 2022/23
The residual plots for Associate dentists appear to support the first three assumptions for multiple linear regression of the average ‘motivation index’ as does the almost linear quantile plot.
Different Dental Populations
Table E2 compares multivariate results for the average ‘motivation index’ for Providing-Performer and Associate dentists. The first rows ('All') repeat results from table E1 above, with subsequent rows showing results for different cohorts of dentists. This table also shows the parameter estimate results together with the p-value for each and the adjusted R-squared results.
Table E2: Parameter estimates and p-values for ‘motivation index’ by dental type and split populations using multiple linear regression, England, 2022/23
Dental Type | Population | Sample | Parameter Estimate | Intercept | Weekly Hours | NHS% | Clinical% | Leave | Age | Adjusted R2 |
Providing-Performer | All | 481 | Estimate | 43.789 | -0.126 | -0.244 | 0.049 | 1.825 | 0.277 | 0.190 |
P-value | <.0001 | 0.122 | <.0001 | 0.384 | <.0001 | 0.007 | ||||
<35 Hours | 120 | Estimate | 7.046 | 0.048 | -0.126 | 0.069 | 2.611 | 0.637 | 0.190 | |
P-value | 0.650 | 0.872 | 0.030 | 0.492 | 0.010 | 0.001 | ||||
≥35 Hours | 361 | Estimate | 68.353 | -0.359 | -0.283 | 0.005 | 1.504 | 0.156 | 0.210 | |
P-value | <.0001 | 0.005 | <.0001 | 0.945 | 0.011 | 0.197 | ||||
Mainly NHS | 229 | Estimate | 48.611 | -0.502 | 0.068 | -0.121 | 0.990 | 0.285 | 0.097 | |
P-value | 0.014 | <.0001 | 0.697 | 0.166 | 0.169 | 0.056 | ||||
Mainly Private | 127 | Estimate | 29.391 | 0.314 | -0.639 | 0.094 | 1.082 | 0.348 | 0.077 | |
P-value | 0.066 | 0.043 | 0.007 | 0.360 | 0.248 | 0.059 | ||||
Male | 344 | Estimate | 42.933 | -0.068 | -0.295 | 0.096 | 1.347 | 0.249 | 0.206 | |
P-value | <.0001 | 0.513 | <.0001 | 0.202 | 0.028 | 0.042 | ||||
Female | 137 | Estimate | 19.412 | -0.117 | -0.124 | 0.035 | 3.237 | 0.556 | 0.192 | |
P-value | 0.208 | 0.363 | 0.016 | 0.660 | <.0001 | 0.005 | ||||
Associate | All | 1,203 | Estimate | 57.355 | 0.061 | -0.205 | -0.003 | 0.604 | 0.005 | 0.084 |
P-value | <.0001 | 0.334 | <.0001 | 0.943 | 0.039 | 0.925 | ||||
<35 Hours | 573 | Estimate | 54.133 | 0.099 | -0.206 | -0.004 | 1.015 | -0.015 | 0.097 | |
P-value | <.0001 | 0.446 | <.0001 | 0.951 | 0.011 | 0.854 | ||||
≥35 Hours | 630 | Estimate | 87.378 | -0.613 | -0.204 | -0.022 | 0.126 | 0.054 | 0.097 | |
P-value | <.0001 | 0.000 | <.0001 | 0.715 | 0.769 | 0.486 | ||||
Mainly NHS | 760 | Estimate | 44.297 | 0.020 | -0.041 | -0.001 | 0.851 | -0.022 | 0.001 | |
P-value | <.0001 | 0.811 | 0.668 | 0.982 | 0.023 | 0.758 | ||||
Mainly Private | 174 | Estimate | 69.034 | 0.074 | -0.856 | 0.127 | -0.360 | -0.177 | 0.091 | |
P-value | <.0001 | 0.600 | <.0001 | 0.316 | 0.544 | 0.267 | ||||
Male | 550 | Estimate | 54.829 | 0.115 | -0.248 | -0.009 | 0.279 | 0.099 | 0.133 | |
P-value | <.0001 | 0.235 | <.0001 | 0.901 | 0.513 | 0.228 | ||||
Female | 653 | Estimate | 57.180 | 0.041 | -0.167 | 0.004 | 0.741 | -0.056 | 0.041 | |
P-value | <.0001 | 0.643 | <.0001 | 0.941 | 0.071 | 0.495 |
There are some interesting results when dentists are split into different cohorts. For example, both Providing-Performer and Associate dentists who work full-time (≥35 hours) exhibit a statistically significant negative relationship between ‘motivation index’ and weekly hours of work, whilst their part-time colleagues (<35 hours) do not[2].
It is also noticeable, that unlike any other variable, NHS share (%) exhibits a statistically significant and negative relationship with ‘motivation index’ for the majority of cohorts of dentists.
[2] More detailed working pattern results are presented in a separate interactive report which accompanies this document and can be found at: https://digital.nhs.uk/data-and-information/publications/statistical/dental-working-hours/2022-23-working-patterns-motivation-and-morale
Last edited: 25 April 2024 9:23 am