Lagged dependent variables (LDVs) have been used in regression analysis to provide robust estimates of the effects of independent variables, but some research argues that using LDVs in regressions produces… Click to show full abstract
Lagged dependent variables (LDVs) have been used in regression analysis to provide robust estimates of the effects of independent variables, but some research argues that using LDVs in regressions produces negatively biased coefficient estimates, even if the LDV is part of the data-generating process. I demonstrate that these concerns are easily resolved by specifying a regression model that accounts for autocorrelation in the error term. This actually implies that more LDV and lagged independent variables should be included in the specification, not fewer. Including the additional lags yields more accurate parameter estimates, which I demonstrate using the same data-generating process scholars had previously used to argue against including LDVs. I use Monte Carlo simulations to show that this specification returns much more accurate coefficient estimates for independent variables (across a wide range of parameter values) than alternatives considered in earlier research. The simulation results also indicate that improper exclusion of LDVs can lead to severe bias in coefficient estimates. While no panacea, scholars should continue to confidently include LDVs as part of a robust estimation strategy.
               
Click one of the above tabs to view related content.