The mlts
package allows fitting multilevel manifest or
latent time series models and dynamic structural equation models (DSEM),
as described in Asparouhov et al. (2018). It relies on Stan (Stan Development Team, 2023b) and
the rstan
package (Stan Development Team, 2023a) for
Bayesian inference. The package is designed for researchers working with
intensive longitudinal data, e.g., data from ambulatory assessment
studies or experience sampling methods. Models allow for missing data,
unequal numbers of observations across units, unequally spaced
observations, and approximation of continuous time processes. Parameters
can be subjected to restrictions (i.e., fixations or equality
constraints) and latent variables can be incorporated using a
measurement model. For the models hypothesized by the user, a LaTex
formula and path models can be inspected, and model parameters can be
plotted.
You can install the development version of mlts from GitHub with:
# install.packages("devtools")
::install_github("munchfab/mlts") devtools
# load package
library(mlts)
One of the simplest models we can fit with mlts
is a
multilevel first order autoregressive model with only one observed
variable. We start by specifying the model with
mlts_model()
. The argument q
controls the
number of time-series constructs. For this simple model, the following
call is sufficient:
# build a simple autoregressive model
<- mlts_model(q = 1) ar1_model
We can check the parameters present in the model by just calling the object:
ar1_model#> Model Level Type Param
#> 1 Structural Within Fixed effect mu_1
#> 2 Structural Within Fixed effect phi(1)_11
#> 3 Structural Within Fixed effect ln.sigma2_1
#> 4 Structural Between Random effect SD sigma_mu_1
#> 5 Structural Between Random effect SD sigma_phi(1)_11
#> 6 Structural Between Random effect SD sigma_ln.sigma2_1
#> 7 Structural Between RE correlation r_mu_1.phi(1)_11
#> 8 Structural Between RE correlation r_mu_1.ln.sigma2_1
#> 9 Structural Between RE correlation r_phi(1)_11.ln.sigma2_1
#> Param_Label isRandom prior_type prior_location prior_scale
#> 1 Trait 1 normal 0 10.0
#> 2 Dynamic 1 normal 0 2.0
#> 3 Log Innovation Variance 1 normal 0 10.0
#> 4 Trait 0 cauchy 0 2.5
#> 5 Dynamic 0 cauchy 0 2.5
#> 6 Log Innovation Variance 0 cauchy 0 2.5
#> 7 RE Cor 0 LKJ 1 NA
#> 8 RE Cor 0 LKJ 1 NA
#> 9 RE Cor 0 LKJ 1 NA
When mlts_model()
sets up this model, all model
parameters are free (i.e., not constrained) by default. On the within
level, there are three fixed effects: the grand mean (mu_1
)
of the outcome, the autoregressive effect (phi(1)_11
), and
the natural log of the innovation variance (ln.sigma2_1
).
The (1)
in brackets for the autoregressive effect parameter
indicates a lag of first order, and the _11
subscript
denotes that the first construct is as well predicted by the first
construct. Note that for the innovation variance, the natural log is
used to prevent the variance from dropping below zero. For each of these
effects, random effects are also estimated on the between level, which
are drawn from a multivariate normal distribution with zero mean.
Standard deviations of random effects are indicated with a
sigma_
prefix, and random effect correlations are indicated
with an r_
prefix.
A TeX formula for the above model can be obtained by calling the
mlts_model_formula()
function on the model object. By
default, the function produces an RMarkdown file and renders it to a pdf
file using knitr
. However, the TeX file can also be kept by
calling keep_tex = TRUE
within the function call.
mlts_model_formula(ar1_model)
\[ \begin{gathered} \begin{bmatrix} y_{1, it} \\ \end{bmatrix} = \begin{bmatrix} \mu_{1,i} \\ \end{bmatrix} + \begin{bmatrix} y_{1, it}^w \\ \end{bmatrix} \end{gathered} \]
\[ \begin{gathered} \begin{bmatrix} y_{1, it}^w \\ \end{bmatrix} = \begin{bmatrix} \phi_{(1)11,i} \\ \end{bmatrix} \begin{bmatrix} y_{1,i(t - 1)}^w \\ \end{bmatrix} + \begin{bmatrix} \zeta_{1, it} \\ \end{bmatrix} ,\\ \text{with}~\zeta_{1,it} \sim \mathit{N}(0, \sigma^2_{\zeta_{1},i}) \end{gathered} \]
\[ \begin{gathered} \begin{bmatrix} \mu_{1,i}\\ \phi_{(1)11,i}\\ \ln(\sigma^2_{\zeta_{1},i})\\ \end{bmatrix} = \begin{bmatrix} \gamma_{0,\mu_{1}}\\ \gamma_{0,\phi_{(1)11}}\\ \gamma_{0,\ln(\sigma^2_{\zeta_{1}})}\\ \end{bmatrix} + \begin{bmatrix} \upsilon_{\mu_{1},i}\\ \upsilon_{\phi_{(1)11},i}\\ \upsilon_{\ln(\sigma^2_{\zeta_{1}}),i}\\ \end{bmatrix} ,\\ \text{with}~ \upsilon_{i} \sim \mathit{MVN}(\mathbf{0}, \mathbf{\Omega}) \end{gathered} \]
Furthermore, a path model can also be produced with the function
mlts_model_paths()
. Again, the function produces an
RMarkdown and pdf file by default. A png file for each level can also be
produced by calling add_png = TRUE
and the TeX code for the
path model can be kept with keep_tex = TRUE
.
mlts_model_paths(ar1_model)
To fit the above model, we pass it together with the data set to
mlts_fit()
. The data set for this example is an artificial
data set simulated from an autoregressive model:
head(ar1_data)
#> ID time Y1
#> 1 1 1 1.16
#> 2 1 2 -0.29
#> 3 1 3 0.40
#> 4 1 4 -0.18
#> 5 1 5 -0.66
#> 6 1 6 0.42
We need to specify the variable in data
that contains
the time-series process in the ts
argument and the variable
that contains the unit identifier in the id
argument. With
the argument tinterval
, the time interval for approximation
of a continuous time process can be specified Asparouhov et al. (2018).
We don’t specify it here, but see the Vignette Approximation of a
Continuous Time Model for more details.
<- mlts_fit(
ar1_fit model = ar1_model,
data = ar1_data,
id = "ID",
ts = "Y1",
iter = 4000
)
The model summary()
shows general information about the
model and data:
summary(ar1_fit)
#> Call:
#> mlts_model(q = 1, max_lag = 1)
#> Time series variables as indicated by parameter subscripts:
#> 1 --> Y1
#> Data: 2500 observations in 50 IDs
#> Model convergence criteria:
#> Maximum Potential Scale Reduction Factor (PSR; Rhat): 1.009 (should be < 1.01)
#> Minimum Bulk ESS: 575 (should be > 200, 100 per chain)
#> Minimum Tail ESS: 566 (should be > 200, 100 per chain)
#> Number of divergent transitions: 0 (should be 0)
#>
#> Fixed Effects:
#> Post. Mean Post. Median Post. SD 2.5% 97.5% Rhat Bulk_ESS
#> mu_1 0.731 0.731 0.090 0.552 0.910 1 5357
#> phi(1)_11 0.275 0.276 0.029 0.218 0.331 1 2433
#> ln.sigma2_1 -0.304 -0.304 0.043 -0.392 -0.217 1 2964
#> Tail_ESS
#> 2695
#> 3051
#> 3013
#>
#> Random Effects SDs:
#> Post. Mean Post. Median Post. SD 2.5% 97.5% Rhat Bulk_ESS
#> mu_1 0.600 0.595 0.068 0.481 0.749 1.000 5042
#> phi(1)_11 0.134 0.134 0.032 0.069 0.198 1.009 582
#> ln.sigma2_1 0.231 0.229 0.045 0.147 0.323 1.002 1266
#> Tail_ESS
#> 3082
#> 566
#> 1376
#>
#> Random Effects Correlations:
#> Post. Mean Post. Median Post. SD 2.5% 97.5% Rhat
#> mu_1.phi(1)_11 0.017 0.018 0.213 -0.396 0.435 1.001
#> mu_1.ln.sigma2_1 0.217 0.224 0.187 -0.159 0.569 1.000
#> phi(1)_11.ln.sigma2_1 0.114 0.118 0.260 -0.391 0.607 1.000
#> Bulk_ESS Tail_ESS
#> 2116 2477
#> 2398 2726
#> 1056 1736
#>
#> Samples were drawn using NUTS on Thu Apr 25 12:39:28 2024.
#> For each parameter, Bulk_ESS and Tail_ESS are measures of effective
#> sample size, and Rhat is the potential scale reduction factor
#> on split chains (at convergence, Rhat = 1).
The line
Time series variables as indicated by parameter subscripts: 1 --> Y1
shows that model parameters indexed by a _1
refer to the
variable Y1
in the data set. The
Model convergence criteria
provide an overview across
convergence diagnostics for all model parameters (i.e., also parameters
which are not printed in the summary()
by default). For the
simple AR1-model, all parameters converged well after 4,000
iterations.
The section Fixed Effects
provides information about the
fixed effects in the model, i.e., \(\gamma_{0,
\mu_1}\), \(\gamma_{0,
\phi_{(1)11}}\), and \(\gamma_{0,\ln(\sigma^2_{\zeta_{1}})}\) in
the above formula. For example, the posterior mean of the autoregressive
effect parameter phi(1)_11
is estimated at .275 with
95%-credible interval [.218, .331]. The log variance of the innovations
\(\zeta_{1t}\) is estimated at -.304.
The estimate needs to be exponentiated to be on the original scale:
exp(-.304) = 0.738.
The section Random Effects SDs
shows standard deviations
of the random effects \(\upsilon_{\mu_{1},i}\), \(\upsilon_{\phi_{(1)11},i}\), and \(\upsilon_{\ln(\sigma^2_{\zeta_{1}}),i}\).
The section Random Effects Correlations
shows correlations
between random effects. For example, while random effects of the person
mean \(\upsilon_{\mu_{1},i}\) and the
autoregressive effect \(\upsilon_{\phi_{(1)11},i}\) display nearly
no correlation, 0.017, there is a positive correlation between the
person mean and log innovation variance, 0.217. This indicates that
individuals with a higher person mean in the variable Y1
also tend to have a higher innovation variance.
The parameter estimates can also be plotted with the
mtls_plot()
-function. By default, forest plots of model
parameters are produced. The what
-argument controls what
parameter types are plotted (e.g., fixed effects, random effect standard
deviations, and other).
mlts_plot(ar1_fit, type = "fe", what = "Fixed effect")