Temporal Matrix Factorization
[Question A] For any partially observed data matrix with the observed index set
, suppose
is the orthogonal projection supported on
, then the matrix factorization (i.e.,
are the rank-
latent factor matrices) involving temporal modeling on latent temporal factors (i.e.,
) can be formulated as follows,
where is the coefficient matrix. The symbol
denotes Kronecker product.
If
and
are known, then how to optimize the variables
? Do these variables have any closed-form solutions?
Note that
where is the
th column of
, while
is the
th column of
.
Is it possible to use alternating least squares (ALS) method to solve the optimization problem?
[Note] Let be the objective, the first-order partial derivatives are:
- with respect to
:
- with respect to
:
Let , then we have a vectorized closed-form solution to
as
where , and we have
In this case, we also define a block matrix
with the following sub-matrix
- with respect to
:
where .
Here, we have a sequence of matrices:
[Question B] How do the following two models perform?
vs.
Which is better?
References
[Question C] What is the difference between
and
[Note] For , the first-order partial derivative is given by