Temporal Matrix Factorization
[Question A] For any partially observed data matrix with the observed index set , suppose is the orthogonal projection supported on , then the matrix factorization (i.e., are the rank- latent factor matrices) involving temporal modeling on latent temporal factors (i.e., ) can be formulated as follows,
where is the coefficient matrix. The symbol denotes Kronecker product.
If and are known, then how to optimize the variables ? Do these variables have any closed-form solutions?
Note that
where is the th column of , while is the th column of .
Is it possible to use alternating least squares (ALS) method to solve the optimization problem?
[Note] Let be the objective, the first-order partial derivatives are:
- with respect to :
- with respect to :
Let , then we have a vectorized closed-form solution to as
where , and we have
In this case, we also define a block matrix
with the following sub-matrix
- with respect to :
where .
Here, we have a sequence of matrices:
[Question B] How do the following two models perform?
vs.
Which is better?
References
[Question C] What is the difference between
and
[Note] For , the first-order partial derivative is given by