Temporal Matrix Factorization

[Question A] For any partially observed data matrix with the observed index set , suppose is the orthogonal projection supported on , then the matrix factorization (i.e., are the rank- latent factor matrices) involving temporal modeling on latent temporal factors (i.e., ) can be formulated as follows,

where is the coefficient matrix. The symbol denotes Kronecker product.

If and are known, then how to optimize the variables ? Do these variables have any closed-form solutions?

Note that

where is the th column of , while is the th column of .

Is it possible to use alternating least squares (ALS) method to solve the optimization problem?


[Note] Let be the objective, the first-order partial derivatives are:

  • with respect to :

  • with respect to :

Let , then we have a vectorized closed-form solution to as

where , and we have

In this case, we also define a block matrix

with the following sub-matrix

  • with respect to :

where .

Here, we have a sequence of matrices:



[Question B] How do the following two models perform?

vs.

Which is better?


References



[Question C] What is the difference between

and


[Note] For , the first-order partial derivative is given by