Multifactor analysis of the time series
Contents
Introduction
The birth of the multifactor analysis of the time series (singular-spectrum analysis [1]) is usually associated with the first papers of Broomhead and King [2,3]. At present the list of publications is more than hundred. Around sixty references can be found in the book [1] together with many examples of application of such approach to analysis of the time series and description of various theoretical and practical tasks.
We call the represented algorithm as multifactor analysis of the time series (MAS), since the algorithm has likeness to one of variants of the factor analysis, because in MAS the time series are represented as the sample values of a random vector (of given length M) with the subsequent application of singular value decomposition (SVD) to a sample correlation matrix of such vector and evaluation of the principal components. On the other hand, it is possible to interpret MAS as decomposition of the time series using the system of orthogonal basis functions, which are obtained on the basis of the original time series.
Such algorithm is ‘essentially a model-free technique; it is more an exploratory, model building tool than a confirmatory procedure. It aims at a decomposition of the original series into a sum of a small number of interpretable components such as a slow varying trend, oscillatory components and a 'structureless' noise’ [1]. Possible application of MAS is very wide and include climatology, meteorology, signal processing and physics. This approach can be applicable to other fields of sciences and human activity.
Thus, using MAS, the interpreter can extract slowly varying components, oscillating components, noise components, i.e. to implement the denoise procedure. The choice of number of basis functions for decomposition of initial time series formally is not given and is determined by the interpreter according to a type of problem.
Description of MAS with examples of processing of the time series can be downloaded as pdf file. Software modules written in MATLAB language can be downloaded as zip file.
[1] Golyandina, N., V. Nekrutkin and A. Zhigljavsky (2001) Analysis of Time Series Structure: SSA and Related Techniques, Chapman & Hall/CRC.
[2] Broomhead, D. S. and G. P. King (1986) Extracting qulitative dynamics from experimental data, Physica D, 20,217-236.
[3] Broomhead, D. S. and G. P. King (1986) On the qualitative analysis of experimental dynamical system, In S. Sarkar (Ed.), Nonlinear Phenomena and Chaose, pp. 113-144, Adam Higer, Bristol.
Contents
Analysis of the temperature time series
The multifactor analysis is implemented for processing of time series from 37 meteorological stations (monthly data). Slowly (or "like slowly") components are recovered with the use of decomposition on 132 principal components. The results of recovery are represented in graphic form, including 15 first principal components and diagrams for 5 first principal components, which demonstrate the structure of the time series.
Number |
Latitude.N |
Longitude |
Station |
Period |
|
65.37 |
322.21 |
ANGMAGSSALIK |
1895 |
2000 |
|
65.41 |
341.55 |
AKUREYRI |
1882 |
2000 |
|
70.59 |
351.20 |
JAN MAYEN |
1921 |
2000 |
|
60,24 |
5,19 |
BERGEN/FREDRIKSBERG |
1816 |
2000 |
|
67.16 |
14.22 |
BODO |
1868 |
2000 |
|
74.31 |
19.01 |
BJORNOVA |
1920 |
2000 |
|
65.50 |
24.09 |
HAPARANDA |
1860 |
1999 |
|
68.58 |
33.03 |
MURMANCK |
1918 |
1999 |
|
64.35 |
40.30 |
ARKHANGEL'SK |
1813 |
2000 |
|
68.39 |
43.18 |
KANIN NOS M. |
1915 |
2000 |
|
65.27 |
52.16 |
UST'-TSIL'MA |
1889 |
1999 |
|
72.23 |
52.44 |
MALYE KARAMAKULY |
1897 |
2000 |
|
66.32 |
66.32 |
SALEKHARD |
1883 |
2000 |
|
61.15 |
73.30 |
SURGUT |
1885 |
1996 |
|
73.30 |
80.14 |
DIKSON |
1916 |
2000 |
|
65.47 |
87.57 |
TURUKHANSK |
1881 |
2000 |
|
64.10 |
100.04 |
TURA |
1928 |
1999 |
|
71.59 |
102.27 |
KHATANGA |
1929 |
1999 |
|
63.46 |
121.37 |
VILYUISK |
1898 |
1999 |
|
70.41 |
127.24 |
KYUSYUR |
1918 |
1998 |
|
67.33 |
133.23 |
VERKHOYANSK |
1891 |
2000 |
|
73.11 |
143.56 |
SHALAUROVA |
1928 |
2000 |
|
66.10 |
169.50 |
UELEN |
1928 |
1999 |
|
64.68 |
170.02 |
MARKOVO |
1894 |
2000 |
|
64.47 |
177.34 |
ANADYR' |
1898 |
2000 |
|
70.97 |
181.28 |
VRANGELYA |
1926 |
1999 |
|
64.30 |
194.44 |
NOME |
1907 |
2000 |
|
60.47 |
198.12 |
BETHEL |
1923 |
2000 |
|
71.18 |
203.32 |
BARROW |
1882 |
2000 |
|
64.49 |
212.08 |
FAIRBANKS |
1904 |
2000 |
|
67.49 |
244.52 |
COPPERMINE |
1937 |
2000 |
|
60.00 |
248.02 |
FT SMITH |
1914 |
1998 |
|
63.20 |
269.17 |
CHESTERFIELD |
1921 |
1991 |
|
80.00 |
274.04 |
EUREKA |
1947 |
1999 |
|
70.27 |
291.23 |
CLYDE |
1942 |
1999 |
|
72,47 |
303,56 |
UPERNAVIK |
1873 |
2000 |
|
64,10 |
308,57 |
GOTHAAB |
1866 |
2000 |
Analysis of the rivers run-off
The multifactor analysis is implemented for processing of river run-off for 7 Siberia rivers(monthly and annual data). In the case of monthly data, slowly (or "like slowly") components are recovered with the use of decomposition on 132 principal components. The results of recovery are represented in graphic form, including 40 first principal components and diagrams for 5 first principal components, which demonstrate the structure of the time series. In the case of annual data for decomposition of the time series 11 principal components is used.
River name |
Period |
|
(monthly and annual data) |
1881 |
1988 |
(monthly data) |
1936 |
1992 |
(monthly data) |
1937 |
1993 |
(monthly data) |
1927 |
1989 |
(monthly data) |
1935 |
1993 |
(monthly data) |
1930 |
1993 |
(monthly data) |
1932 |
1987 |