Abstract
The analysis of cross-correlations is extensively applied for the understanding of interconnections in stock markets and the portfolio risk estimation. Current studies of correlations in Chinese market mainly focus on the static correlations between return series, and this calls for an urgent need to investigate their dynamic correlations. Our study aims to reveal the dynamic evolution of cross-correlations in the Chinese stock market, and offer an exact interpretation for the evolution behavior. The correlation matrices constructed from the return series of 367 A-share stocks traded on the Shanghai Stock Exchange from January 4, 1999 to December 30, 2011 are calculated over a moving window with a size of 400 days. The evolutions of the statistical properties of the correlation coefficients, eigenvalues, and eigenvectors of the correlation matrices are carefully analyzed. We find that the stock correlations are significantly increased in the periods of two market crashes in 2001 and 2008, during which only five eigenvalues significantly deviate from the random correlation matrix, and the systemic risk is higher in these volatile periods than calm periods. By investigating the significant contributors of the deviating eigenvectors in different time periods, we observe a dynamic evolution behavior in business sectors such as IT, electronics, and real estate, which lead the rise (drop) before (after) the crashes. Our results provide new perspectives for the understanding of the dynamic evolution of cross-correlations in the Chines stock markets, and the result of risk estimation is valuable for the application of risk management.
Introduction
The stock market is a typical complex system with interactions between individuals, groups, and institutions at different levels. In financial crises, the risk can quickly propagate among these interconnected institutions which have mutual beneficial business. Therefore, the analysis of the correlations between shares issued by different institutions is of crucial importance for the understanding of interactive mechanism of the stock market and the portfolio risk estimation [1]–[3]. Variety of works have been done to reveal the information contained in the internal correlations among stocks, and the methods generally used in the research of stock cross-correlations include the random matrix theory (RMT) [4], [5], the principal component analysis (PCA) [6]–[8], and the hierarchical structure [9]–[17].
The random matrix theory (RMT), originally developed in complex quantum system, is applied to analyze the cross-correlations between stocks in the U.S. stock market by Plerou et al. [4]. The statistics of the most of the eigenvalues of the correlation matrix calculated from stock return series agree with the predictions of random matrix theory, but with deviations for a few of the largest eigenvalues. Extended work has been conducted to explain information contained in the deviating eigenvalues [18], which reveals that the largest eigenvalue corresponds to a market-wide influence to all stocks and the remaining deviating eigenvalues correspond to conventionally identified business sectors. Additional work has proved that even the eigenvalues within the spectrum of RMT carry some sort of correlations [19], [20]. Using the same RMT method, extensive works have been performed in the correlation analysis of various stock markets [21]–[30].
In recent years, there are increasing works concentrated on the variation of the cross-correlations between market equities over time [31]–[40]. Aste et al. have investigated the evolution of the correlation structure among 395 stocks quoted on the U.S. equity market from 1996 to 2009, in which the connected links among stocks are built by a topologically constrained graph approach [34]. They find that the stocks have increased correlations in the period of larger market instabilities. By using the similar filtered graph approach, the correlation structure among 57 different market indices all over the world has been studied [37]. Fenn et al. have used the RMT method to analyze the time evolutions of the correlations between the market equity indices of 28 geographical regions from 1999 to 2010 [38], and they also observe the increase of the correlations between several different markets after the credit crisis of 2007–2008. Similar results have also been observed in Refs. [31], [32], [35],[41],[42].
The RMT method has been applied to the analysis of the static correlations between the return series in the Chinese stock market [26]. No clear interactions between stocks in same business sectors are observed, while unusual sectors containing the ST (specially treated) and Blue-chip stocks are identified by a few of the largest eigenvalues. Further work has been done to analyze the anti-correlated sub-sectors that compose the unusual sectors [43]. Up to now, not much work has been conducted on the dynamics of stock correlations in the Chinese market to the best of our knowledge. Using the daily records of 259 stocks on the Chinese stock market from 1997 to 2007, the dynamic evolution of the Chinese stock network was firstly analyzed in [36]. In their work the links are constructed between the stocks which have correlations larger than a threshold, and a stable topological structure is revealed by using a dynamic threshold instead of the static threshold. Although additional efforts are made to identify the economic sectors based on the RMT method, the dynamic effects of conventional business sectors is extremely weak.
The principal component analysis (PCA) is another method commonly used to detect the correlations between stock returns. It is closely related to the RMT method, since it is also done through eigenvalue decomposition of the correlation (or covariance) matrix of the return series. This method uses an orthogonal transformation to convert a set of possible correlated returns into several uncorrelated components, which are ranked by their explanatory power for the total variance of the system. The studies of correlations among stock returns based on the PCA method are primarily concerned about the systemic risk measures [6]–[8].
In this paper, by mainly using the RMT method, we study dynamic evolution of the correlations between the 367 A-share stocks traded on Shanghai Stock Exchange from 1999 to 2011. The internal correlations between the stocks are investigated based on the correlation matrix of the return series of individual stocks in a moving window with a fixed length. We are mainly concerned about the statistical properties of the correlation coefficients, eigenvalues and eigenvectors of the correlation matrix, and their variations in different time periods. Our results confirm the strong collective behavior of the stock returns in the periods of market crashes, which is verified by the observations of the distribution of the correlation coefficients and the mean correlation coefficient. Further, based on the PCA method we calculate the proportion of total variance explain by the first components, through which the systemic risk of the Chinese stock market is estimated for different time periods. Another important purpose of our study is to extract the information contained in the eigenvectors deviating from RMT. We find the largest eigenvector quantifies a market-wide influence on all stocks, and this market mode remains stable over time. For the interpretation of other deviating eigenvectors, dynamic evolutions of several conventional industries including IT, electronics, machinery, petrochemicals, and real estate, are remarkably observed.
Materials and Methods
The database analyzed in our study contains the daily data of all A-Share stocks traded on Shanghai Stock Exchange (SHSE), one of the two stock exchanges in mainland China. The A-Share stocks are issued by mainland Chinese companies, and traded in Chinese Yuan. The data source is from Beijing Gildata RESSET Data Technology Co., Ltd, see http://www.resset.cn/. To better understand the correlation structures under different market conditions, we select the A-share stocks traded on Shanghai Stock Exchange from January 4, 1999 to December 30, 2011 covering the two big crashes in 2001 and 2008. To make sure that the stocks have enough number of trading days to be statistically significant in our studies, we select the stocks traded on the stock exchange for at least 2600 days, i.e., exclude those stocks suspended from the market for more than about two years. This filter yields the sample data including 367 A-Share stocks and 1114364 daily records in total.
Before we quantify the cross-correlations between stocks, we first calculate the return series for a given stock as
(1) |
where is the price for stock at time , and is in units of one day. The Pearson's correlation coefficient between two stock return series and is defined as
(2) |
where and are the standard deviations of two stock return series. It is a common measure of the dependence between the return series of the two stocks. There are sample stocks, therefore we have a correlation matrix with correlation coefficients as elements. The elements of the correlation matrix are restricted to the domain : for the stocks are correlated, for the stocks are anti-correlated, and for the stocks are uncorrelated.
The cross-correlation defined above is to calculate the dependence between the return series in the whole period of the sample data. We are more interested in the dynamic variation of the stock correlations evolved with time , so we look at the correlations calculated over a moving window. The size of the moving window is fixed to be 400 trading days, i.e., about two years, which is a little bit larger than the number of the sample stocks. Equation (2) is applied to calculate the correlation coefficients over a subset of return series within the moving window . For instance, the correlations in the first moving window are computed by the return series within , and for the following moving window. In consideration of our sample date, which is from 04/01/1999 to 30/12/2011, the starting date of the moving window covers the period from 04/01/1999 to 12/05/2010, and the ending date is from 06/09/2000 to 30/12/2011.
Results
Dynamics of correlation coefficients
We first analyze the distribution of the elements of the correlation matrix to capture the statistical properties of the correlation coefficients. In Figure 1, the probability density function (PDF) of the correlation coefficients evolved with time is shown. We observe that the center of the distribution clearly deviates from zero for the whole range of . The values of the coefficient , at which the peaks of are located, are significantly positive and vary over . The peaks of show two local maxima of as approaches 2003 and 2009, and appear at relatively small for other .
The Chinese stock market suffered a big crash after the release of the policy of state-held shares sale in listed companies in 2001, and the collapse of the internet bubble also took place in 2000–2001. In 2008, the global financial crisis burst out, and hit the stock markets around the world, certainly including the Chinese stock market. Considering that the length of the moving window is about two years, the correlations between the stock returns are significantly increased in the time windows 2001–2003 and 2008–2009. This indicates that stock price variations are more likely to be correlated around the market crashes.
To further verify the dependence of the stock correlations on the time , we compute the mean correlation coefficient in the moving window. Figure 2 plots as a function of the evolving time , and it strongly fluctuates during the whole range of . We simply look at the curve of , and pick out two local maxima on 02/04/2003 and 04/09/2009 and a local minimum on 25/12/2006 with eyes. There are also some relevant works concerned with extremum values for trend detection in stock price dynamics [44]–[46]. The moving windows corresponding to the two maxima are from 30/07/2001 to 02/04/2003 and from 17/01/2008 to 04/09/2009, and for the minimum is from 10/05/2005 to 25/12/2006. The date 30/07/2001 was close to the date 26/07/2001 on which the policy of state-held shares sale was formally implemented, and 17/01/2008 was near the date 21/01/2008 on which the Shanghai Stock Exchange Index dropped more than 5% followed by a decline over 7% the next day.
The volatility of the A-share Index of Shanghai Stock Exchange, quantified as the mean absolute returns within the moving window of 100 days length, is also illustrated in Figure 2. In the periods from 30/07/2001 to 02/04/2003 and from 17/01/2008 to 04/09/2009 the stock market was strongly fluctuating, while in the period from 10/05/2005 to 25/12/2006 the market was in a relatively calm state. In comparison with the variation of , one may conclude that stock correlations are more prominent in volatile periods, showing larger values of than those in calm periods.
Dynamics of eigenvalues and their explanations of system variance
We compute the eigenvalues of the correlation matrix with elements, and denote them as , , and . We investigate the probability density function (PDF) of the eigenvalues and its variation over time . In Figure 3, the PDF for evolved with is plotted. The peaks of show larger values for around 2003 and 2009 than those for other . The for large eigenvalues is plotted in Figure 4. The largest eigenvalue evolves with time , and shows larger values, i.e., , in the time windows 2001–2003 and 2008–2009. This phenomenon consists with the unveiling of two local maxima of in the moving windows from 30/07/2001 to 02/04/2003 and from 17/01/2008 to 04/09/2009.
In the observation of , we note that there exist large eigenvalues obviously large than the eigenvalues of the random correlation matrix. To compare the difference between the eigenvalues of the stock correlation matrix and those of the random correlation matrix, we show the analytical result of the random matrices following Ref.[47]. For the correlation matrix of random time series of length , the PDF of the eigenvalues in the limit and is given by
(3) |
where , and is within the bounds . and are the minimum and maximum eigenvalues of the random correlation matrix, which are given by
(4) |
In Figure 5, we plot of the random correlation matrix with finite and , the same as those of the stock return series. Within the bounds , of the correlation matrix constructed from the empirical return series in the first moving window (black solid line) is consistent with the analytical result of equation (3) (red dashed line). There also exist some deviations of large eigenvalues. In particular, the largest eigenvalue shown in the inset of Figure 5, which is about 31 times larger than .
We next identify the eigenvalues of the stock correlation matrix which deviate from those of the random correlation matrix, and investigate their variations over time . The analytical result of RMT is strictly valid for and . Instead, we compare of the stock correlation matrix with of the correlation matrix constructed from uncorrelated time series with length . The uncorrelated time series is generated by shuffling the empirical return series, in which the equal-time correlations between the original return series are destroyed. We compute the cross-correlations between these shuffled return series, and use this surrogate correlation matrix as a random correlation matrix. In Figure 6, black circled line denotes the 99th percentile of the eigenvalues calculated from the random correlation matrix. It stays relatively constant about 3 as the time evolves. This means that 99 percent of the eigenvalues of the random correlation matrix are less than this value.
If an eigenvalue of the empirical correlation matrix is larger than the 99th percentile of the eigenvalues generated from the shuffled return series, it is considered to be significantly larger than the eigenvalues of the random correlation matrix. In Figure 6, the number of the eigenvalues significantly larger than those of the random correlation matrix is plotted by the red square line. The number of empirical significantly larger than of random correlation matrix fluctuates over time . For around two date points 02/04/2003 and 04/09/2009, it shows a minimal value about 5, while for around 25/12/2006, it shows a maximal value about 16. This means that the number of significant eigenvalues in the volatile periods close to 30/07/2001-02/04/2003 and 17/01/2008-04/09/2009 is less than that in the calm period close to 10/05/2005-25/12/2006. To further illustrate the volatile and calm periods of the A-share market, we also plot the index composed of all A-share stocks in the figure. The crashes of 2001–2003 and 2008–2009 seem to start from middle 2001 and early 2008 respectively, and the following indices keep dropping for long periods of time. Between these two crashes, there exists a calm period from middle 2005 to late 2006, in which the A-share index shows a local minimum in middle 2005 and relatively small values till late 2006.
We give a cursory explanation for the above phenomenon. It can be easily proved that the sum of the eigenvalues of the stock correlation matrix is fixed to be the number of sample stocks, i.e, . As shown in the distribution of the eigenvalues, the major portion of eigenvalues are distributed in the region , and the large eigenvalues close to the market crashes of 2001–2003 and 2008–2009 are prominently larger than those during the calm period. Therefore, the number of eigenvalues in-between during crashes is less than calm periods. This may indicate that a few of the eigenvalues contain the information about the stock correlations when the market strongly fluctuates.
The commonality among the stock returns can also be detected by the PCA method, which has a close link to the RMT method. In fact, the systemic risk measured by the collective behavior of the stock price movements based on PCA has been analyzed in many studies [6]–[8]. The risk estimation is also valuable for the portfolio optimization, and some work has been done to analyze the risk-return relationship [48], [49]. The PCA method decomposes the returns of a sample of stocks into several orthogonal principal components. The principal components are uncorrelated, and satisfy the condition if , where is the -th eigenvalue of the correlation matrix of stock returns. The standardized return of stock , defined as , can be expressed as a linear combination of the principal components
(5) |
where is the total number of stocks analyzed, and is the component of -th eigenvector corresponding to stock , which is also known as the factor loading of for stock . The total variance of the return series is
(6) |
in which the total variance is decomposed into the orthogonal factor loadings and the eigenvalues . For the periods that stocks are highly correlated and connectively volatile, a small number of eigenvalues can explain most of the volatility in the system.
The cumulative risk fraction (CRF) is generally used to quantify the proportion of total variance explained by the first principal components [7], also known as absorption ratio in [8]. It is defined as
(7) |
where is the -th eigenvalue, . Since the PCA is done through the decomposition of the correlation (covariance) matrix of return (standardized return) series, the total variance of the system explained by all principal components is quantified as . The variance associated with the first principal components is quantified as . The CRF is the ratio of these two quantities.
In Figure 7, the CRFs for are shown as a function of the evolving time . The CRF displays two local maxima at nearby 02/04/2003 and 04/09/2009, at which it can explain more than 50%, 60%, and 80% of the total variance for respectively. This indicates that the stocks are highly correlated in the moving windows from 30/07/2001 to 02/04/2003 and from 17/01/2008 to 04/09/2009, in which the majority of the stock returns tend to move together. Thus the stock market is at a high level of systemic risk. We also observe that the CRF displays a relatively small value in the moving window from 10/05/2005 to 25/12/2006, in which the stocks are less correlated. These results are coincident with those observed in the mean correlation coefficient.
Evolution of eigenvectors and their interpretations
To analyze the information contained in the deviating eigenvectors, we first investigate the contributions of the eigenvector components grouped in conventional industries. According to the China Securities Regulatory Commission (CSRC) industry code, the stocks traded on Shanghai Stock Exchange are grouped into A-M conventional industries. Table 1 presents summary statistics of the 22 industries, including the industry codes, industry names, and the number of chosen stocks belonging to each industry. For each deviating eigenvector , with element as the component of the -th eigenvector corresponding to stock , we calculate the contribution of each industry group
(8) |
where is the number of stocks belonging to industry group , . The measure of is analogous to the analysis of wave function in disordered systems, and firstly introduced to financial data analysis in Ref. [50].
Table 1. Basic information about the A-M conventional industries.
Industry code | Industry | Number of stocks |
A | Agriculture | 4 |
B | Mining | 7 |
C0 | Food & beverage | 21 |
C1 | Textiles & apparel | 12 |
C2 | Timber & furnishings | 0 |
C3 | Paper & printing | 5 |
C4 | Petrochemicals | 31 |
C5 | Electronics | 10 |
C6 | Metals & non-metals | 26 |
C7 | Machinery | 52 |
C8 | Pharmaceuticals | 19 |
C99 | Other manufacturing | 2 |
D | Utilities | 19 |
E | Construction | 6 |
F | Transportation | 14 |
G | IT | 18 |
H | Wholesale & retail trade | 54 |
I | Finance & insurance | 2 |
J | Real estate | 43 |
K | Social services | 11 |
L | Communication & cultural industry | 4 |
M | Comprehensive | 7 |
The conventional industries are grouped based on the China Securities Regulatory Commission (CSRC) industry code. The basic information includes the industry code, full name of the industry, and the number of chosen stocks belonging to each industry.
Figure 8 shows for deviating eigenvectors evolved with time . The participants of the eigenvectors listed in the horizontal axis are 367 stocks. The stocks belonging to industry group are endowed with the same value of , and ranked by their capitalizations on the ending date of the sample data. We find that for the largest eigenvector universally shows large values among different industries, which means that almost all the industries have significant contributions to . It is quite robust for different . In Figure 9, for , , and show different patterns in the periods divided by the date points 02/04/2003, 25/12/2006, and 04/09/2009. In addition, before and after two date points 13/01/2009 and 11/05/2010, which are the ending dates of the moving windows started from 30/05/2007 and 16/09/2008 respectively, show remarkably different patterns. These discrete patterns can be easily observed for and . The Shanghai Stock Exchange fell 6.5% on 30/05/2007, which was caused by an increase in the stamp tax on stock transactions to 0.3% from 0.1%. The bankruptcy of Lehman Brothers on 14/09/2008 indicated that the financial crisis entered an acute phase, and the Chinese stock market started to be affected by the global financial crisis after that, showing a 4.5% fall on 16/09/2008. Therefore, we choose the ending dates of these two moving windows, i.e., 13/01/2009 and 11/05/2010, as additional dividing dates. The date points 02/04/2003, 25/12/2006, 13/01/2009, and 11/05/2010 are picked as coarse-grained dividing points.
We next analyze the contributions of industries in different time periods separated by the four dividing dates. As shown in Figure 9, and show large values of for the electronics and IT industries respectively in the first period from 06/09/2000 to 02/04/2003. In the following period from 02/04/2003 to 25/12/2006, mining, electronics, and real estate industries have significant contributions to , , and respectively. Real estate industry is a significant contributor of in the periods from 25/12/2006 to 13/01/2009, and of and from 13/01/2009 to 11/05/2010. In the last period from 11/05/2010 to 30/12/2011, both real estate and pharmaceuticals industries have significant contributions to , and mining industry is a significant contributor of . It is worth noting that of finance & insurance and other manufacturing industries display large values for -. We neglect their contributions to the deviating eigenvectors, since there are only small numbers of chosen stocks belonging to these two industries.
To further confirm the wide influence of the largest eigenvector observed in the contributions of industries, we also calculate the projection of the stock returns on the largest eigenvector
(9) |
where is the component of corresponding to stock , and is the number of sample stocks. In Figure 10, we plot against the return of the A-share Index of Shanghai Stock Exchange for the moving windows ended on 06/09/2000, 02/04/2003, 25/12/2006, 13/01/2009, 11/05/2010, and 30/12/2011. The A-share Index is composed of all A-share stocks traded on Shanghai Stock Exchange. The projection can be well fitted by a linear fit, which shows a narrow scatter around the fitted line in figure. The slope is about , with a slight quantitative difference for different moving windows. This value is a little bit larger than observed in [18]. The significant linear correlation between and indicates that the largest eigenvalue can be interpreted as quantifying market-wide influence on all stocks, and it remains quite robust to the variance of . In fact, all the components of are positive in our study, and similar results are revealed in [26]. The A-share Index is a capitalization-weighted average of the prices of all A-share stocks, and large components of are universally distributed among all stocks. Thus it would be no surprise to observe the significant correlation between and .
We have offered an overall observation of the contributions of industry groups. For the interpretation of -, we further analyze the component stocks which significantly contribute to each deviating eigenvector in different time periods divided by the date points 02/04/2003, 25/12/2006, 13/01/2009, and 11/05/2010. The minor adjustments of the dividing dates centered around them do not significantly change the results. Table 2–6 show the stocks and industry groups corresponding to the largest ten components of the deviating eigenvectors , , , and by the average ranks of the eigenvector components in different time periods. We rank the components according to their eigenvector component values, and average the ranks of the components over the moving windows with ending dates from 06/09/2000 to 02/04/2003, from 02/04/2003 to 25/12/2006, from 25/12/2006 to 13/01/2009, from 13/01/2009 to 11/05/2010, and from 11/05/2010 to 30/12/2011. The components with the smallest ten average ranks are picked as the largest ten components. The largest ten components correspond to ten stocks which significantly contribute to the relevant eigenvectors.
Table 2. Largest ten components of deviating eigenvectors in period from 06/09/2000 to 02/04/2003.
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
IT | G | Pharmaceuticals | C8 | ||
Utilities | D | IT | G | ||
Comprehensive | M | Real estate | J | ||
IT | G | Wholesale & retail trade | H | ||
Real estate | J | Machinery | C7 | ||
Electronics | C5 | Machinery | C7 | ||
IT | G | Machinery | C7 | ||
IT | G | Petrochemicals | C4 | ||
Social services | K | Machinery | C7 | ||
IT | G | Petrochemicals | C4 | ||
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Petrochemicals | C4 | Real estate | J | ||
Food & beverage | C0 | Machinery | C7 | ||
Petrochemicals | C4 | Machinery | C7 | ||
Real estate | J | Real estate | J | ||
Pharmaceuticals | C8 | Real estate | J | ||
Other manufacturing | C99 | IT | G | ||
Utilities | D | Wholesale & retail trade | H | ||
Wholesale & retail trade | H | Metals & non-metals | C6 | ||
Pharmaceuticals | C8 | Machinery | C7 | ||
Machinery | C7 | Petrochemicals | C4 |
Largest ten components of , , , and by the average ranks of the eigenvector components taken over the moving windows with ending dates from 06/09/2000 to 02/04/2003. The eigenvectors are obtained from the correlation matrices of the return series in these moving windows. The stock codes corresponding to the largest ten components, the industries they belonging to, and the industry codes are listed.
Table 6. Largest ten components of deviating eigenvectors in period from 11/05/2010 to 30/12/2011.
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Real estate | J | Wholesale & retail trade | H | ||
Machinery | C7 | Textiles & apparel | C1 | ||
IT | J | Mining | B | ||
Real estate | J | Wholesale & retail trade | H | ||
Real estate | J | Metals & non-metals | C6 | ||
Pharmaceuticals | C8 | Transportation | F | ||
Real estate | J | Petrochemicals | C4 | ||
Real estate | J | Real estate | J | ||
Real estate | J | Real estate | J | ||
Metals & non-metals | C6 | Wholesale & retail trade | H | ||
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Textiles & apparel | C1 | IT | G | ||
Transportation | F | Food & beverage | C0 | ||
Agriculture | A | Machinery | C7 | ||
Metals & non-metals | C6 | Wholesale & retail trade | H | ||
Wholesale & retail trade | H | IT | G | ||
Metals & non-metals | C6 | Pharmaceuticals | C8 | ||
Mining | B | Wholesale & retail trade | H | ||
Machinery | C7 | Pharmaceuticals | C8 | ||
Mining | B | Pharmaceuticals | C8 | ||
Petrochemicals | C4 | Electronics | C5 |
Largest ten components of , , , and by the average ranks of the eigenvector components taken over the moving windows with ending dates from 11/05/2010 to 30/12/2011. The eigenvectors are obtained from the correlation matrices of the return series in these moving windows. The stock codes corresponding to the largest ten components, the industries they belonging to, and the industry codes are listed.
If one looks carefully at the stock codes of the largest ten components, dynamic evolutions of conventional stock industries are remarkably observed. The stocks belonging to the industries which have significant contributions to distinct eigenvectors also appear in their largest ten components. For the moving windows with ending dates in the period from 06/09/2000 to 02/04/2003, as shown in Table 2, among the largest ten components of five stocks belong to IT industry and one stock belongs to electronics industry, and for four stocks belong to machinery industry and two stocks belong to petrochemicals industry. In the following period from 02/04/2003 to 25/12/2006, as shown in Table 3, four IT stocks and two electronics stocks are in the list of the largest ten components of , and five machinery stocks and two petrochemicals stocks are in the list of . More interestingly, stocks 600198, 600100, and 600770, which are among the largest ten components of in the first time period, appear in the largest ten components of in the following period. The starting dates of the moving windows in the first period are from 04/01/1999 to 30/07/2001, and from 30/07/2001 to 10/05/2005 for the second period. The evolutions of the IT and electronic industries recall the history of the Chinese stock market in the period of 1999–2001. During that period of time, the Chinese stock market was in a bull market, and high-tech stocks issued by companies deal in IT and electronics were leading the rise. After 2001, the Chinese stock market started to decline, thus the IT and electronics stocks are contained in . Similar phenomenon is observed for the stocks in machinery and petrochemicals industries: stocks 600843, 600818, 600618, and 600841 among the largest ten components of in the first time period become the members of the largest ten components of in the following period.
Table 3. Largest ten components of deviating eigenvectors in period from 02/04/2003 to 25/12/2006.
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Mining | B | Electronics | C5 | ||
Transportation | F | Electronics | C5 | ||
Machinery | C7 | IT | G | ||
Utilities | D | Communication & cultural industry | L | ||
Food & beverage | C0 | Comprehensive | M | ||
Petrochemicals | C4 | IT | G | ||
Utilities | D | IT | G | ||
Utilities | D | Comprehensive | M | ||
Petrochemicals | C4 | IT | G | ||
Comprehensive | M | Metals & non-metals | C6 | ||
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Machinery | C7 | Wholesale & retail trade | H | ||
Machinery | C7 | Real estate | J | ||
Other manufacturing | C99 | Real estate | J | ||
Real estate | J | Real estate | J | ||
Petrochemicals | C4 | Pharmaceuticals | C8 | ||
Wholesale & retail trade | H | Real estate | J | ||
Machinery | C7 | Real estate | J | ||
Machinery | C7 | Other manufacturing | C99 | ||
Petrochemicals | C4 | Social services | K | ||
Machinery | C7 | Machinery | C7 |
Largest ten components of , , , and by the average ranks of the eigenvector components taken over the moving windows with ending dates from 02/04/2003 to 25/12/2006. The eigenvectors are obtained from the correlation matrices of the return series in these moving windows. The stock codes corresponding to the largest ten components, the industries they belonging to, and the industry codes are listed.
The dynamic evolution behavior is also observed in real estate industry. In the period from 02/04/2003 to 25/12/2006, five stocks belonging to real estate industry appear in the largest ten components of . The number of real estate stocks in the largest ten components of increases to seven in the period from 25/12/2006 to 13/01/2009, as shown in Table 4. In the following period from 13/01/2009 to 11/05/2010, five (seven) real estate stocks are in the largest ten components of (), as shown in Table 5. After September 2008, the Chinese stock market tended to be affected by the global financial crisis, and the stocks belonging to real estate industry were leading the drop. In Table 6, we observe that seven real estate stocks appear in the largest ten components of for the period from 11/05/2010 to 30/12/2011, in which the moving windows have starting dates from 16/09/2008 to 12/05/2010. In general, the real estate stocks contained in the largest five eigenvectors slowly move to be contained in the second largest eigenvector as the time approaches the global financial crisis. This conclusion is based upon the fact that many real estate stocks appear repeatedly in the largest ten components of the largest five eigenvectors in different periods. For instance, stock 600663 first appears in the largest ten components of in the period from 02/04/2003 to 25/12/2006, then it moves to be in those of in the following period from 25/12/2006 to 13/01/2009, and finally it becomes a member of those of in the latest period from 11/05/2010 to 30/12/2011.
Table 4. Largest ten components of deviating eigenvectors in period from 25/12/2006 to 13/01/2009.
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Metals & non-metals | C6 | Machinery | C7 | ||
Real estate | J | Transportation | F | ||
Real estate | J | Mining | B | ||
Food & beverage | C0 | Real estate | J | ||
Food & beverage | C0 | Textiles & apparel | C1 | ||
Petrochemicals | C4 | Food & beverage | C0 | ||
Machinery | C7 | Petrochemicals | C4 | ||
Machinery | C7 | Utilities | D | ||
Food & beverage | C0 | Pharmaceuticals | C8 | ||
Wholesale & retail trade | H | Metals & non-metals | C6 | ||
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Real estate | J | Metals & non-metals | C6 | ||
Real estate | J | Food & beverage | C0 | ||
Real estate | J | Petrochemicals | C4 | ||
Wholesale & retail trade | H | Construction | E | ||
Real estate | J | Real estate | J | ||
Wholesale & retail trade | H | Real estate | J | ||
Real estate | J | Food & beverage | C0 | ||
Real estate | J | Textiles & apparel | C1 | ||
Real estate | J | Textiles & apparel | C1 | ||
Wholesale & retail trade | H | Utilities | D |
Largest ten components of , , , and by the average ranks of the eigenvector components taken over the moving windows with ending dates from 25/12/2006 to 13/01/2009. The eigenvectors are obtained from the correlation matrices of the return series in these moving windows. The stock codes corresponding to the largest ten components, the industries they belonging to, and the industry codes are listed.
Table 5. Largest ten components of deviating eigenvectors in period from 13/01/2009 to 11/05/2010.
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
IT | G | Wholesale & retail trade | H | ||
Real estate | J | Real estate | J | ||
IT | G | Real estate | J | ||
Transportation | F | Electronics | C5 | ||
Wholesale & retail trade | H | Real estate | J | ||
Real estate | J | Pharmaceuticals | C8 | ||
Machinery | C7 | Wholesale & retail trade | H | ||
Textiles & apparel | C1 | Real estate | J | ||
Comprehensive | M | Real estate | J | ||
Mining | B | Pharmaceuticals | C8 | ||
Stock code | Industry | Industry code | Stock code | Industry | Industry code |
Machinery | C7 | Metals & non-metals | C6 | ||
Real estate | J | Paper & printing | C3 | ||
Real estate | J | Wholesale & retail trade | H | ||
Real estate | J | Wholesale & retail trade | H | ||
Real estate | J | Wholesale & retail trade | H | ||
Real estate | J | Utilities | D | ||
Real estate | J | Food & beverage | C0 | ||
Finance & insurance | I | Food & beverage | C0 | ||
Real estate | J | Comprehensive | M | ||
Food & beverage | C0 | Transportation | F |
Largest ten components of , , , and by the average ranks of the eigenvector components taken over the moving windows with ending dates from 13/01/2009 to 11/05/2010. The eigenvectors are obtained from the correlation matrices of the return series in these moving windows. The stock codes corresponding to the largest ten components, the industries they belonging to, and the industry codes are listed.
In both analysis of the contributions of industry groups and component stocks, sectors like IT, electronics, machinery, petrochemicals, and real estate are significant contributors and their dynamic evolutions are clearly observed. Other sectors which have large industry contributions are also observed in the largest ten components of the same eigenvector in a particular time period. For instance, utilities and mining have large contributions to in the period from 02/04/2003 to 25/12/2006, metals & non-metals and mining have large contributions to in the period from 11/05/2010 to 30/12/2011, and pharmaceuticals has large contribution to in the period from 11/05/2010 to 30/12/2011.
Discussion and Conclusion
In summary, we have conducted a thorough study of the evolution of the cross-correlations between the return series of 367 A-share stocks on Shanghai Stock Exchange from 1999 to 2011. We find that the stock returns behave more collectively in volatile periods, showing biased distribution of correlation coefficients centered around lager positive coefficients and larger values of mean correlation coefficient as the time approaches the two big crashes in 2001 and 2008. In the same volatile periods, we find that the largest eigenvalue shows larger values, while the number of eigenvalues that significantly deviate from those of the random correlation matrix is less. In addition, only a small number of eigenvalues can explain the major portion of the total system variance when the market is volatile, which indicates a high level of systemic risk.
For the interpretation of deviating eigenvectors, we have further analyzed the eigenvector components and their contributions. By computing the contributions of the components grouped in conventional industries, we find significant contributors, such as mining, electronics, IT, and real estate, for distinct eigenvectors over different time . We also analyze the projection of the stock returns on the largest eigenvector, and confirm the market-wide influence of the largest eigenvector and its stability in time. In the analysis of the component stocks which significantly contribute to each eigenvector, dynamic evolution of conventional industries are observed, basically consistent with the results of industry contributions. The stocks in IT and electronics industries significantly contributing to the second largest eigenvector before the crash in 2001 become the significant contributors of the third largest eigenvector after the crash. Similarly, the stocks in real estate industry significantly contributing to other deviating eigenvectors before the crisis of 2008-2009 become the significant contributors of the second largest eigenvector during the crisis period.
We offer a new interpretation of the deviating eigenvectors of the correlation matrices in the Chinese stock market. It is revealed that the information contained in a particular eigenvector varies over time, which is different from results of fixed sectors and sub-sectors observed in the static correlation analysis. The dynamic evolution of significant eigenvector contributors reminds us of the sector rotation commonly observed in financial market. This work is valuable for the understanding of risk propagation among interconnected stocks and the classification of stock sectors, and can be further applied to portfolio risk estimation and systemic risk management.
Acknowledgments
The authors would like to thank M. Tumminello for helpful comments and suggestions.
Funding Statement
This work was partially supported by the National Natural Science Foundation (Nos. 10905023, 11075054 and 71131007), Humanities and Social Sciences Fund sponsored by Ministry of Education of the People's Republic of China (No. 09YJCZH042), the Shanghai (Follow-up) Rising Star Program Grant 11QH1400800, the Zhejiang Provincial Natural Science Foundation of China (Nos. Z6090130 and Y6110687), and the Fundamental Research Funds for the Central Universities. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Farmer JD (1999) Physicists attempt to scale the ivory towers of finance. Comput Sci Eng 1: 26–39. [Google Scholar]
- 2.Mantegna RN, Stanley HE (2000) An Introduction to Econophysics: Correlations and Complexity in Finance. Cambridge: Cambridge University Press.
- 3.Bouchaud JP, Potters M (2000) Theory of Financial Risks: From Statistical Physics to Risk Management. Cambridge: Cambridge University Press.
- 4. Plerou V, Gopikrishnan P, Rosenow B, Amaral LAN, Stanley HE (1999) Universal and nonuniversal properties of cross correlations in financial time series. Phys Rev Lett 83: 1471–1474. [Google Scholar]
- 5. Laloux L, Cizeau P, Bouchaud JP, Potters M (1999) Noise dressing of financial correlation matrices. Phys Rev Lett 83: 1467–1470. [Google Scholar]
- 6. Zheng ZY, Podobnik B, Feng L, Li BW (2012) Changes in cross-correlations as an indicator for systemic risk. Sci Rep 2: 888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Billio M, Getmansky M, Lo AW, Pelizzon L (2012) Econometric measures of connectedness and systemic risk in the finance and insurance sectors. J Financial Econ 104: 535–559. [Google Scholar]
- 8. Kritzman M, Li YZ, Page S, Rigobon R (2011) Principal components as a measure of systemic risk. J Portf Manag 37: 112–126. [Google Scholar]
- 9. Mantegna RN (1999) Hierarchical structure in financial markets. Eur Phys J B 11: 193–197. [Google Scholar]
- 10. Bonanno G, Caldarelli G, Lillo F, Mantegna RN (2003) Topology of correlation-based minimal spanning trees in real and model markets. Phys Rev E 68: 046130. [DOI] [PubMed] [Google Scholar]
- 11. Bonanno G, Caldarelli G, Lillo F, Miccichè S, Vandewalle N, et al. (2004) Networks of equities in financial markets. Eur Phys J B 38: 363–371. [Google Scholar]
- 12. Tumminello M, Coronnello C, Lillo F, Miccichè S, Mantegna RN (2007) Spanning trees and bootstrap reliability estimation in correlation-based networks. Int J Bifurcat Chaos 17: 2319–2329. [Google Scholar]
- 13. Tumminello M, Di Matteo T, Aste T, Mantegna RN (2007) Correlation based networks of equity returns sampled at different time horizons. Eur Phys J B 55: 209–217. [Google Scholar]
- 14. Garas A, Argyrakis P (2009) Filtering of complex systems using overlapping tree networks. EPL (Europhys Lett) 86: 28005. [Google Scholar]
- 15. Cai SM, Zhou YB, Zhou T, Zhou PL (2010) Hierachical organization and disassortative mixing of correaltion-based weighted financial networks. Int J Mod Phys C 21: 433–441. [Google Scholar]
- 16. Tumminello M, Lillo F, Mantegna RN (2010) Correlation, hierarchies, and networks in financial markets. J Econ Behav Org 75: 40–58. [Google Scholar]
- 17. Kwapień J, Drożdż S (2012) Physical approach to complex systems. Phys Rep 515: 115–226. [Google Scholar]
- 18. Plerou V, Gopikrishnan P, Rosenow B, Amaral LAN, Guhr T, et al. (2002) Random matrix approach to cross correlations in financial data. Phys Rev E 65: 066126. [DOI] [PubMed] [Google Scholar]
- 19. Kwapień J, Drożdż S, Oświecimka P (2006) The bulk of the stock market correlation matrix is not pure noise. Physica A 359: 589–606. [Google Scholar]
- 20. Drożdż S, Kwapień J, Oświecimka P (2007) Empirics versus rmt in financial cross-correlations. Acta Phys Pol B 38: 4027–4039. [Google Scholar]
- 21. Utsugi A, Ino K, Oshikawa M (2004) Random matrix theory analysis of cross correlations in financial markets. Phys Rev E 70: 026110. [DOI] [PubMed] [Google Scholar]
- 22. Kim DH, Jeong H (2005) Systematic analysis of group identification in stock markets. Phys Rev E 72: 046133. [DOI] [PubMed] [Google Scholar]
- 23. Pan RK, Sinha S (2007) Collective behavior of stock price movements in an emerging market. Phys Rev E 76: 046116. [DOI] [PubMed] [Google Scholar]
- 24. Wilcox D, Gebbie T (2007) An analysis of cross-correlations in an emerging market. Physica A 375: 584–598. [Google Scholar]
- 25. Shapira Y, Kenett D, Ben-Jacob E (2009) The Index cohesive effect on stock market correlations. Eur Phys J B 72: 657–669. [Google Scholar]
- 26. Shen J, Zheng B (2009) Cross-correlation in financial dynamics. EPL (Europhys Lett) 86: 48005. [Google Scholar]
- 27. Ahn S, Choi J, Lim G, Cha KY, Kim S, et al. (2011) Identifying the structure of group correlation in the Korean financial market. Physica A 390: 1991–2001. [Google Scholar]
- 28. Oh G, Eom C, Wang F, Jung WS, Stanley HE, et al. (2011) Statistical properties of cross-correlation in the Korean stock market. Eur Phys J B 79: 55–60. [Google Scholar]
- 29. Wang D, Podobnik B, Horvatic D, Stanley HE (2011) Quantifying and modeling long-range cross correlations in multiple time series with applications to world stock indices. Phys Rev E 83: 046121. [DOI] [PubMed] [Google Scholar]
- 30. Kwon O, Oh G (2012) Asymmetric information flow between market index and individual stocks in several stock markets. EPL (Europhys Lett) 97: 28007. [Google Scholar]
- 31. Drożdż S, Grümmer F, Górski A, Ruf F, Speth J (2000) Dynamics of competition between collectivity and noise in the stock market. Physica A 287: 440–449. [Google Scholar]
- 32. Drożdż S, Grümmer F, Ruf F, Speth J (2001) Towards identifying the world stock market cross-correlations: Dax versus dow jones. Physica A 294: 226–234. [Google Scholar]
- 33. Onnela JP, Chakraborti A, Kaski K, Kertesz J, Kanto A (2003) Dynamics of market correlations: Taxonomy and portfolio analysis. Phys Rev E 68: 056110. [DOI] [PubMed] [Google Scholar]
- 34. Aste T, Shaw W, Di Matteo T (2010) Correlation structure and dynamics in volatile markets. New J Phys 12: 085009. [Google Scholar]
- 35. Podobnik B, Wang D, Horvatic D, Grosse I, Stanley HE (2010) Time-lag cross-correlations in collective phenomena. EPL (Europhys Lett) 90: 68001. [Google Scholar]
- 36. Qiu T, Zheng B, Chen G (2010) Financial networks with static and dynamic thresholds. New J Phys 12: 043057. [Google Scholar]
- 37. Song DM, Tumminello M, Zhou WX, Mantegna R (2011) Evolution of worldwide stock markets, correlation structure, and correlation based graphs. Phys Rev E 84: 026108. [DOI] [PubMed] [Google Scholar]
- 38. Fenn DJ, Porter MA, Williams S, McDonald M, Johnson NF, et al. (2011) Temporal evolution of financial-market correlations. Phys Rev E 84: 026109. [DOI] [PubMed] [Google Scholar]
- 39. Kumar S, Deo N (2012) Correlation and network analysis of global financial indices. Phys Rev E 86: 026101. [DOI] [PubMed] [Google Scholar]
- 40. Gao YC, Wei ZW, Wang BH (2013) Dynamic evolution of financial network and its realtion to economic crisis. Int J Mod Phys C 24: 1350005. [Google Scholar]
- 41. Kenett D, Tumminello M, Madi A, Gur-Gershgoren G, Mantegna R, et al. (2010) Dominating clasp of the financial sector revealed by partial correlation analysis of the stock market. PLoS One 5: e15032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Kenett DY, Raddant M, Lux T, Ben-Jacob E (2012) Evolvement of uniformity and volatility in the stressed global financial village. PLoS One 7: e31144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Jiang XF, Zheng B (2012) Anti-correlation and subsector structure in financial systems. EPL (Europhys Lett) 97: 48006. [Google Scholar]
- 44. Johansen A, Sornette D (2001) Large stock market price drawdowns are outliers. J Risk 4: 69–110. [Google Scholar]
- 45. Wei JR, Huang JP (2012) An exotic long-term pattern in stock price dynamics. PLoS One 7: e51666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Liu L, Wei JR, Huang JP (2013) Scaling and volatility of breakouts and breakdowns in stock price dynamics. PLoS One 8: e82771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Sengupta AM, Mitra PP (1999) Distributions of sigular values for some random matrices. Phys Rev E 60: 3389–3392. [DOI] [PubMed] [Google Scholar]
- 48. Fama EF, MacBeth JD (1973) Risk, return, and equilibrium: empirical tests. J Polit Econ 81: 607–636. [Google Scholar]
- 49. Song KY, An KN, Yang G, Huang JP (2012) Risk-return relationship in a complex adaptive system. PLoS One 7: e33588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Gopikrishnan P, Rosenow B, Plerou V, Stanley HE (2001) Quantifying and interpreting collective behavior in financial markets. Phys Rev E 64: 035106. [DOI] [PubMed] [Google Scholar]