Reading 11: Correlation and Regression-LOS f, (Part 2)习题精选

UID: 118990
帖子: 774
主题: 1
注册时间: 2008-12-31
最后登录: 2012-9-5

19^#

发表于 2010-4-20 15:18 | 只看该作者

thank you

maxsimax

UID: 107737
帖子: 1401
主题: 0
注册时间: 2008-9-6
最后登录: 2016-9-21

18^#

发表于 2010-4-14 14:44 | 只看该作者

thanks

UID: 137525
帖子: 5724
主题: 885
注册时间: 2009-7-1
最后登录: 2011-3-29

17^#

发表于 2010-4-8 12:11 | 只看该作者

What does the R² of a simple regression of two variables measure and what calculation is used to equate the correlation coefficient to the coefficient of determination?

R² measures:

Correlation coefficient

percent of variability of the independent variable that is explained by the variability of the dependent variable

R² = r²

percent of variability of the dependent variable that is explained by the variability of the independent variable

R² = r²

percent of variability of the independent variable that is explained by the variability of the dependent variable

R² = r × 2

R², or the Coefficient of Determination, is the square of the coefficient of correlation (r). The coefficient of correlation describes the strength of the relationship between the X and Y variables. The standard error of the residuals is the standard deviation of the dispersion about the regression line. The t-statistic measures the statistical significance of the coefficients of the regression equation. In the response: "percent of variability of the independent variable that is explained by the variability of the dependent variable," the definitions of the variables are reversed.

UID: 137525
帖子: 5724
主题: 885
注册时间: 2009-7-1
最后登录: 2011-3-29

16^#

发表于 2010-4-8 12:11 | 只看该作者

The R² of a simple regression of two factors, A and B, measures the:

A)	impact on B of a one-unit change in A.

B)	statistical significance of the coefficient in the regression equation.

C)	percent of variability of one factor explained by the variability of the second factor.

UID: 137525
帖子: 5724
主题: 885
注册时间: 2009-7-1
最后登录: 2011-3-29

15^#

发表于 2010-4-8 12:11 | 只看该作者

The R² of a simple regression of two factors, A and B, measures the:

A)	impact on B of a one-unit change in A.

B)	statistical significance of the coefficient in the regression equation.

C)	percent of variability of one factor explained by the variability of the second factor.

The coefficient of determination measures the percentage of variation in the dependent variable explained by the variation in the independent variable.

UID: 137525
帖子: 5724
主题: 885
注册时间: 2009-7-1
最后登录: 2011-3-29

14^#

发表于 2010-4-8 12:10 | 只看该作者

Craig Standish, CFA, is investigating the validity of claims associated with a fund that his company offers. The company advertises the fund as having low turnover and, hence, low management fees. The fund was created two years ago with only a few uncorrelated assets. Standish randomly draws two stocks from the fund, Grey Corporation and Jars Inc., and measures the variances and covariance of their monthly returns over the past two years. The resulting variance covariance matrix is shown below. Standish will test whether it is reasonable to believe that the returns of Grey and Jars are uncorrelated. In doing the analysis, he plans to address the issue of spurious correlation and outliers.

Grey

Jars

Grey

42.2

20.8

Jars

20.8

36.5

Standish wants to learn more about the performance of the fund. He performs a linear regression of the fund’s monthly returns over the past two years on a large capitalization index. The results are below:

ANOVA

df

SS

MS

F

Regression

1

92.53009

92.53009

28.09117

Residual

22

72.46625

3.293921

Total

23

164.9963

Coefficients

Standard Error

t-statistic

P-value

Intercept

0.148923

0.391669

0.380225

0.707424

Large Cap Index

1.205602

0.227467

5.30011

2.56E-05

Standish forecasts the fund’s return, based upon the prediction that the return to the large capitalization index used in the regression will be 10%. He also wants to quantify the degree of the prediction error, as well as the minimum and maximum sensitivity that the fund actually has with respect to the index.

He plans to summarize his results in a report. In the report, he will also include caveats concerning the limitations of regression analysis. He lists four limitations of regression analysis that he feels are important: relationships between variables can change over time, the decision to use a t-statistic or F-statistic for a forecast confidence interval is arbitrary, if the error terms are heteroskedastic the test statistics for the equation may not be reliable, and if the error terms are correlated with each other over time the test statistics may not be reliable.

Given the variance/covariance matrix for Grey and Jars, in a one-sided hypothesis test that the returns are positively correlated H₀: ρ = 0 vs. H₁: ρ > 0, Standish would:

A)	reject the null at the 1% level of significance.

B)	reject the null at the 5% but not the 1% level of significance.

C)	need to gather more information before being able to reach a conclusion concerning significance.

First, we must compute the correlation coefficient, which is 0.53 = 20.8 / (42.2 × 36.5)^0.5.

The t-statistic is: 2.93 = 0.53 × [(24 - 2) / (1 ? 0.53 × 0.53)]^0.5, and for df = 22 = 24 ? 2, the t-statistics for the 5 and 1% level are 1.717 and 2.508 respectively. (Study Session 3, LOS 11.g)

In performing the correlation test on Grey and Jars, Standish would most appropriately address the issue of:

A)	spurious correlation but not the issue of outliers.

B)	neither outliers nor correlation.

C)	spurious correlation and the issue of outliers.

Both these issues are important in performing correlation analysis. A single outlier observation can change the correlation coefficient from significant to not significant and even from negative (positive) to positive (negative). Even if the correlation coefficient is significant, the researcher would want to make sure there is a reason for a relationship and that the correlation is not caused by chance. (Study Session 3, LOS 11.b)

If the large capitalization index has a 10% return, then the forecast of the fund’s return will be:

13.5.

12.2.

16.1.

The forecast is 12.209 = 0.149 + 1.206 × 10, so the answer is 12.2. (Study Session 3, LOS 11.h)

The standard error of the estimate is:

9.62.

1.81.

0.56.

SEE equals the square root of the MSE, which on the ANOVA table is 72.466 / 22 = 3.294. The SEE is 1.81 = (3.294)(0.5). (Study Session 3, LOS 11.i)

A 95% confidence interval for the slope coefficient is:

A)	0.734 to 1.677.

B)	0.905 to 1.506.

C)	0.760 to 1.650.

The 95% confidence interval is 1.2056 ± (2.074 × 0.2275). (Study Session 3, LOS 11.f)

Of the four caveats of regression analysis listed by Standish, the least accurate is:

A)	if the error terms are heteroskedastic the test statistics for the equation may not be reliable.

B)	the relationships of variables change over time.

C)	the choice to use a t-statistic or F-statistic for a forecast confidence interval is arbitrary.

The t-statistic is used for constructing the confidence interval for the forecast. The F-statistic is not used for this purpose. The other possible shortfalls listed are valid. (Study Session 3, LOS 11.i)

UID: 137525
帖子: 5724
主题: 885
注册时间: 2009-7-1
最后登录: 2011-3-29

13^#

发表于 2010-4-8 12:10 | 只看该作者

What does the R² of a simple regression of two variables measure and what calculation is used to equate the correlation coefficient to the coefficient of determination?

R² measures:

Correlation coefficient

percent of variability of the independent variable that is explained by the variability of the dependent variable

R² = r²

percent of variability of the dependent variable that is explained by the variability of the independent variable

R² = r²

percent of variability of the independent variable that is explained by the variability of the dependent variable

UID: 137525
帖子: 5724
主题: 885
注册时间: 2009-7-1
最后登录: 2011-3-29

12^#

发表于 2010-4-8 12:02 | 只看该作者

Consider the following estimated regression equation:

AUTO_t = 0.89 + 1.32 PI_t

The standard error of the coefficient is 0.42 and the number of observations is 22. The 95% confidence interval for the slope coefficient, b₁, is:

A)	{-0.766 < b₁ < 3.406}.

B)	{0.444 < b₁ < 2.196}.

C)	{0.480 < b₁ < 2.160}.

UID: 137525
帖子: 5724
主题: 885
注册时间: 2009-7-1
最后登录: 2011-3-29

11^#

发表于 2010-4-8 12:02 | 只看该作者

Consider the following estimated regression equation:

AUTO_t = 0.89 + 1.32 PI_t

The standard error of the coefficient is 0.42 and the number of observations is 22. The 95% confidence interval for the slope coefficient, b₁, is:

A)	{-0.766 < b₁ < 3.406}.

B)	{0.444 < b₁ < 2.196}.

C)	{0.480 < b₁ < 2.160}.

The degrees of freedom are found by n-k-1 with k being the number of independent variables or 1 in this case. DF = 22-1-1 = 20. Looking up 20 degrees of freedom on the student's t distribution for a 95% confidence level and a 2 tailed test gives us a critical value of 2.086. The confidence interval is 1.32 ± 2.086 (0.42), or {0.444 < b₁ < 2.196}.