返回列表 发帖

Detecting Multicollinearity - What's The R^2?

So a high R^2 is a sign of multicollinearity. What is the cutoff for this? Is 0.69 not multicollinearity while 0.70 is??

Paraguay Wrote:
-------------------------------------------------------
> If there are only 2 independent variables pairwise
> correlation is all you need.
>
> When adding variables past two you need to look
> for significant f, non-significant t-values.

+1

I answered C in the Mock and went back to cfai book and had to read that part how to detect multico. several times to find the little sentence about two independent variables and ok to use correl

TOP

i just looked over it, R^2 in a linear regression with one independent variable is the coefficient determination. Correlation is the Square root of that, in a single linear regression model.

TOP

I believe the pairwise correlation needs to be .7 and up to be considered high also

TOP

Book mentions pairwise with multiple variables, read fine print on only two independent .

TOP

Right answer is :

B. correlation between S&P500 and SPREAD is low.

The fine print gives the correlation between SPREAD and S&P to be 0.3, which supposedly is low.

TOP

Should be B. Def of multicollinearity is that variables are highly correlated.

R^2 that is high and an adjusted R^2 that is extremely low (very rare).

TOP

i looked it up, the correlation is .3
the r^2 is .40

the answer B

They don't mention r^2 at all in the explanation. I don't see a relation in the answer.

TOP

definitive answer from the book:

High R^2 and significant F-statistic even though the t--statistics on the slope coefficients are themselves not significant (indicating inflated std. errors for the slope coeffs).

Also they specifically also write - magnitude of pairwise correlations between independent variables has occasionally been suggested to assess multicoll. but is generally not adequate. It is not necessary that the pairwise correlations be high for there to be a multicoll. problem.

so based on the above
B) is eliminated.
C) F-value is high, but p-values are low - which means that the S&P500 and SPREAD are significant variables - so that too is out.

has to be A) by elimination.

CP

TOP

Try q. 42 in CFAI Mock 2010 Afternoon . You'll know what I mean.

Here 's the question:

Because there are only two independent variables in her regression, Hamilton’s most appropriate conclusion is that multicollinearity is least likely a problem, based on the observation that the:
A. model R2 is relatively low.
B. correlation between S&P500 and SPREAD is low.
C. model F-value is high and the p-values for S&P500 and SPREAD are low.

Guess the answer?

TOP

返回列表