上一主题:Quantitative Analysis【Reading 13】Sample
下一主题:Quantitative Analysis 【Reading 11】Sample
返回列表 发帖
The management of a large restaurant chain believes that revenue growth is dependent upon the month of the year. Using a standard 12 month calendar, how many dummy variables must be used in a regression model that will test whether revenue growth differs by month?
A)
13.
B)
11.
C)
12.



The appropriate number of dummy variables is one less than the number of categories because the intercept captures the effect of the other effect. With 12 categories (months) the appropriate number of dummy variables is 11 = 12 – 1. If the number of dummy variables equals the number of categories, it is possible to state any one of the independent dummy variables in terms of the others. This is a violation of the assumption of the multiple linear regression model that none of the independent variables are linearly related

TOP

A fund has changed managers twice during the past 10 years. An analyst wishes to measure whether either of the changes in managers has had an impact on performance. The analyst wishes to simultaneously measure the impact of risk on the fund’s return. R is the return on the fund, and M is the return on a market index. Which of the following regression equations can appropriately measure the desired impacts?
A)
R = a + bM + c1D1 + c2D2 + c3D3 + ε, where D1 = 1 if the return is from the first manager, and D2 = 1 if the return is from the second manager, and D3 = 1 is the return is from the third manager.
B)
The desired impact cannot be measured.
C)
R = a + bM + c1D1 + c2D2 + ε, where D1 = 1 if the return is from the first manager, and D2 = 1 if the return is from the third manager.



The effect needs to be measured by two distinct dummy variables. The use of three variables will cause collinearity, and the use of one dummy variable will not appropriately specify the manager impact.

TOP

Jill Wentraub is an analyst with the retail industry. She is modeling a company’s sales over time and has noticed a quarterly seasonal pattern. If she includes dummy variables to represent the seasonality component of the sales she must use:
A)
three dummy variables.
B)
one dummy variables.
C)
four dummy variables.



Three. Always use one less dummy variable than the number of possibilities. For a seasonality that varies by quarters in the year, three dummy variables are needed.

TOP

Consider the following model of earnings (EPS) regressed against dummy variables for the quarters:

EPSt = α + β1Q1t + β2Q2t + β3Q3t
where:
EPSt is a quarterly observation of earnings per share
Q1t takes on a value of 1 if period t is the second quarter, 0 otherwise
Q2t takes on a value of 1 if period t is the third quarter, 0 otherwise
Q3t takes on a value of 1 if period t is the fourth quarter, 0 otherwise

Which of the following statements regarding this model is most accurate? The:
A)
EPS for the first quarter is represented by the residual.
B)
significance of the coefficients cannot be interpreted in the case of dummy variables.
C)
coefficient on each dummy tells us about the difference in earnings per share between the respective quarter and the one left out (first quarter in this case).





The coefficients on the dummy variables indicate the difference in EPS for a given quarter, relative to the first quarter.

TOP

An analyst wishes to test whether the stock returns of two portfolio managers provide different average returns. The analyst believes that the portfolio managers’ returns are related to other factors as well. Which of the following can provide a suitable test?
A)
Paired-comparisons.
B)
Difference of means.
C)
Dummy variable regression.



The difference of means and paired-comparisons tests will not account for the other factors

TOP

An analyst is trying to determine whether fund return performance is persistent. The analyst divides funds into three groups based on whether their return performance was in the top third (group 1), middle third (group 2), or bottom third (group 3) during the previous year. The manager then creates the following equation: R = a + b1D1 + b2D2 + b3D3 + ε, where R is return premium on the fund (the return minus the return on the S&P 500 benchmark) and Di is equal to 1 if the fund is in group i. Assuming no other information, this equation will suffer from:
A)
heteroskedasticity.
B)
serial correlation.
C)
multicollinearity.


When we use dummy variables, we have to use one less than the states of the world. In this case, there are three states (groups) possible. We should have used only two dummy variables. Multicollinearity is a problem in this case. Specifically, a linear combination of independent variables is perfectly correlated. X1 + X2 + X3 = 1.
There are too many dummy variables specified, so the equation will suffer from multicollinearity

TOP

Suppose the analyst wants to add a dummy variable for whether a person has an undergraduate college degree and a graduate degree. What is the CORRECT representation if a person has both degrees?
Undergraduate Degree
Dummy Variable
Graduate Degree
Dummy Variable
A)
11
B)
00
C)
01



Assigning a zero to both categories is appropriate for someone with neither degree. Assigning one to the undergraduate category and zero to the graduate category is appropriate for someone with only an undergraduate degree. Assigning zero to the undergraduate category and one to the graduate category is appropriate for someone with only a graduate degree. Assigning a one to both categories is correct since it reflects the possession of both degrees.

TOP

The amount of the State of Florida’s total revenue that is allocated to the education budget is believed to be dependent upon the total revenue for the year and the political party that controls the state legislature. Which of the following regression models is most appropriate for capturing the effect of the political party on the education budget? Assume Yt is the amount of the education budget for Florida in year t, X is Florida’s total revenue in year t, and Dt = {1 if the legislature has a Democratic majority in year t, 0 otherwise}.
A)
Yt = b1Dt + b2Xt + et.
B)
Yt = b0 + b1Dt + b2Xt + et.
C)
Yt = b0 + b1Dt + et.


In this application, b0, b1, and b2 are estimated by regressing Yt against a constant, Dt, and Xt.The estimated relationships for the two parties are:
Non-Democrats: Ŷ = b0 + b2Xt
Democrats: Ŷ = (b0 + b1) + b2Xt

TOP

Raul Gloucester, CFA, is analyzing the returns of a fund that his company offers. He tests the fund’s sensitivity to a small capitalization index and a large capitalization index, as well as to whether the January effect plays a role in the fund’s performance. He uses two years of monthly returns data, and runs a regression of the fund’s return on the indexes and a January-effect qualitative variable. The “January” variable is 1 for the month of January and zero for all other months. The results of the regression are shown in the tables below.
Regression Statistics

Multiple R

0.817088

R2

0.667632

Adjusted R2

0.617777

Standard Error

1.655891

Observations

24


ANOVA

df

SS

MS


Regression

3

110.1568

36.71895


Residual

20

54.8395

2.741975


Total

23

164.9963




Coefficients

Standard Error

t-Statistic


Intercept

-0.23821

0.388717

-0.61282


January

2.560552

1.232634

2.077301


Small Cap Index

0.231349

0.123007

1.880778


Large Cap Index

0.951515

0.254528

3.738359


Gloucester will perform an F-test for the equation. He also plans to test for serial correlation and conditional and unconditional heteroskedasticity.
Jason Brown, CFA, is interested in Gloucester’s results. He speculates that they are economically significant in that excess returns could be earned by shorting the large capitalization and the small capitalization indexes in the month of January and using the proceeds to buy the fund. The percent of the variation in the fund’s return that is explained by the regression is:
A)
66.76%.
B)
81.71%.
C)
61.78%.



The R2 tells us how much of the change in the dependent variable is explained by the changes in the independent variables in the regression: 0.667632.

In a two-tailed test at a five percent level of significance, the coefficients that are significant are:
A)
the large cap index only.
B)
the January effect and the small capitalization index only.
C)
the January effect and the large capitalization index only.



For a two-tailed test with 20 = 24 – 3 – 1 degrees of freedom and a five percent level of significance, the critical t-statistic is 2.086. Only the coefficient for the large capitalization index has a t-statistic larger than this.

Which of the following best summarizes the results of an F-test (5 percent significance) for the regression? The F-statistic is:
A)
13.39 and the critical value is 3.10.
B)
9.05 and the critical value is 3.86.
C)
13.39 and the critical value is 3.86.



The F-statistic is the ratio of the Mean Square of the Regression divided by the Mean Square Error (Residual): 13.39 = 36.718946 / 2.74197510. The F-statistic has 3 and 20 degrees of freedom, so the critical value, at a 5 percent level of significance = 3.10.

The best test for unconditional heteroskedasticity is:
A)
the Durbin-Watson test only.
B)
neither the Durbin-Watson test nor the Breusch-Pagan test.
C)
the Breusch-Pagan test only.



The Durbin-Watson test is for serial correlation. The Breusch-Pagan test is for conditional heteroskedasticity; it tests to see if the size of the independent variables influences the size of the residuals. Although tests for unconditional heteroskedasticity exist, they are not part of the CFA curriculum, and unconditional heteroskedasticity is generally considered less serious than conditional heteroskedasticity.

In the month of January, if both the small and large capitalization index have a zero return, we would expect the fund to have a return equal to:
A)
2.799.
B)
2.322.
C)
2.561.



The forecast of the return of the fund would be the intercept plus the coefficient on the January effect: 2.322 = -0.238214 + 2.560552.

Assuming (for this question only) that the F-test was significant but that the t-tests of the independent variables were insignificant, this would most likely suggest:
A)
serial correlation.
B)
conditional heteroskedasticity.
C)
multicollinearity.



When the F-test and the t-tests conflict, multicollinearity is indicated.

TOP

John Rains, CFA, is a professor of finance at a large university located in the Eastern United States. He is actively involved with his local chapter of the Society of Financial Analysts. Recently, he was asked to teach one session of a Society-sponsored CFA review course, specifically teaching the class addressing the topic of quantitative analysis. Based upon his familiarity with the CFA exam, he decides that the first part of the session should be a review of the basic elements of quantitative analysis, such as hypothesis testing, regression and multiple regression analysis. He would like to devote the second half of the review session to the practical application of the topics he covered in the first half.
Rains decides to construct a sample regression analysis case study for his students in order to demonstrate a “real-life” application of the concepts. He begins by compiling financial information on a fictitious company called Big Rig, Inc. According to the case study, Big Rig is the primary producer of the equipment used in the exploration for and drilling of new oil and gas wells in the United States. Rains has based the information in the problem on an actual equity holding in his personal portfolio, but has simplified the data for the purposes of the review course.
Rains constructs a basic regression model for Big Rig in order to estimate its profitability (in millions), using two independent variables: the number of new wells drilled in the U.S. (WLS) and the number of new competitors (COMP) entering the market:

Profits = b0 + b1WLS – b2COMP + ε

Based on the model, the estimated regression equation is:

Profits = 22.5 + 0.98(WLS) − 0.35(COMP)

Using the past 5 years of quarterly data, he calculated the following regression estimates for Big Rig, Inc:

Coefficient

Standard Error


Intercept

22.5

2.465


WLS

0.98

0.683


COMP

0.35

0.186

Using the information presented, the t-statistic for the number of new competitors (COMP) coefficient is:
A)
1.435.
B)
1.882.
C)
9.128.



To test whether a coefficient is statistically significant, the null hypothesis is that the slope coefficient is zero. The t-statistic for the COMP coefficient is calculated as follows:
(0.35 – 0.0) / 0.186 = 1.882
(Study Session 3, LOS 11.g)


Rains asks his students to test the null hypothesis that states for every new well drilled, profits will be increased by the given multiple of the coefficient, all other factors remaining constant. The appropriate hypotheses for this two-tailed test can best be stated as:
A)
H0: b1 ≤ 0.98 versus Ha: b1 > 0.98.
B)
H0: b1 = 0.35 versus Ha: b1 ≠ 0.35.
C)
H0: b1 = 0.98 versus Ha: b1 ≠ 0.98.



The coefficient given in the above table for the number of new wells drilled (WLS) is 0.98. The hypothesis should test to see whether the coefficient is indeed equal to 0.98 or is equal to some other value. Note that hypotheses with the “greater than” or “less than” symbol are used with one-tailed tests. (Study Session 3, LOS 11.g)

Continuing with the analysis of Big Rig, Rains asks his students to calculate the mean squared error(MSE). Assume that the sum of squared errors (SSE) for the regression model is 359.
A)
21.118.
B)
17.956.
C)
18.896.



The MSE is calculated as SSE / (n – k – 1). Recall that there are twenty observations and two independent variables. Therefore, the SEE in this instance = 359 / (20 – 2 − 1) = 21.118. (Study Session 3, LOS 11.i)

Rains now wants to test the students’ knowledge of the use of the F-test and the interpretation of the F-statistic. Which of the following statements regarding the F-test and the F-statistic is the most correct?
A)
The F-test is usually formulated as a two-tailed test.
B)
The F-statistic is used to test whether at least one independent variable in a set of independent variables explains a significant portion of the variation of the dependent variable.
C)
The F-statistic is almost always formulated to test each independent variable separately, in order to identify which variable is the most statistically significant.



An F-test assesses how well a set of independent variables, as a group, explains the variation in the dependent variable. It tests all independent variables as a group, and is always a one-tailed test. The decision rule is to reject the null hypothesis if the calculated F-value is greater than the critical F-value. (Study Session 3, LOS 11.i)

One of the main assumptions of a multiple regression model is that the variance of the residuals is constant across all observations in the sample. A violation of the assumption is known as:
A)
robust standard errors.
B)
heteroskedasticity.
C)
positive serial correlation.



Heteroskedasticity is present when the variance of the residuals is not the same across all observations in the sample, and there are sub-samples that are more spread out than the rest of the sample. (Study Session 3, LOS 12.i)

Rains reminds his students that a common condition that can distort the results of a regression analysis is referred to as serial correlation. The presence of serial correlation can be detected through the use of:
A)
the Breusch-Pagen test.
B)
the Hansen method.
C)
the Durbin-Watson statistic.



The Durbin-Watson test (DW ≈ 2(1 − r)) can detect serial correlation. Another commonly used method is to visually inspect a scatter plot of residuals over time. The Hansen method does not detect serial correlation, but can be used to remedy the situation. Note that the Breusch-Pagen test is used to detect heteroskedasticity. (Study Session 3, LOS 12.i)

TOP

返回列表
上一主题:Quantitative Analysis【Reading 13】Sample
下一主题:Quantitative Analysis 【Reading 11】Sample