




版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡介
1、Analysis of Cross Section and Panel Data,Yan Zhang School of Economics, Fudan University CCER, Fudan University,Introductory EconometricsA Modern Approach,Yan Zhang School of Economics, Fudan University CCER, Fudan University,Analysis of Cross Section and Panel Data,Part 1. Regression Analysis on Cr
2、oss Sectional Data,Chap 2. The Simple Regression ModelPractice for learning multiple Regression,Bivariate linear regression model :the slope parameter in the relationship between y and x holding the other factors in u fixed; it is of primary interest in applied economics. :the intercept parameter, a
3、lso has its uses, although it is rarely central to an analysis.,More Discussion,:A one-unit change in x has the same effect on y, regardless of the initial value of x. Increasing returns: wage-education (f. form) Can we draw ceteris paribus conclusions about how x affects y from a random sample of d
4、ata, when we are ignoring all the other factors? Only if we make an assumption restricting how the unobservable random variable u is related to the explanatory variable x,Classical Regression Assumptions,Feasible assumption if the intercept term is included Linearly uncorrelated zero conditional exp
5、ectation Meaning = 內(nèi)生性 PRF (Population Regression Function): sth. fixed but unknown,OLS,Minimize uu sample regression function (SRF) The point is always on the OLS regression line.,擬合值與殘差,PRF:,OLS,Coefficient of determination the fraction of the sample variation in y that is explained by x. the squa
6、re of the sample correlation coefficient between and Low R-squareds,Units of Measurement,If one of the dependent variables is multiplied by the constant cwhich means each value in the sample is multiplied by cthen the OLS intercept and slope estimates are also multiplied by c. If one of the independ
7、ent variables is divided or multiplied by some nonzero constant, c, then its OLS slope coefficient is also multiplied or divided by c respectively. The goodness-of-fit of the model, R-squareds, should not depend on the units of measurement of our variables.,Function Form,Linear Nonlinear Logarithmic
8、 dependent variable A Percentage change in y, semi-elasticity an increasing return to edu. Other nonlinearity: diploma effect Bi-Logarithmic A a Constant elasticity Change of units of measurement P45, error: b0*b0+log(c1)-b1log(c2),Bi-Logarithmic A a Constant elasticity Change of units of measuremen
9、t P45, error b0*b0+log(c1)-b1log(c2) Be proficient at interpreting the coef.,Unbiasedness of OLS Estimators,Statistical properties of OLS 從總體中隨機(jī)抽樣取出的不同樣本的OLS估計(jì) 的分布性質(zhì) Assumptions Linear in parameters (f. form; advanced methods) Random sampling (time series data; nonrandom sampling) Zero conditional m
10、ean (unbiased biased; spurious cor) Sample Variation in the independent variables (colinearity) Theorem (Unbiasedness) Under the four assumptions above, we have:,Variance of OLS Estimators,的隨機(jī)抽樣以 為中心,問題是 究竟距離 多遠(yuǎn)? Assumptions Homoskedasticity: Error variance A larger means that the distribution of th
11、e unobservables affecting y is more spread out. Theorem (Sampling variance of OLS estimators) Under the five assumptions above:,Variance of y given x,Conditional mean and variance of y: Heteroskedasticity,What does depend on?,More variation in the unobservables affecting y makes it more difficult to
12、 precisely estimate The more spread out is the sample of xi -s, the easier it is to find the relationship between E(y x) and x As the sample size increases, so does the total variation in the xi. Therefore, a larger sample size results in a smaller variance of the estimator,Estimating Error Variance
13、,Errors (Disturbances) and Residuals Errors: , population Residuals: , estimated f. Theorem (The unbiased estimator of ) Under the five assumptions above, we have: standard error of the regression (SER): Estimating the standard deviation in y after the effect of x has been taken out. Standard Error
14、of :,Regression through the Origin,Regression through the Origin: Pass through E.g. income tax revenue income The estimator of OLS: = only if 0 if the intercept 0, then is a biased estimator of,Chap 3. Multiple Regression Analysis:Estimation,Advantages of multiple regression analysis build better mo
15、dels for predicting the dependent variable. E.g. generalize functional form. Marginal propensity to consume Be more amenable to ceteris paribus analysis Chap 3.2 Key assumption: Implication: other factors affecting wage are not related on average to educ and exper. Multiple linear regression model:,
16、:the ceteris paribus effect of xj on y,Ordinary Least Square Estimator,SPF: OLS: Minimize F.O.C: ceteris paribus interpretations: Holding fixed, then Thus, we have controlled for the variables when estimating the effect of x1 on y.,Holding Other Factors Fixed,The power of multiple regression analysi
17、s is that it provides this ceteris paribus interpretation even though the data have not been collected in a ceteris paribus fashion. it allows us to do in non-experimental environments what natural scientists are able to do in a controlled laboratory setting: keep other factors fixed.,OLS and Ceteri
18、s Paribus Effects,Step of OLS: (1) :the OLS residuals from a multiple regression of x1 on (2) :the OLS estimator from a simple regression of y on measures the effect of x1 on y after x2, xk have been partialled or netted out. Two special cases in which the simple regression of y on x1 will produce t
19、he same OLS estimate on x1 as the regression of y on x1 and x2.,Goodness-of-fit,also equal the squared correlation coef. between the actual and the fitted values of y. R never decreases, and it usually increases when another independent variable is added to a regression. The factor that should deter
20、mine whether an explanatory variable belongs in a model is whether the explanatory variable has a nonzero partial effect on y in the population.,Regression through the origin,the properties of OLS derived earlier no longer hold for regression through the origin. the OLS residuals no longer have a ze
21、ro sample average. can actually be negative. to calculate it as the squared correlation coefficient if the intercept in the population model is different from zero, then the OLS estimators of the slope parameters will be biased.,The Expectation of OLS Estimator,Assumptions(簡單回歸模型假定的直接推廣;比較) Linear i
22、n parameters Random sampling Zero conditional mean No perfect co-linearity none of the independent variables is constant; and there are no exact linear relationships among the independent variables Theorem (Unbiasedness) Under the four assumptions above, we have:,rank (X)=K,Notice 1: Zero conditiona
23、l mean,Exogenous Endogenous Misspecification of function form (Chap 9) Omitting the quadratic term The level or log of variable Omitting important factors that correlated with any independent v. 如果被遺漏的變量與解釋變量相關(guān),則零條件方差不成立,回歸結(jié)果有偏 Measurement Error (Chap 15, IV) Simultaneously determining one or more x
24、-s with y (Chap 16, 聯(lián)立方程組),Omitted Variable Bias: The Simple Case,Problem:Excluding a relevant variable or Under-specifying the model(遺漏本來應(yīng)該包括在總體(真實(shí))模型中的變量) Omitted Variable Bias (misspecification analysis) The true population model: The underspecified OLS line: The expectation of : The Omitted vari
25、able bias:,前面3.2節(jié)中是x1對x2回歸,Omitted Variable Bias: Nonexistence,Two cases where is unbiased: The true population model: is the sample covariance between x1 and x2 over the sample variance of x1 If , then 的無偏性與x2無關(guān),估計(jì)時(shí)只需調(diào)整截距,將x2放入誤差項(xiàng)不影響零條件均值假定 Summary of Omitted Variable Bias: The expectation of : The
26、 Omitted variable bias:,The Size of Omitted Variable Bias,Direction Size A small bias of either sign need not be a cause for concern. Unknown Some idea we usually have a pretty good idea about the direction of the partial effect of x2 on y, that is, the sign of in many cases we can make an educated
27、guess about whether x1 and x2 are positively or negatively correlated. E.g. (Upward/downward Bias; biased toward zero),高估!,Omitted Variable Bias: More General Cases,Suppose: x2 and x3 are uncorrelated, but that x1 is correlated with x3. Both and will normally be biased. The only exception to this is
28、 when x1 and x2 are also uncorrelated. Difficult to obtain the direction of the bias in and Approximation: if x1 and x2 are also uncor.,Notice 2: No Perfect Collinearity,An assumption only about x-s, nothing about the relationship between u and x-s Assumption MLR.4 does allow the independent variabl
29、es to be correlated; they just cannot be perfectly correlated. Ceteris Paribus effect If we did not allow for any correlation among the independent variables, then multiple regression would not be very useful for econometric analysis. Significance,Cases of Perfect Collinearity,When can independent v
30、ariables be perfectly collinear software“singular” Nonlinear functions of the same variable is not an exact linear f. Not to include the same explanatory variable measured in different units in the same regression equation. More subtle ways one independent variable can be expressed as an exact linea
31、r function of some or all of the other independent variables. Drop it Key:,Notice 3: Unbiase,the meaning of unbiasedness: an estimate cannot be unbiased: an estimate is a fixed number, obtained from a particular sample, which usually is not equal to the population parameter. When we say that OLS is
32、unbiased under Assumptions MLR.1 through MLR.4, we mean that the procedure by which the OLS estimates are obtained is unbiased when we view the procedure as being applied across all possible random samples.,Notice 4: Over-Specification,Inclusion of an irrelevant variable or over-specifying the model
33、: does not affect the unbiasedness of the OLS estimators. including irrelevant variables can have undesirable effects on the variances of the OLS estimators.,Variance of The OLS Estimators,Adding Assumptions Homoskedasticity: Error variance A larger means that the distribution of the unobservables a
34、ffecting y is more spread out. Gauss-Markov assumptions (for cross-sectional regression): Assumption 1-5 Theorem (Sampling variance of OLS estimators) Under the five assumptions above:,More about,The stastical properties of y on x=(x1, x2, , xk) Error variance only one way to reduce the error varian
35、ce: to add more explanatory variablesnot always possible and desirable The total sample variations in xj: SSTj Increase the sample size,Multi-collinearity(多重共線性),The linear relationships among the independent v. 其他解釋變量對xj的擬合優(yōu)度(含截距項(xiàng)) If k=2: :the proportion of the total variation in xj that can be ex
36、plained by the other independent variables : : : High (but not perfect) correlation between two or more of the in dependent variables is called multicollinearity.,Micro-numerosity: problem of small sample size,High Low SSTj one thing is clear: everything else being equal, for estimating j, it is better to have less correlation between xj and the other x-s. How to “solve” the multicollinearity? Increase sample size Dropping som
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025年點(diǎn)火系統(tǒng):點(diǎn)火開關(guān)項(xiàng)目合作計(jì)劃書
- 2025年紅外線汽車尾氣分析儀項(xiàng)目合作計(jì)劃書
- 教育技術(shù)與OER的深度融合研究
- 2025年年中國電子政務(wù)合作協(xié)議書
- 腦波技術(shù)在學(xué)習(xí)障礙診斷中的應(yīng)用前景
- 2025年廣東省深圳市龍文一對一物理高一下期末質(zhì)量檢測試題含解析
- 湖北省漢川二中2025年物理高二下期末檢測模擬試題含解析
- 商業(yè)洞察生物機(jī)制驅(qū)動的可持續(xù)發(fā)展戰(zhàn)略
- 技術(shù)引領(lǐng)未來中醫(yī)傳承在智能教育領(lǐng)域的發(fā)展
- 2025年湖南省東安一中高一物理第二學(xué)期期末質(zhì)量檢測模擬試題含解析
- 三年級上冊萬以內(nèi)的加減法計(jì)算題200道及答案
- 北京市《配電室安全管理規(guī)范》(DB11T 527-2021)地方標(biāo)準(zhǔn)
- 湖南省邵陽市海誼中學(xué)2024-2025學(xué)年高一上學(xué)期分班考試化學(xué)試題(原卷版)
- 三年級數(shù)學(xué)下冊計(jì)算題大全(每日一練共23份)
- 湖南省邵陽市海誼中學(xué)2024-2025學(xué)年高一上學(xué)期(開學(xué))分班考試物理試題(解析版)
- 交通信號統(tǒng)一管控優(yōu)化平臺
- 個人農(nóng)村承包土地合同范本
- 光伏工程合同
- 海姆立克評分標(biāo)準(zhǔn)
- 碳匯經(jīng)濟(jì)與政策智慧樹知到期末考試答案章節(jié)答案2024年浙江農(nóng)林大學(xué)
- 支付寶商戶經(jīng)營模式說明模版
評論
0/150
提交評論