Term
|
Definition
|
|
Term
|
Definition
|
|
Term
|
Definition
|
|
Term
|
Definition
|
|
Term
|
Definition
Value of Beta 1 tells you "if the value of x goes up by one unit, you expect y to go up by a certain (slope) amount" Not causation, correlation |
|
|
Term
|
Definition
|
|
Term
|
Definition
Beta not does not mean a whole lot oftentimes |
|
|
Term
|
Definition
|
|
Term
Partial regression coefficients |
|
Definition
They are the change in y expected when u change the x 1 unit, all others held constant Beta 1 hat, beta 2 hat... |
|
|
Term
|
Definition
y-y hat how much you missed it by Method of least squares adds them all up and creates one unique line |
|
|
Term
|
Definition
Residual distributed normal Independent (If you have cross sextional data) mean = 0, constant variance (in between those 2 bars) |
|
|
Term
|
Definition
x values are correlated can cause problems don't assume they are and then throw them out If the sign is backwards (-) means they may be tied together Will have to illiminate one and keep the others If you leave it, it inflates the standard error of the estimate |
|
|
Term
Standard error of the estimate |
|
Definition
how accurate your model is, st. deviation of residuals Want to be small |
|
|
Term
Heteroscedasticity (homoscedasticity) |
|
Definition
|
|
Term
|
Definition
building the best model you can with the fewest variables you can. You want a good model with 3 than a slightly better model with 5. You run a risk of the variables not being really good when you add ones that help slightly |
|
|
Term
|
Definition
coefficient of determination, tells you how much of the variablility of the y value has been explained by knowing x value. If it is .7, this means that 70% variability of the y values is explained by knowing the x, want to be closer to 1 since that means you are explaining more of the data |
|
|
Term
standard error of the estimates (standard deviation of residuals |
|
Definition
|
|
Term
|
Definition
|
|
Term
t-tests on each coefficient |
|
Definition
Null hypothesis: Beta=0 (reject if p-value less than alpha) |
|
|
Term
Confidence interval on Beta |
|
Definition
shows 95% confident between 2 points Predict only in range of X's in model (extrapolation) extrapolation = ou outside the value of possible x's |
|
|
Term
Prediction interval for one specific observation (not for the mean) |
|
Definition
will be wider interval when you do it on the mean, it will be smaller |
|
|
Term
|
Definition
What to see if its a straight line If not they are not correlated. If they follow a pattern and its not linear, they could still be x^2 or x^3 |
|
|
Term
Correlations of Y and X's |
|
Definition
want to be close to 1 or -1, sign means slope is up or down, also use to see if there is multicollinarity. Even if it is low by itself, it is possible that it may be correlated with the other residuals from the other variables. May be significant with more variables |
|
|
Term
|
Definition
want to check normality, as long as it is not bimodal that's good, you can also see outliers. Do not delete otliers. Would have to convince yourself that it is not part of the population you intend in you data |
|
|
Term
Plot of standardized residuals vs standardized Y's |
|
Definition
Want to see if the residuals are constant. Should be between the 2 horizontal lines. Build 2 models if need be |
|
|
Term
|
Definition
used for qualitative data. 1 less than the number of categories you have. If you have 5, ucan only use 4. Comparing what would happen if you were in the category with the ones in your base model. Some may fail out if they are not significant |
|
|
Term
|
Definition
say a non linear scatter plot, need x^2 or x^3 logs lg:its not x goes up some % and y goes up some % |
|
|
Term
|
Definition
taking varible in data (dependent) to see if sales this month is correlated to past months. Lag 1 is one period back. Lag 2 is 2 months back |
|
|
Term
Regression on time series variable |
|
Definition
Time series: data collected over a certain time frame. There is an order of what data came first. Intervals have to be the same siz. You are extraplating. Historical datat does not always mean future data. Therefore do not extrapolate too far |
|
|
Term
|
Definition
Long term, more than a year (t)=time variable=is used to find trent |
|
|
Term
|
Definition
short term! based on seasons within a given year. Is there differences in months? In quaters? Most companies have seasonality is some form or another. AC repair, retail. Use dummy variables to see which ones are significant |
|
|
Term
|
Definition
residuals are independent of each other. Used with lagged variables |
|
|
Term
|
Definition
smoothes the data based on past history. Stock market often uses this |
|
|