STAT 331 Project

download STAT 331 Project

of 22

Transcript of STAT 331 Project

  • 8/9/2019 STAT 331 Project

    1/22

    STAT 331 ProjectAuto sales vs Oil Prices

    Team Members

    Salman Asif, Jawed Karim, Umer Iqbal,

    Gautam Karwa, Nicky Chen, Chulmin Lee

  • 8/9/2019 STAT 331 Project

    2/22

    Introduction Purpose of project

    Analyze the relationship between autosales and oil prices, electronic goodssales and sports sales

    Regressors

    Electronic sales and sports goods sales

  • 8/9/2019 STAT 331 Project

    3/22

    Source of Data

    Our datasets are U.S. monthly auto sales for retail, U.S.monthly gasoline price, U.S. monthly electronics sales,and U.S. monthly sporting goods sales from Jan, 1992 to

    Aug, 2008. The gasoline price is U.S. city average.

    Source: U.S. Census, and U.S. Bureau of LaborStatistics

  • 8/9/2019 STAT 331 Project

    4/22

    Analysis Selection of model

    Based on R-squared

    Methods used in the analysis Estimation and Significance of Regressors

    Residual Diagnostics

    Prediction intervals

    Plots and visuals

  • 8/9/2019 STAT 331 Project

    5/22

    Proposed Model AUTO SALES = 0 + 1x OIL PRICE +

    2x ESALES + 3x SSALES + ei

    Where AUTO SALES = Monthly auto sales in US ($ Millions)

    OIL PRICES = Average monthly oil prices ($/gallon)

    ESALES = Electronic goods sales ($ Millions)

    SSALES = Sports goods sales ($ Millions)

  • 8/9/2019 STAT 331 Project

    6/22

    Diagnostics Homoscedasity

    Residual vs Fitted plot

  • 8/9/2019 STAT 331 Project

    7/22

    Diagnostics Homoscedasity

    Scatter plot of residuals

  • 8/9/2019 STAT 331 Project

    8/22

    Diagnostics Auto Correlation

    DW test

    Durbin-Watson testdata: AutoSales ~ Gasoline + Electronic + Sporting

    DW = 0.8647, p-value < 2.2e-16

    alternative hypothesis: true autocorrelation is greater than 0

    Significant p-value

  • 8/9/2019 STAT 331 Project

    9/22

    Diagnostics Auto Correlation

    The runs test

    Runs Test - Two sided

    data: l$residuals

    StandardizedRuns

    Statistic = -5.8129, p-value = 6.14e-09

    Significant p-value

  • 8/9/2019 STAT 331 Project

    10/22

    Diagnostics Auto Correlation

    ACF plot

    0 5 10 15 20

    0.0

    0.2

    0.4

    0.6

    0.8

    1.0

    Lag

    ACF

    ACF plot of residuals

  • 8/9/2019 STAT 331 Project

    11/22

    Diagnostics Normality

    Histogram of Residuals

  • 8/9/2019 STAT 331 Project

    12/22

    Diagnostics Normality

    QQ Plot

  • 8/9/2019 STAT 331 Project

    13/22

    Diagnostics Normality

    Shapiro-Wilk Test

    Shapiro-Wilk normality test

    data: l$residuals

    W = 0.9259, p-value = 1.608e-08

  • 8/9/2019 STAT 331 Project

    14/22

    Normalizing the Residuals Shapiro-Wilk Test

    Shapiro-Wilk normality testdata: L$residuals

    W = 0.9939, p-value = 0.6147

  • 8/9/2019 STAT 331 Project

    15/22

    Normalizing the Residuals Histogram of Residuals

    Histogram of Residuals

    Residuals

    Frequency

    -6000 -4000 -2000 0 2000 4000

    0

    10

    20

    30

    40

  • 8/9/2019 STAT 331 Project

    16/22

    Normalizing the Residuals QQ plot

    -3 -2 -1 0 1 2 3

    -6000

    -4000

    -2000

    0

    2000

    4000

    Normal Q-Q Plot

    Theoretical Quantiles

    SampleQuantiles

  • 8/9/2019 STAT 331 Project

    17/22

    Results of Diagnostics Homoscedasticity

    Residuals are not Homoscedastic

    Correlation Residuals are Correlated

    Normality

    Residuals are Normal

  • 8/9/2019 STAT 331 Project

    18/22

    Implications Observed that increase in electronic

    goods sales and/or sports sales

    causes an increase in auto sales Decrease in oil prices results in

    increase in Auto sales

    Most Important Regressor is Oil Prices

    Proposed model is useful forpredicting auto sales

  • 8/9/2019 STAT 331 Project

    19/22

    Pros and cons of Model Pros

    R-squared value is high

    Regressors are significant Model has good predictive value

    Cons

    Residuals are not homoscedastic and not

    uncorrelated

  • 8/9/2019 STAT 331 Project

    20/22

    Conclusions Oil Prices very good at predicting auto

    sales

    Positive relationship between autosales and electronic goods andsporting goods

    Proposed model fits the data very

    well Residuals are normal but not

    homoscedastic and uncorrelated

  • 8/9/2019 STAT 331 Project

    21/22

    Recommendations Increase our sample size

    Add more regressors

    Perform variance-stabilizingtransformations

  • 8/9/2019 STAT 331 Project

    22/22

    Questions?