Clarification of Question by
keithp-ga
on
24 Aug 2004 13:38 PDT
--I have: observations of different sales transactions (first
dimension) over time (second dimension) with other dimensions - about
25 in all.
When you run a regression do you pool all data (i.e. aggregate all
individuals/firms) or do you run separate regressions for all
individuals/firms? In the former case you might not get consistent
estimates (i.e. your results may be wrong), unless some additional
assumptions hold (for instance, all individuals/firms must have
similar means). I am not sure, however, whether you can run panel data
estimation with Excel's Analyze-It...
----Currently, I run separate regressions for each month to create a
sales forecast formula. The mean has been nearly identical across all
samples. Here is my dilemma - hypothetically,
-I use data from July, where n=500, only and get .80 r-squared.
-If I add in June data then run the model with June and July (n=1000),
I get r-squared of .70
-If I repeat this process of adding in months of data, the sample size
increases, but the r-squared decreases
-Theoretically, if I use the past 12 months of data, adding a new
month and removing the oldest month, the regression model from month
to month results in fewer fluctuations for Y as a result
-This tells me that I would like to use older data to increase the
sample size, but weight the newest data more and the older data less
to reduce the fluctuations from month to month and create a more
consistent model. But how do I determine the weight for each month and
how do I incorporate that into the regression model?