Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

agg(did_res,:rel) BoundsError #29

Open
eohne opened this issue Jul 24, 2024 · 3 comments
Open

agg(did_res,:rel) BoundsError #29

eohne opened this issue Jul 24, 2024 · 3 comments

Comments

@eohne
Copy link

eohne commented Jul 24, 2024

Hi cool package!
I just ran into an issue when trying to use the agg() function. The result of the @did call (using dynamic treatment) has 6965 treatweights and treatcounts but only 6942 coefficients. Not sure how that happened but it leads to a BoundsError when trying to aggregate.
Any suggestions?

Thanks in advance

Edit: I set the cohort variable for the never treated group to the last t in my sample (same as in your example) and the data contains no missing values when passed to the @did function.

@junyuan-chen
Copy link
Member

Somehow there is some collinearity going on with the dummy variables and some of them are dropped when doing the OLS. Did you include additional regressors?

@eohne
Copy link
Author

eohne commented Jul 24, 2024

Thanks for the rapid reply.

I tried both with and without including additional regressors. If I include additional regressors only 4 items are missing from the coef vector rather than 23 without using additional regressors. I find this a bit strange - I would have thought more (or the same number of) coefficent etstimates would be missing if the issue was collinearity.

I would compare to the R fixest package but run out of memory there. Will do some further testing and report back.
Just thought you may have encountered this before.

@junyuan-chen
Copy link
Member

It still sounds to me like something about collinearity. The way how the collinear columns are dropped may not be so "exact" if there is any collinearity going on.

Some of the columns can be dropped in the following lines:

basiscols = basis!(Symmetric(copy(crossx)); has_intercept = has_intercept)
if !all(basiscols)
X = X[:, basiscols]
crossx = crossx[basiscols, basiscols]
end
crossx = Symmetric(crossx)

I would suggest trying to inspect exactly which columns are dropped and then take a look at the sample to get a better idea on how this can happens. There is a keepall=true option with @did that could be helpful to preserve basiscols. You would add a bracket like this: @did [keepall = true] ....

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants