-
Notifications
You must be signed in to change notification settings - Fork 485
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Example files are using legacy timezone names (US/Pacific) #2049
Comments
Thank you for reporting, @bdice . cc @williamhyun , @wgtmac , too. |
To @bdice , according to our official Java tool, the type of column
Please see here. Given that there is no timezone, I'm not sure if the root cause is the file.
Instead, it looks like the C++ library side issue because |
It looks like a breaking change of timezone name from TZDB. I will take a look. cc @ffacs |
Thank you so much, @wgtmac . |
https://bugs.launchpad.net/ubuntu/+source/tzdata/+bug/2058249 has explained the root cause that IMO, we should not change |
That is fine with me! I have worked around this by installing Another possible course of action would be to leave 2038 test file outputUsing
|
@bdice I think we can keep those files are they are created by legacy writers: |
Thank you all. Let me close this issue because it seems that we agree that the old files should be kept in AS-IS. Feel free to make a PR for the newly proposed file. |
The example ORC files use a timezone of
US/Pacific
which is no longer included in all Linux distributions. Ubuntu 24.04, for example, has moved this to a separatetzdata-legacy
package. This can cause issues for ORC file readers on systems missing that legacy time zone data.Should the example ORC files be updated to use a more current time zone name, like
America/Los_Angeles
?Verifying the time zone in the stripe footers:
wget https://github.com/apache/orc/raw/refs/heads/main/examples/TestOrcFile.testDate1900.orc orc-metadata -v TestOrcFile.testDate1900.orc # Shows stripe footers with "timezone": "US/Pacific"
Additional context
https://bugs.launchpad.net/ubuntu/+source/tzdata/+bug/2058249
apache/arrow#40633
pandas-dev/pandas#56292
rapidsai/cudf#16998 (comment)
The text was updated successfully, but these errors were encountered: