Able to parse name-mapping into a recursive structure. #723

Fokko · 2024-11-27T12:39:35Z

Name mapping is used when the data files in a table don't have field IDs encoded in them. For example, when files are added through add_files during a table migration from Hive, the Parquet files carry no field IDs. In this case we want to make use of name mapping (https://iceberg.apache.org/spec/#name-mapping-serialization), a JSON blob that's stored alongside the table in a table property.

This issue is solely about deserializing the JSON blob into an in-memory structure. Tests can be found here: https://github.com/apache/iceberg-python/blob/main/tests/table/test_name_mapping.py
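To make the target shape concrete, here is a minimal sketch of the deserialization in Python, using only the standard library. The JSON keys (`field-id`, `names`, `fields`) follow the name-mapping section of the spec; the `MappedField` class and the `parse_fields` / `parse_name_mapping` helpers are hypothetical names chosen for illustration, not the iceberg-rust or PyIceberg API.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class MappedField:
    """One node of the mapping (hypothetical type, for illustration only)."""
    names: List[str]
    field_id: Optional[int] = None  # "field-id" is optional in the spec
    fields: List["MappedField"] = field(default_factory=list)  # nested children


def parse_fields(raw: list) -> List[MappedField]:
    # Recurse into "fields" so the nesting of the JSON is preserved as-is.
    return [
        MappedField(
            names=list(item.get("names", [])),
            field_id=item.get("field-id"),
            fields=parse_fields(item.get("fields", [])),
        )
        for item in raw
    ]


def parse_name_mapping(blob: str) -> List[MappedField]:
    return parse_fields(json.loads(blob))


# Example blob in the shape described by the spec.
EXAMPLE = """
[
  {"field-id": 1, "names": ["id", "record_id"]},
  {"field-id": 2, "names": ["data"]},
  {"field-id": 3, "names": ["location"], "fields": [
    {"field-id": 4, "names": ["latitude", "lat"]},
    {"field-id": 5, "names": ["longitude", "long"]}
  ]}
]
"""

mapping = parse_name_mapping(EXAMPLE)
```

In Rust, the same shape would presumably be a struct with an optional ID, a list of names, and a list of child structs of the same type.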

Future tip: It is best to store this in a recursive structure so it can be traversed using a VisitorWithParent, where both a Schema and a NameMapping can be traversed at once. This is important because we cannot flatten the name mapping: field names may contain dots, so a flattened path cannot be split unambiguously into fields and subfields. This is done in PyIceberg here: apache/iceberg-python#1014
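The ambiguity can be shown with a small Python sketch. It is only an illustration under assumed data: the mapping below is made up, and `flatten` and `lookup` are hypothetical helpers, not functions from PyIceberg or iceberg-rust. A top-level field literally named `a.b` and a nested field `b` inside `a` collapse to the same dotted key once flattened, while a lookup over the nested structure keeps them apart.

```python
from typing import List, Optional, Tuple

# Made-up mapping: field 1 is literally named "a.b"; field 3 is "b" nested in "a".
mapping = [
    {"field-id": 1, "names": ["a.b"]},
    {"field-id": 2, "names": ["a"], "fields": [{"field-id": 3, "names": ["b"]}]},
]


def flatten(fields: list, prefix: str = "") -> List[Tuple[str, Optional[int]]]:
    """Join nested names with dots, as a flat representation would."""
    out = []
    for f in fields:
        for name in f["names"]:
            path = prefix + name
            out.append((path, f.get("field-id")))
            out.extend(flatten(f.get("fields", []), path + "."))
    return out


def lookup(fields: list, path: Tuple[str, ...]) -> Optional[int]:
    """Resolve a path given as separate name segments against the nested form."""
    for f in fields:
        if path[0] in f["names"]:
            if len(path) == 1:
                return f.get("field-id")
            return lookup(f.get("fields", []), path[1:])
    return None


flat = flatten(mapping)
# Both field 1 and field 3 end up under the same flattened key "a.b".
collisions = [fid for key, fid in flat if key == "a.b"]

top_level = lookup(mapping, ("a.b",))  # the field literally named "a.b"
nested = lookup(mapping, ("a", "b"))   # field "b" inside struct "a"
```

Keeping the structure recursive means each level of the mapping can be matched against the corresponding level of the schema, without ever splitting a name on dots.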

barronw · 2024-11-28T13:44:21Z

Can I pick this up?

c-thiel · 2024-11-28T13:58:56Z

@barronw gladly! Assigned the issue to you.
If there are any questions, just post them here or contact us on Slack :)

Fokko added the "good first issue" label Nov 27, 2024

Fokko mentioned this issue Nov 27, 2024

Iceberg-rust Write support #700

Open

28 tasks

c-thiel assigned barronw Nov 28, 2024

barronw linked a pull request Nov 28, 2024 that will close this issue

name mapping serde #740

Open