An issue arose while merging the PySpark DataFrames. It is related to the boolean/string issue we faced while reading from the schema. The inconsistency stems from some schemas defining the datatype as:

```
'type': 'boolean'
```

while others define it as:

```
'type': {
    'type': 'boolean'
}
```

@blootsvoets any reason for this, and would it make sense to validate for a consistent form?
Indeed, simple types should not be nested like that; the nested form is only used for complex types: lists, maps, enums, and records. If the schema compiles (does it?), it is hard to check this programmatically, because in the compiled schema you cannot distinguish the two notations.

Feel free to add an entry to the README and/or the PySpark GitHub Actions test.
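Since the compiled schema loses the distinction, a lint step would have to inspect the raw schema JSON before compilation. A minimal sketch of what such a check could look like — the helper name, paths, and example record are hypothetical, not part of the existing codebase:

```python
import json

# Primitive type names as defined by the Avro specification.
PRIMITIVES = {"null", "boolean", "int", "long", "float", "double", "bytes", "string"}

def find_nested_primitives(schema, path="$"):
    """Return JSON paths where a primitive type is needlessly wrapped
    in a nested {'type': ...} object instead of written as a plain string."""
    issues = []
    if isinstance(schema, dict):
        t = schema.get("type")
        # A dict carrying nothing but {'type': '<primitive>'} is the
        # redundant nested form discussed above.
        if isinstance(t, dict) and t.get("type") in PRIMITIVES and len(t) == 1:
            issues.append(f"{path}.type")
        for key, value in schema.items():
            issues.extend(find_nested_primitives(value, f"{path}.{key}"))
    elif isinstance(schema, list):
        for i, item in enumerate(schema):
            issues.extend(find_nested_primitives(item, f"{path}[{i}]"))
    return issues

# Hypothetical record mixing both notations.
record = json.loads("""
{"type": "record", "name": "Example", "fields": [
    {"name": "flat", "type": "boolean"},
    {"name": "nested", "type": {"type": "boolean"}}
]}
""")
print(find_nested_primitives(record))  # → ['$.fields[1].type']
```

A check like this could run over the schema files in CI (e.g. the suggested GitHub Actions test) and fail when the list is non-empty.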
cc/ @thepushkarp @Hsankesara