Skip to content

Commit

Permalink
feat: skip any byte column for profiling, adds number of column
Browse files Browse the repository at this point in the history
  • Loading branch information
nazarfil committed Dec 3, 2024
1 parent 5e8081e commit 833d4d0
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion hexa/datasets/queue.py
Original file line number Diff line number Diff line change
Expand Up @@ -110,11 +110,12 @@ def generate_profile(df: pd.DataFrame) -> list:
unique_values = df.nunique().to_dict()
distinct_values = df.apply(lambda x: x.nunique(dropna=False)).to_dict()
constant_values = df.apply(lambda x: x.nunique() == 1).astype("bool").to_dict()
count = df.count()

metadata_per_column = [
{
"column_name": column,
"number_of_rows": len(df),
"count": count.get(column),
"data_type": data_types.get(column),
"missing_values": missing_values.get(column),
"unique_values": unique_values.get(column),
Expand Down

0 comments on commit 833d4d0

Please sign in to comment.