
TypeError: unsupported operand type(s) for -: 'float' and 'NoneType' #1227

Closed
TheGermanEngie opened this issue Apr 1, 2024 · 7 comments
Labels: bug

@TheGermanEngie

Continuing from issue #861, where a path problem turned into a data-loading problem. I opened a new Lightning AI Studio instance, freshly installed the repo, and downloaded the checkpoint, and I still get the same error.

[Screenshot: traceback ending in TypeError: unsupported operand type(s) for -: 'float' and 'NoneType']

Here is a sample of my dataset.

sample.json
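
For context on the error class itself: in Python, this exact TypeError is raised whenever `None` is subtracted from a float, which usually points to an optional config value that was never filled in. A minimal sketch of the failure mode (hypothetical variable name, not litgpt's actual code):

```python
# Minimal reproduction of the failure mode: an optional config value
# that stayed None gets subtracted from a float.
val_split_fraction = None  # hypothetical: an optional setting left unset

train_fraction = 1.0 - val_split_fraction
# TypeError: unsupported operand type(s) for -: 'float' and 'NoneType'
```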

@TheGermanEngie (Author)

I even updated the dataset to add one more level of indentation to the {} objects so that it matches litgpt's example .json object formatting:

From this:
[Screenshot: JSON records at the original indentation]

To this:
[Screenshot: JSON records with the extra level of indentation]

And I still get the same error.
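
If it helps anyone reproducing this, a quick way to sanity-check a file before finetuning is a sketch like the one below. It assumes the file is a top-level JSON array of objects with `instruction`/`output` keys and an optional `input`, which matches the Alpaca format the litgpt examples follow but should be double-checked against the docs:

```python
import json

# Sanity-check sample.json against an Alpaca-style layout: a top-level
# list of objects with "instruction"/"output" keys and an optional
# "input". The required keys are an assumption, not a confirmed
# reading of litgpt's loader.
with open("sample.json") as f:
    data = json.load(f)

assert isinstance(data, list), "top level must be a JSON array"
for i, record in enumerate(data):
    missing = {"instruction", "output"} - set(record)
    assert not missing, f"record {i} is missing keys: {missing}"
print(f"{len(data)} records look structurally OK")
```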

@TheGermanEngie (Author)

A test run with the default Alpaca dataset, using the command `litgpt finetune lora --data Alpaca --checkpoint_dir checkpoints/mobiuslabsgmbh/aanaphi2-v0.1`, works fine, so I think the problem is my dataset. I can't find any formatting differences between the default dataset and mine, so I'm kind of at a loss here.

@TheGermanEngie (Author)

I should add that my dataset grows rapidly in size because it is built from speaker-diarization-style data. To keep the full context of the conversation, each time the Speaker 0 and Speaker 1 nametags change in the dataset, the previous output is folded back into the "input" field, and the text accumulates until the speakers change again, for example:

[Screenshot: example of a cumulative conversation record]

As you can imagine, it grows very quickly. This makes me wonder whether the earlier error has to do with context-length limits when initially loading the dataset, since the finetuning scripts work by determining the size of the longest tokenized sample in the dataset to set the block size, and my files do get quite long. I also tried truncating earlier with `--train.max_seq_length 256`, to no effect.
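
To illustrate the growth pattern described above (made-up turns, not the actual dataset): because each new record's "input" carries the entire conversation so far, the total text volume grows roughly quadratically with the number of speaker turns:

```python
# Sketch of the cumulative record construction described above, with
# made-up turns. Each record's "input" repeats the whole conversation
# so far, so total size grows ~quadratically with the number of turns.
turns = [
    ("Speaker 0", "Hi, how are you?"),
    ("Speaker 1", "Good, thanks. And you?"),
    ("Speaker 0", "Doing well."),
]

records, context = [], ""
for speaker, text in turns:
    records.append({
        "instruction": "Continue the conversation.",
        "input": context,
        "output": f"{speaker}: {text}",
    })
    context += f"{speaker}: {text}\n"

print(sum(len(r["input"]) + len(r["output"]) for r in records))
```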

@gitgroman

Same for me. I don't think size matters, as it failed with a really small dataset with only a few records.

@carmocca (Contributor)

carmocca commented Apr 3, 2024

@rasbt can you check this out? There might be a bug in the json datamodule

@carmocca added the bug label Apr 3, 2024
@gitgroman

gitgroman commented Apr 3, 2024

Adding `--data.val_split_fraction 0.1` fixed the issue.

@carmocca
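
That fits the traceback: if `val_split_fraction` defaults to `None` for a custom JSON file (an assumption based on this thread, not a confirmed reading of the datamodule), a split computation along these lines would raise exactly this TypeError, and passing an explicit fraction sidesteps it:

```python
# Hypothetical sketch of the suspected split logic; not litgpt's
# actual code. With val_split_fraction=None the subtraction raises
# the TypeError from this issue; an explicit fraction avoids it.
def split_sizes(n_samples, val_split_fraction=None):
    train_fraction = 1.0 - val_split_fraction  # TypeError when None
    n_train = int(n_samples * train_fraction)
    return n_train, n_samples - n_train

print(split_sizes(100, val_split_fraction=0.1))  # (90, 10)
```

So the full run would look something like `litgpt finetune lora --data JSON --data.json_path sample.json --data.val_split_fraction 0.1 --checkpoint_dir checkpoints/mobiuslabsgmbh/aanaphi2-v0.1` (the `--data.json_path` flag name is my assumption; double-check it against the litgpt docs).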

@carmocca (Contributor)

carmocca commented Apr 4, 2024

#1241 improves the messaging
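
Presumably something along the lines of failing fast with a readable message instead of letting the bare TypeError surface; a sketch of the idea (not the actual diff in #1241):

```python
# Sketch of the kind of guard a clearer message implies; not the
# actual code from #1241.
val_split_fraction = None  # hypothetical unset value

if val_split_fraction is None:
    raise ValueError(
        "Cannot split the dataset for validation. "
        "Please set --data.val_split_fraction, e.g. 0.1"
    )
```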

@carmocca closed this as completed Apr 4, 2024