jtd-infer crashes with a stack overflow on large amounts of concatenated input #11

bjorndm · 2021-06-11T09:59:18Z

I can use jtd-infer with a few concatenated files, but when I concatetenate a 1000+ JSON files, the program crashed with a stack overflow. Similar inferral tools for JSON Schema, like Genson, work fine with these same 1000+ files.

$ ls -Aq testdata/metadata/  | wc -l 
2
$ cat testdata/metadata/*.json | jtd-infer 
# This works OK
$ ls -Aq ../test/testdata/metadata/  | wc -l 
1129
$ cat ../test/testdata/metadata/*.json | jtd-infer 
thread 'main' has overflowed its stack
fatal runtime error: stack overflow
Afgebroken (geheugendump gemaakt)

I would appreciate if this could be fixed somehow. Perhaps allow several input files on the command line in stead of one?

zekefast · 2023-04-07T23:55:50Z

@bjorndm It seems that schema inferred by jtd-infer is too large to fit default stack size on your system. jtd-infer could be rewritten a bit to occupy less space on stack and use more heap instead (which is like slow it down a bit) and assume default stack size which on linux systems usually 8MiB.

But you can also increase stack size to workaround the issue.

ulimit -all

shows configuration of you system. You interested in stack size line. Or you can grep it: ulimit -all | grep "stack size". The value usually in KiB (*1024).

To set new value use

ulimit -s <VALUE_IN_KiB>

You can try to double (ulimit -s 16384) or quadruple(ulimit -s 32768) stack size and run jtd-infer once again.

@ucarion It could be worth to add somewhere to documentation or README that kind of workaround as people with large JSON files and schemas hit that issue quite frequently and not everybody may know about ability to increase default stack size on their systems.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jtd-infer crashes with a stack overflow on large amounts of concatenated input #11

jtd-infer crashes with a stack overflow on large amounts of concatenated input #11

bjorndm commented Jun 11, 2021

zekefast commented Apr 7, 2023

jtd-infer crashes with a stack overflow on large amounts of concatenated input #11

jtd-infer crashes with a stack overflow on large amounts of concatenated input #11

Comments

bjorndm commented Jun 11, 2021

zekefast commented Apr 7, 2023