Reindex after or during log compaction #199

staltz · 2022-03-25T12:18:11Z

staltz · 2022-03-29T15:22:59Z

@arj03 I'm working on this already, but I bumped into a design issue. My original plan is to rebuild all indexes while log compaction is still ongoing, based on what Dominic suggested here:

Instead of writing a separate second log, superimpose the compacted log on top of the old one, keeping track of the min uncompacted sequence number. When you do a query from the old indexes, drop results with offsets smaller than the uncompacted sequence, and mix with results from the new indexes. merge index responses by sequence numbers, so numbers smaller than the uncompacted sequence are from the compacted section and the numbers greater are from the uncompacted log.

For instance, my plan is to create a folder .ssb/db2/indexes/.compacting where the new index files exist, and once those are done catching up with the log, we delete all jitdb indexes in .ssb/db2/indexes and move up the files in .ssb/db2/indexes/.compacting to .ssb/db2/indexes.

However, does this go against the JIT design of JITDB?

Alternatively, should we just delete all JITDB indexes once log compaction is done, and let the JIT nature take care of rebuilding them?

arj03 · 2022-03-29T18:59:44Z

Will get back to you on this tomorrow :)

arj03 · 2022-03-30T13:19:22Z

For JITDB I don't think we shouldn't allow queries while compaction is running, instead keep track of the offset from which you started compacting and on done call reindex() with that. This way if you compacted only the end of the log it should be quite fast. For level indexes in db2 it might make sense to allow lookups for things we know are before compaction. I do believe that we should have a "high-level" compact function in db2 that takes care of doing these things. Because it also needs to keep track of state while compacting, so that we can continue if the app is shut down while compacting.

This could be a starting point for the calls to jitdb & level indexes, it's a bit different because here we are running on individual messages.

staltz · 2022-04-01T11:23:19Z

@arj03 Thanks!

reindex() currently doesn't rebuild the core indexes (seq, sequence, timestamp), is there a reason for that? I think with compaction we would also have to rebuild the core indexes.

Why do you say that level indexes should be allowed to lookup while jitdb shouldn't? Is it because of speed of rebuilding the indexes? (By the way, in my tests, I felt that some jitdb indexes do take a considerable time to build, like value_author.32prefix, which is 6 MB in a production database, and has to write 4 bytes for each record.)

arj03 · 2022-04-01T11:26:23Z

reindex() currently doesn't rebuild the core indexes (seq, sequence, timestamp), is there a reason for that? I think with compaction we would also have to rebuild the core indexes.

Right, that is because for encrypted they don't change. Here they do :)

Why do you say that level indexes should be allowed to lookup while jitdb shouldn't?

Just because it is easier. We don't need to flag to jitdb that says: please don't update indexes, just run on what you have.

I can maybe check the value_author prefix building later later to see if there is anything we can do to optimize that one.

staltz · 2022-04-01T11:40:56Z

Okay thanks, so I'm preparing for the plan to "not allow queries while compaction is running", and here are some disorganized thoughts (mostly questions to myself):

What happens when a jitdb query is ongoing and compaction starts? Do we cancel the query, or do we pause it and re-run it after compaction (and reindexing) is done? What if it was a 4th call to paginate and the starting seq would now mean that the 4th paginate would get results that were in the 3rd paginate (or, worse, would skip some results).

This is a bit similar to questions about log.get and log.stream: do we abort it or do we wait for compaction to end and rerun it?

staltz · 2022-04-01T11:55:31Z

I can maybe check the value_author prefix building later later to see if there is anything we can do to optimize that one.

Just as a reference, here are other (leveldb and jitdb) indexes of similar size (sorted). There are plenty of big jitdb indexes, computing them all and writing to disk isn't fast.

Index	Size	Type
search2	101 MB	level
ebt	21 MB	level
contacts	12 MB	level
timestamp	12 MB	bitvector
value_content_vote_link__map	6,2 MB	prefix map
seq	6,0 MB	bitvector
sequence	6,0 MB	bitvector
value_author	6,0 MB	prefix
fullMentions	5,1 MB	level
hashtags	5,0 MB	level
aboutSelf	3,0 MB	level
value_content_root__map	3,0 MB	prefix map
value_content_contact__map	2,6 MB	prefix map
base	1,3 MB	level
value_content_fork__map	218 KB	prefix map
meta_	173 KB	bitvector
meta_private_true	173 KB	bitvector
value_author_MYSSBID	173 KB	bitvector
value_content_root_	173 KB	bitvector
value_content_type_contact	173 KB	bitvector
value_content_type_post	173 KB	bitvector
value_content_type_pub	173 KB	bitvector
value_content_type_roomx2Falias	173 KB	bitvector
value_content_type_vote	173 KB	bitvector
canDecrypt	24 KB	bitvector
encrypted	64 B	bitvector

staltz · 2022-04-01T12:31:10Z

What happens when a jitdb query is ongoing and compaction starts?

Answering myself: if there are ongoing queries (of any kind), postpone compaction, such that compaction always starts when everything else in the database is idle, and then from that point onwards, queue all incoming queries so that they apply only after compaction is done.

arj03 · 2022-04-01T12:51:42Z

Yep agree. Also for most of these indexes you wouldn't need to do a full rebuild. Only from compaction start and onwards.

arj03 · 2022-04-01T17:38:01Z

I felt that some jitdb indexes do take a considerable time to build, like value_author.32prefix, which is 6 MB in a production database, and has to write 4 bytes for each record.)

I had to check this. Building author or type is around the same amount of time (40s). Note this is from totally empty indexes folder, so includes keys and base. Rebuilding author after type is 6s. Rebuilding both are 45s. So a tiny bit less. I tried disabling writing the jitdb indexes and it only took rebuilding from 45s to 43.2s. Decrypting is still very heavy as documented here at around 10s overhead. If I leave the canDecrypt file and level indexes the time to build both goes down to 18.7s.

staltz · 2022-04-07T12:01:23Z

staltz · 2022-04-08T14:02:40Z

@arj03 I think I hit a pretty sad issue: it seems like we have to reset all leveldb indexes after compact happens, and that's because most leveldb indexes hold state, and we don't know how to undo the state only for the deleted records.

Consider e.g. the about-self index which has key/value feedId => hydratedAboutObj. Suppose we deleted an about message with the name field, but we still have an about message with the image field. There's nothing in this index that tells us how to "partially" delete the hydratedAboutObj, so either we get the wrong reduced state, or we have to do a full rebuild.

Any ideas about this?

arj03 · 2022-04-08T17:11:20Z

@staltz right, reduce based indexes versus pure. I think we could introduce that abstraction and then you'd only have to do a full reindex on the reduce based ones. Most base indexes are not reduce.

staltz · 2022-04-08T20:27:25Z

Yeah, we could do that split. But it might still be hard to find and remove the outdated entries. Like the EBT index, I think it's [feedId, sequence] => offset, we need to look up the old ones via the right-hand side, i.e. the value, not the key.

arj03 · 2022-04-11T06:19:14Z

Oh right, you are correct. I guess there isn't any other good way than to reindex everything in this case :(

staltz added the enhancement New feature or request label Mar 25, 2022

staltz self-assigned this Mar 25, 2022

staltz mentioned this issue Mar 29, 2022

use binary search to get seq from offset #200

Merged

staltz mentioned this issue Apr 8, 2022

force flush after reindexEncrypted is done ssbc/ssb-db2#330

Merged

This was referenced Apr 20, 2022

add APIs necessary for log compaction algorithms #217

Merged

use sizeDiff=1 to represent holes filled ssbc/async-append-only-log#71

Merged

staltz mentioned this issue May 1, 2022

add compact() API and auto-rebuild indexes ssbc/ssb-db2#339

Merged

staltz closed this as completed May 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reindex after or during log compaction #199

Reindex after or during log compaction #199

staltz commented Mar 25, 2022 •

edited

Loading

staltz commented Mar 29, 2022 •

edited

Loading

arj03 commented Mar 29, 2022

arj03 commented Mar 30, 2022

staltz commented Apr 1, 2022

arj03 commented Apr 1, 2022

staltz commented Apr 1, 2022

staltz commented Apr 1, 2022

staltz commented Apr 1, 2022

arj03 commented Apr 1, 2022

arj03 commented Apr 1, 2022 •

edited

Loading

staltz commented Apr 7, 2022 •

edited

Loading

staltz commented Apr 8, 2022

arj03 commented Apr 8, 2022

staltz commented Apr 8, 2022

arj03 commented Apr 11, 2022

Reindex after or during log compaction #199

Reindex after or during log compaction #199

Comments

staltz commented Mar 25, 2022 • edited Loading

staltz commented Mar 29, 2022 • edited Loading

arj03 commented Mar 29, 2022

arj03 commented Mar 30, 2022

staltz commented Apr 1, 2022

arj03 commented Apr 1, 2022

staltz commented Apr 1, 2022

staltz commented Apr 1, 2022

staltz commented Apr 1, 2022

arj03 commented Apr 1, 2022

arj03 commented Apr 1, 2022 • edited Loading

staltz commented Apr 7, 2022 • edited Loading

staltz commented Apr 8, 2022

arj03 commented Apr 8, 2022

staltz commented Apr 8, 2022

arj03 commented Apr 11, 2022

staltz commented Mar 25, 2022 •

edited

Loading

staltz commented Mar 29, 2022 •

edited

Loading

arj03 commented Apr 1, 2022 •

edited

Loading

staltz commented Apr 7, 2022 •

edited

Loading