I think the document did a good job of introducing the idea of index hacking and provided a nice overview of all the associated concepts.
I am a little confused, however, on how the bin 'value' is currently determined. "Instead of calculating the bin values from the offsets, use the first chunk_beg and chunk_end to calculate the relative size of the bin." Is there a reason that only the first chunk is used?
To me, it would make the most sense to sum the differences between each chunk's data start and data end. Also, it seems that the start/end integers would need to first be converted into their two-part bit constituents (48 | 16) to then determine the chunk's actual length/size of compressed data rather than using the raw integer values.
I may just be confused, however. Please let me know what you think.
The working slide deck and document are in Google Drive > IGB Project Documentation and Plans > Index Hacking
Currently creating a document using figures from the slides.