Pinot Note

Posted by Shane on December 31, 2023

Index

Pinot Index UML

Bullet Point of Pinot Index System

  • IndexCreator build the immutable index; IndexReaderFactory create the reader for immutable index
  • MutableXXXIndex inherit MutableIndex (write api) and IndexReader (read api)
  • ForwardIndex: Given a docID (rowID), return the raw value or the dictionaryID of the raw value depending on the encoding method you set up for that column. Each columns must have a ForwardIndex
  • Pinot’s InvertedIndex (column value to the bitmap), JsonIndex, TextIndex, GeoIndex are the general concept of index in database world, i.e. given a column value (raw value or dictionaryID of raw value), return the docID.
  • Pinot create a mutable index per column for mutable segments. Personally, storing mutable segments in row orientied format might be more cost efficient.