New file formats in Lucene 4.1+ index
Posted by Kelvin on 30 Apr 2013 at 06:48 pm | Tagged as: Lucene / Solr / Elasticsearch / Nutch
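Lucene 4.1 introduces new files in the index. They come from Lucene41PostingsFormat, which replaces the .frq and .prx files of Lucene 4.0 with .doc, .pos and .pay; the .tim and .tip files already existed in 4.0.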
Here's a link to the documentation: https://builds.apache.org/job/Lucene-Artifacts-trunk/javadoc/core/org/apache/lucene/codecs/lucene41/Lucene41PostingsFormat.html
The files written by this postings format are:
.tim: Term Dictionary
.tip: Term Index
.doc: Frequencies and Skip Data
.pos: Positions
.pay: Payloads and Offsets
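
If you want to see which of these files a given index actually contains, the list above can be turned into a small lookup table. The following is only a rough sketch in plain Python (it does not use Lucene at all): the helper names `describe_index_file` and `summarize` are made up for illustration, and the file names in the usage note are hypothetical examples rather than output from a real index.

```python
from pathlib import PurePath

# Extension-to-description table, copied from the list above.
LUCENE41_POSTINGS_FILES = {
    ".tim": "Term Dictionary",
    ".tip": "Term Index",
    ".doc": "Frequencies and Skip Data",
    ".pos": "Positions",
    ".pay": "Payloads and Offsets",
}


def describe_index_file(file_name):
    """Return the description for a postings file, or None for any other file.

    Hypothetical helper: it only looks at the file extension.
    """
    return LUCENE41_POSTINGS_FILES.get(PurePath(file_name).suffix.lower())


def summarize(file_names):
    """Group file names by description, skipping files outside the table."""
    summary = {}
    for name in file_names:
        description = describe_index_file(name)
        if description is not None:
            summary.setdefault(description, []).append(name)
    return summary
```

To try it on a real index, pass the directory listing, for example `summarize(os.listdir(index_dir))` after `import os`, where `index_dir` is the path to your index. Files that are not in the table (segment metadata, stored fields and so on) are simply skipped.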