content/core/core_news/2019-11-02-8-3-0-available.md

Title: Apache Lucene™ 8.3.0 available category: core/news URL: save_as:

The Lucene PMC is pleased to announce the release of Apache Lucene 8.3.0.

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross

This release contains numerous bug fixes, optimizations, and improvements, some of which are highlighted below. The release is available for immediate download at:

https://lucene.apache.org/core/downloads.html

Lucene 8.3.0 Release Highlights:

New SpanishMinimalStemFilter
New “export all terms and doc freqs” feature to Luke with delimiters
Composite Matches from multiple subqueries now allow access to their submatches, and a new NamedMatches API allows marking of subqueries and a simple way to find which subqueries have matched on a given
Range Query For Multiple Connected Ranges
LatLonDocValuesPointInPolygonQuery for LatLonDocValuesField
New UniformSplitPostingsFormat (name “UniformSplit”) primarily benefiting in simplicity and extensibility
New STUniformSplitPostingsFormat (name “SharedTermsUniformSplit”) that shares a single internal term dictionary across fields
DisjunctionMaxQuery more efficiently leverages impacts to skip non-competitive hits
BooleanQuery with no scoring clause can now early terminate the query when the total hits is not requested
Matches on wildcard queries will defer building their full disjunction until a MatchesIterator is pulled
spatial-extras quad and packed quad prefix trees now index points faster
Add additional leaf node level optimizations in LatLonShapeBoundingBoxQuery
Improve performance of WITHIN and DISJOINT queries for Shape queries by doing just one pass whenever possible
Introduce shared count based early termination across multiple slices
Blocktree‘s seekExact now short-circuits false if the term isn’t in the min-max range of the segment. Large perf gain for ID/time like data when populated sequentially
Show SPI names instead of class names in Luke Analysis tab
GraphTokenStreamFiniteStrings preserves all Token attributes through its finite strings TokenStreams
Introduced SpanPositionRange into XML Query Parser
Use a sort key instead of true distance in NearestNeighbor
Tessellator labels the edges of the generated triangles whether they belong to the original polygon
Use exact distance between point and bounding rectangle in FloatPointNearestNeighbor
The Korean analyzer now splits tokens on boundaries between digits and alphabetic characters
MoreLikeThis is biased for uncommon fields

Please read CHANGES.txt for a full list of new features and changes:

https://lucene.apache.org/core/8_3_0/changes/Changes.html