Lucene.NET Contributor's Guide

You have found a bug or you have an idea for a cool new enhancement? Contributing code is a great way to give something back to the open source community. Before you dig right into the code there are a few guidelines that we need contributors to follow so that we can have a chance of keeping on top of things.

Getting Started

  • Read Open Source Contribution Etiquette and Don't “Push” Your Pull Requests.
  • Make sure you have a JIRA account.
  • Make sure you have a GitHub account. NOTE: Although this is a mirror of our Git repository, pull requests are accepted through GitHub.
  • If you are thinking of making a change that will result in more than 25 lines of changed code, we would appreciate you opening a discussion on our developer mailing list before you start writing. It could save both you and our team quite a bit of work if the code doesn't have to be rewritten to fit in with our overall objectives.
  • Submit a JIRA ticket for your issue, assuming one doesn't exist already.
    • If reporting a bug, clearly describe the issue including steps to reproduce, observed behavior, and expected behavior.
    • If reporting a bug, provide source code that we can run without any alteration demonstrating the issue. Issues submitted with runnable code will be given a higher priority than those submitted without.
  • If you will be submitting a pull request, fork the repository on GitHub.

If You are Willing to Help with Porting Code

  • Please make sure nobody else is working on porting it already. We would like to avoid doing redundant work. We ask that you communicate clearly in this list that you are going to work on some part of the project. A PMC member will then either approve or alert you someone else is working on that part already.

  • Use automated tools to do the basic porting work, and then start a manual clean-up process. For automatic conversion we are using Tangible's Java to C# Converter (we have licenses to give to committers). It has proven to work quite nicely, but I also hear good things on Sharpen. Check it out here and pick the tool you are more comfortable with.

  • Conventions & standards: not too picky at this point, but we should definitely align with the common conventions in .NET: PascalCase and not camelCase for method names, properties instead of getters/setters of fields, etc. I'm not going to list all the differences now but we probably want to have such a document up in the future. For reference have a look at Lucene.Net, while not perfect it is starting to shape up the way we want it.

  • In general, prefer .NETified code over code resembling Java. Enumerators over Iterators, yields when possible, Linq, BCL data structures and so on. We are targeting .NET 4.5.1, use this fact. Sometimes you will have to resort to Java-like code to ensure compatibility; it's ok. We would rather ship fast and then iterate on improving later.

  • While porting tests, we don't care about all those conventions and .NETification. Porting tests should be reduced to a copy-paste procedure with minimal cleaning up. We are working on tools and code helpers to help with that, see for examples see our Java style methods to avoid many search-replace in porting tests, and a R# plugin that will help making some stuff auto-port when pasting.

Porting Work - Up For Grabs

Note that even though we are currently a port of Lucene 4.8.0, we recommend porting over new work from 4.8.1. We hope to begin the work of upgrading to 4.8.1 soon (let us know if interested). There are only about 100 files that changed between 4.8.0 and 4.8.1.

Pending being ported from scratch (code + tests) plus have additional dependencies that either need to be sourced from the .NET ecosystem or ported.

Partially Completed

  • Lucene.Net.Misc
    • Missing native C++ Directory implementations for Windows and Unix/Posix along with wrapper classes to utilize them. See JIRA issue 562 and the Store namespace.
  • Lucene.Net.Sandbox
    • Missing all of the SlowCollatedXXX classes, the RegEx namespace (+ related tests). (casing intentional to prevent naming collisions with .NET Regex class)

If you are more into Fixing Existing Tests

We have already managed to get all of the tests green (most of the time). However, there are still a few flaky tests that fail randomly that need to be addressed. Since tests are using randomized testing, failures are changing. But if you put a [Repeat(number)] attribute on the tests they will fail more often, making them a bit easier to debut.

Some of the code (in particular code in the Support namespace) has no code coverage, and porting/adding tests for those is up for grabs.

Other Ways To Help

  • Making demos and tutorials, blogging about Lucene.Net, etc. (and providing feedback on how we can make the API better!). If you write a helpful Lucene.Net post on your blog, be sure to let us know so we can link to it.
  • Helping out with documentation. We are still trying to make the API docs easily navigable (see #206), and there are many files that are not formatted correctly (links not appearing, tables not very readable, etc). Also, we need help getting all of the Java-related documentation converted to use .NET methodologies.
  • Fixing TODOs. There are several TODOs throughout the code that need to be reviewed and action taken, if necessary. Search for LUCENENET TODO|LUCENE TO-DO using the regular expression option in Visual Studio to find them. Do note there are a lot of TODOs left over from Java Lucene that are safe to ignore.
  • Reviewing code. Pick a random section, review line by line, comparing the code against the original Lucene 4.8.0 code. Many of the bugs have been found this way, as the tests are not showing them. Let us know if you find anything suspicious on the dev mailing list or submit a pull request.
  • Optimizing code. During porting we have ended up with some code that is less than optimal. We could use a hand getting everything up to speed (pun intended).
  • Helping update the API, or at least just providing feedback on what is important. There are several things on our radar, like integrating something like Lucene.Net.Linq directly into our project, converting the remaining public-facing iterator classes into IEnumerator<T> so they can be used with foreach loops, adding extension methods to remove the need for casting, etc.
  • Creating projects to make Lucene.Net easier to use with various .NET frameworks (ASP.NET MVC, WebApi, AspNetCore, WPF, EntityFramework, etc). In general, we would like common tasks as easy as possible to integrate into applications build on these frameworks without everyone having to write the same boilerplate code.
  • Building automation tools to eliminate some of the manual work of managing the project, updating information on various web pages, creating tools to make porting/upgrading more automated, etc.
  • Be a power beta tester. Make it your mission to track down bugs and report them to us on JIRA.

Also, check out the JIRA issue tracker for any other issues that you might be interested in helping with. You can signup for a JIRA account here (it just takes a minute).

Or, if none of that interests you, join our dev mailing list and ask!

Thank You For Your Help!

Again, thank you very much for your contribution. May the fork be with you!