Title: Nutch Incubation Status

On 2005-06-01, the Nutch project has been voted in by the Lucene PMC to become part of the Lucene project.

The Nutch project graduated on 2005-06-01

Nutch is web search software. It builds on the Apache Lucene search library, adding a crawler, web database (including full link graph), plugins for various document formats, user interface, etc. It is currently used by sites such as The Creative Commons , Oregon State University , and The Internet Archive.

If the project website and code repository are not yet setup, use the following table:

itemtypereference
Websitewwwhttp://incubator.apache.org/nutch/
wikihttp://wiki.apache.org/nutch/
Mailing listdevnutch-dev@incubator.apache.org
commitsnutch-commits@incubator.apache.org
usernutch-user@incubator.apache.org
agentnutch-agent@incubator.apache.org
nutch-general@incubator.apache.org
Bug trackingJirahttp://issues.apache.org/jira/
Source codeSVNhttps://svn.apache.org/repos/asf/incubator/nutch
MentorscuttingDoug Cutting
ehatcherErik Hatcher
CommitterscuttingDoug Cutting
cafarellaMichael Cafarella
abialAndrzej Bialecki
johnxJohn Xing
zirenSami Siren
  • none yet

Project Setup

This is the first phase on incubation, needed to start the project at Apache.

Item assignment is shown by the Apache id. Completed tasks are shown by the completion date (YYYY-MM-dd).

Identify the project to be incubated

statusitem
DONEMake sure that the requested project name does not already exist and check www.nameprotect.com to be sure that the name is not already trademarked for an existing software product.
DONEIf request from an existing Apache project to adopt an external package, then ask the Apache project for the cvs module and mail address names.
DONEIf request from outside Apache to enter an existing Apache project, then post a message to that project for them to decide on acceptance.
DONEIf request from anywhere to become a stand-alone PMC, then assess the fit with the ASF, and create the lists and modules under the incubator address/module names if accepted.

Interim responsibility

statusitem
DONEIdentify all the Mentors for the incubation, by asking all that can be Mentors.
DONESubscribe all Mentors on the pmc and general lists.
DONEGive all Mentors access to all incubator CVS modules. (to be done by PMC chair)
DONETell Mentors to track progress in the file ‘incubator/projects/{project.name}.cwiki’

Copyright

statusitem
DONECheck and make sure that the papers that transfer rights to the ASF been received. It is only necessary to transfer rights for the package, the core code, and any new code produced by the project.
DONECheck and make sure that the files that have been donated have been updated to reflect the new ASF copyright.

Verify distribution rights

statusitem
DONECheck and make sure that for all code included with the distribution that is not under the Apache license, e have the right to combine with Apache-licensed code and redistribute.
DONECheck and make sure that all source code distributed by the project is covered by one or more of the following approved licenses: Apache, BSD, Artistic, MIT/X, MIT/W3C, MPL 1.1, or something with essentially the same terms.

Establish a list of active committers !

statusitem
DONECheck that all active committers have submitted a contributors agreement.
DONEAdd all active committers in the STATUS file.
DONEAsk root for the creation of committers' accounts on cvs.apache.org.

Infrastructure !

statusitem
DONEAsk infrastructure to create source repository modules and grant the committers karma.
DONEAsk infrastructure to set up and archive Mailing lists.
DONEDecide about and then ask infrastructure to setup an issuetracking system (Bugzilla, Scarab, Jira).
DONEMigrate the project to our infrastructure.

Project specific

Add project specific tasks here.

Incubation

These action items have to be checked for during the whole incubation process.

These items are not to be signed as done during incubation, as they may change during incubation. They are to be looked into and described in the status reports and completed in the request for incubation signoff.

Collaborative Development

  • Have all of the active long-term volunteers been identified and acknowledged as committers on the project?

Yes. All existing committers have migrated to Apache.

  • Are there three or more independent committers? (The legal definition of independent is long and boring, but basically it means that there is no binding relationship between the individuals, such as a shared employer, that is capable of overriding their free will as individuals, directly or indirectly.)

Yes. Nutch has five committers. No two committers share an employer. Yahoo! at times contracts with two of the five committers (Mike & Doug) to work on Nutch.

  • Are project decisions being made in public by the committers?

Yes. Project discussions are on public mailing lists.

  • Are the decision-making guidelines published and agreed to by all of the committers?

Yes. Committers are using Apache decision-making guidelines.

  • Are all licensing, trademark, credit issues being taken care of and acknowleged by all committers?

Add project specific tasks here.

Organizational acceptance of responsibility for the project

  • If graduating to an existing PMC, has the PMC voted to accept it?

Yes. It has on 2005-06-01.

Incubator sign-off

  • Has the Incubator decided that the project has accomplished all of the above tasks?

Yes. It has on 2005-06-01.