Title: DataSketches Incubation Status

This page tracks the project status, incubator-wise. For more general project status, look on the project website.

The DataSketches project graduated on 2020-12-16

DataSketches is an open source, high-performance library of streaming algorithms commonly called “sketches” in the data sciences. Sketches are small, stateful programs that process massive data as a stream and can provide approximate answers, with mathematical guarantees, to computationally difficult queries orders-of-magnitude faster than traditional, exact methods.

itemtypereference
Websitewwwhttps://datasketches.apache.org/
.wikihttps://cwiki.apache.org/confluence/display/DATASKETCHES/Home
Mailing listsdevdev @ datasketches.apache.org
.usersusers @ datasketches.apache.org
.commitscommits @ datasketches.apache.org
.issuesissues @ datasketches.apache.org
.notificationsnotifications @ datasketches.apache.org
.privateprivate @ datasketches.apache.org
Bug trackingJava Corehttps://github.com/apache/incubator-datasketches-java/issues
.CPP Corehttps://github.com/apache/incubator-datasketches-cpp/issues
.Java Hivehttps://github.com/apache/incubator-datasketches-hive/issues
.Java Pighttps://github.com/apache/incubator-datasketches-pig/issues
.CPP PostreSQLhttps://github.com/apache/incubator-datasketches-postgresql/issues
.Java Memoryhttps://github.com/apache/incubator-datasketches-memory/issues
.GitHubUse issues on any of our GitHub sites (below).
Source codeGitHubhttps://github.com/apache?q=datasketches
.Zip Archivehttps://archive.apache.org/dist/incubator/datasketches/
.Nexus (Java)https://repository.apache.org/content/repositories/releases/org/apache/datasketches/
MentorskamaciFurkan Kamaci
.kennKenneth Knowles
.chenliang613Liang Chen
.waveDave Fisher
.evansyeEvans Ye
RosterWhimsyDatasketches Roster

Project Setup

This is the first phase on incubation, needed to start the project at Apache.

Item assignment is shown by the Apache id. Completed tasks are shown by the completion date (YYYY-MM-dd).

Identify the project to be incubated

dateitem
2019-04-09Datasketches Name Approved by Mark Thomas

Infrastructure

dateitem
2019-04-10Request DNS (first step after creating podling status page) https://issues.apache.org/jira/browse/INFRA-18197. Completed.
2019-04-10Request Mailing Lists Completed.
2019-05-10Request git repositories https://issues.apache.org/jira/browse/INFRA-18362. Completed
N/A (using GitHub)Ask infrastructure to set up issue tracker ( JIRA , Bugzilla). DataSketches JIRA Not used.
N/A (not using wiki)Ask infrastructure to set up wiki ( Confluence ). Not used.
2020-01-24 (website was the final item)Migrate the project to our infrastructure. ( INFRA-18362 and INFRA-19259 )

Mentor-related responsibility/oversight

dateitem
2019-05-10Subscribe all Mentors on the pmc and general lists.
....-..-..Give all Mentors access to the incubator SVN repository. (to be done by the Incubator PMC chair or an Incubator PMC Member wih karma for the authorizations file). Completed.
....-..-..Tell Mentors to track progress in the file ‘incubator/projects/{project.name}.html’

Copyright and IP

dateitem
2019-05-07Check and make sure that the papers that transfer rights to the ASF been received. It is only necessary to transfer rights for the package, the core code, and any new code produced by the project.
2019-10-25Check and make sure that the files that have been donated have been updated to reflect the new ASF copyright.
2019-05-07Matt Sicker (secretary@apache.org) acknowledged receipt of Software Grant
2019-05-07Matt Sicker (secretary@apache.org) acknowledged receipt of SGA and Logo source file.

Verify distribution rights

dateitem
2019-10-25Check and make sure that for all code included with the distribution that is not under the Apache license, we have the right to combine with Apache-licensed code and redistribute. Completed.
2019-10-25Check and make sure that all source code distributed by the project is covered by one or more of the following approved licenses: Apache, BSD, Artistic, MIT/X, MIT/W3C, MPL 1.1, or something with essentially the same terms. Completed.

Establish a list of active committers

CommitterICLA?
Alexander SaydakovYES
David CrombergeYES
Edo LibertyYES
Eshcar HillelYES
Jon MalkinYES
Justin ThalerYES
Lee RhodesYES
Pavel VeselýYES
Roman LeventovYES
dateitem
2019-04-10Check that all active committers have submitted a contributors agreement. YES.
2019-04-10Add all active committers in the relevant section above. Done.
2019-04-10Ask root for the creation of committers' accounts in LDAP. Done.

Project specific

Add project specific tasks here.

Incubation

These action items have to be checked for during the whole incubation process.

These items are not to be signed as done during incubation, as they may change during incubation. They are to be looked into and described in the status reports and completed in the request for incubation signoff.

Collaborative Development

  • Have all of the active long-term volunteers been identified and acknowledged as committers on the project? YES.

  • Are there three or more independent committers? (The legal definition of independent is long and boring, but basically it means that there is no binding relationship between the individuals, such as a shared employer, that is capable of overriding their free will as individuals, directly or indirectly.) YES.

  • Are project decisions being made in public by the committers? YES.

  • Are the decision-making guidelines published and agreed to by all of the committers? YES.

Licensing awareness

  • Are all licensing, trademark, credit issues being taken care of and acknowleged by all committers? YES.

Project Specific

Add project specific tasks here.

Exit

Things to check for before voting the project out.

Organizational acceptance of responsibility for the project

  • If graduating to an existing PMC, has the PMC voted to accept it? YES.

  • If graduating to a new PMC, has the board voted to accept it? YES.

Incubator sign-off

  • Has the Incubator decided that the project has accomplished all of the above tasks? YES.