This project helps create validate hadoop release candidates
https://github.com/apache/hadoop-release-support
It has an Apache Ant build.xml
file to help with preparing the release, validating gpg signatures, creating release messages and other things.
There is a maven pom.xml
file. This is used to validate the dependencies from staging repositories as well as run some basic tests to validate the classpath.
Installed applications/platforms
To use the scp/ssh commands we need the jsch jar on the classpath.
ant -diagnostics
BAD
------------------------------------------- Tasks availability ------------------------------------------- sshsession : Missing dependency com.jcraft.jsch.Logger scp : Missing dependency com.jcraft.jsch.Logger sshexec : Missing dependency com.jcraft.jsch.Logger
Here are the apt-get commands to set up a raspberry pi for the arm validation
apt-get install openjdk-17-jdk apt-get install ant libjsch-java apt-get install gpgv apt-get install maven apt-get install subversion
/build.xml
It has an Apache Ant build.xml
file to help with preparing the release, validating gpg signatures, creating release messages and other things.
/pom.xml
There is a maven pom.xml
file. This is used to validate the dependencies from staging repositories as well as run some basic tests to validate the classpath.
/build.properties
This is an optional property file which contains all user-specific customizations and options to assist in the release process.
This file is not SCM-managed (it is explicitly ignored).
It is read before all other property files are read/ant properties set, so can override any subsequent declarations.
/release.properties
This is a single-entry property file which provides a relative path to the latest release being worked on in this branch.
build.properties
release.version=3.4.1
Ant uses this to to set the property release.info.file
to the path src/releases/release-info-${release.version}.properties
release.info.file=src/releases/release-info-3.4.1.prpoperties
This is then loaded, with the build failing if it is not found.
src/releases/release-info-*.properties
Definition files of base properties for the active RC.
As an example, here is the value src/releases/release-info-3.4.0.properties
for the RC2 release candidate
hadoop.version=3.4.0 rc=RC2 previous.version=3.3.6 release.branch=3.4 git.commit.id=88fbe62f27e jira.id=HADOOP-19018 jira.title=Release 3.4.0 amd.src.dir=https://dist.apache.org/repos/dist/dev/hadoop/hadoop-3.4.0-RC2 arm.src.dir=${amd.src.dir} http.source=${amd.src.dir} asf.staging.url=https://repository.apache.org/content/repositories/orgapachehadoop-1402 cloudstore.profile=sdk2
Build the RC using the docker process on whichever host is set to do it using following doc https://cwiki.apache.org/confluence/display/HADOOP2/HowToRelease
Start EC2 Ubuntu 22 instance
Create a local user before anything else
groupadd -g 1024 mthakur useradd -g 1024 -u 1024 -m mthakur newgrp docker usermod -aG docker mthakur service docker restart su - mthakur
Install java
sudo apt install openjdk-8-jre-headless java -version
Install maven
wget https://dlcdn.apache.org/maven/maven-3/3.8.8/binaries/apache-maven-3.8.8-bin.tar.gz tar -xvf apache-maven-3.8.8-bin.tar.gz mv apache-maven-3.8.8 /opt/
Setup maven home in .profile
export M2_HOME="/opt/apache-maven-3.8.8/" PATH="$M2_HOME/bin:$PATH" export PATH source .profile
Create ~/.m2/settings.xml file as mentioned in https://cwiki.apache.org/confluence/display/HADOOP2/HowToRelease
Install docker
https://www.digitalocean.com/community/tutorials/how-to-install-and-use-docker-on-ubuntu-22-04
sudo apt update sudo apt install apt-transport-https ca-certificates curl software-properties-common curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo gpg --dearmor -o /usr/share/keyrings/docker-archive-keyring.gpg echo "deb [arch=$(dpkg --print-architecture) signed-by=/usr/share/keyrings/docker-archive-keyring.gpg] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable" | sudo tee /etc/apt/sources.list.d/docker.list > /dev/null sudo apt update apt-cache policy docker-ce sudo apt install docker-ce sudo systemctl status docker
Download hadoop git module.
git clone https://github.com/apache/hadoop.git
Setup gpg signing in remote EC2 host.
On local Mac, export PGP key:
gpg --export --armor > gpg_public gpg --export-secret-keys "Mukund Thakur" > private.key
Upload the file gpg_public to the Linux VM
On the Linux VM, import PGP key:
gpg --import gpg_public gpg --import private.key
Follow rest of process as mentioned in above HowToRelease doc.
src/releases/release-X.Y.X.properties
Create a new release properties file, using an existing one as a template. Update as a appropriate.
/release.properties
Update the value of release.version in
/release.properties` to declare the release version. This is used to determine the specific release properties file for that version.
release.version=X.Y.Z
You can switch to a new release on the command line; this is needed when validating PRs.
ant -Drelease.version=3.4.1
build.properties
.
# info for copying down the RC from the build host scp.hostname=stevel-ubuntu scp.user=stevel scp.hadoop.dir=hadoop # SVN managed staging dir staging.dir=/Users/stevel/hadoop/release/staging # where various modules live for build and test spark.dir=/Users/stevel/dev/spark cloud-examples.dir=/Users/stevel/dev/sparkwork/cloud-integration/cloud-examples cloud.test.configuration.file=/Users/stevel/dev/config/test-configs/s3a.xml bigdata-interop.dir=/Users/stevel/dev/gcs/bigdata-interop hboss.dir=/Users/stevel/dev/hbasework/hbase-filesystem cloudstore.dir=/Users/stevel/dev/cloudstore fs-api-shim.dir=/Users/stevel/dev/Formats/fs-api-shim/
Clean up all the build files, including any remote downloads in the downloads/
dir.
ant clean
And then purge all artifacts of that release from maven. This is critical when validating downstream project builds.
ant mvn-purge
target/incoming
This will take a while! look in target/incoming for progress
ant scp-artifacts
Copies the files from downloads/incoming/artifacts
to downloads/hadoop-$version-$rc
'
ant copy-scp-artifacts release.dir.check
The release.dir.check
target just lists the directory.
The normal binary tar.gz files huge because they install a version of the AWS v2 SDK “bundle.jar” file which has been validated with the hadoop-aws module and the S3A connector which was built against it.
This is a really big file because it includes all the “shaded” dependencies as well as client libraries to talk with many unused AWS services up to and including scheduling satellite downlink time.
We ship the full bundle jar as it allows Hadoop and its downstream applications to be isolated from the choice of JAR dependencies in the AWS SDK. That is: it ensures a classpath that works out the box and stop having to upgrade on a schedule determined by maintains the AWS SDK pom files.
It does make for big images and that has some negative consequences.
The “lean” x86 binary tar.gz file aims to reduce eliminate these negative issues by being a variant of the normal x86 binary distribution with the relevant AWS SDK jar removed.
The build target release.lean.tar
can do this once the normal x86 binaries have been downloaded.
ant release.lean.tar
It performs the following actions:
target/bin-lean
bundle-*
from this expanded SDK-lean.tar.gz
If arm64 binaries are being created then they must be built on an arm docker image. Do not use the --asfrelease
option as this stages the JARs. Instead use the explicit --deploy --native --sign
options.
The arm process is one of
.tar.gz
with the native binaries only.asc
file..sha512
checksum file containing the new name. Renaming the old file is not sufficient.downloads/release/$RC
dirTo perform these stages, you need a clean directory of the same hadoop commit ID as for the x86 release.
In build.properties
declare its location
arm.hadoop.dir=/Users/stevel/hadoop/release/hadoop
In that dir, create the release using command below. If signing fails in your ARM docker container, you can skip signing by removing --sign
option. The signing happens in the next step if ant arm.release
process after this.
time dev-support/bin/create-release --docker --dockercache --mvnargs="-Dhttp.keepAlive=false -Dmaven.wagon.http.pool=false" --deploy --native --sign
Important make sure there is no duplicate staged hadoop repo in nexus. If there is: drop and restart the x86 release process to make sure it is the one published
# create the release. # Broken until someone fixes HADOOP-18664. you can't launch create-release --docker from a build file #ant arm.create.release # copy the artifacts to this project's target/ dir, renaming ant arm.copy.artifacts # sign artifacts then move to the shared RC dir alongside the x86 artifacts ant arm.release release.dir.check
When committed to subversion it will be uploaded and accessible via a https://svn.apache.org URL.
This makes it visible to others via the apache svn site, but it is not mirrored yet.
When the RC is released, an svn move
operation can promote it directly.
do this after preparing the arm64 binaries
Final review of the release files
ant release.dir.check
Now stage the files, first by copying the dir of release artifacts into the svn-mananaged location
ant stage
This can take a while...exit any VPN for extra speed.
ant stage-to-svn
Manual
cd $stagingdir svn update svn add <RC directory name> svn commit
This isn't automated as it needs to be done in the source tree.
The ant print-tag-command
prints the command needed to create and sign a tag.
ant print-tag-command
Review/update template message in src/text/vote.txt
. All ant properties referenced will be expanded if set.
ant vote-message
The message is printed and saved to the file target/vote.txt
do not send it until you have validated the URLs resolve
Now wait for the votes to come in. This is a good time to repeat all the testing of downstream projects, this time validating the staged artifacts, rather than any build locally.
This relies on the release-info file pointing to the source directory
http.source=https://dist.apache.org/repos/dist/dev/hadoop/hadoop-${hadoop.version}-RC${rc}/
target | action |
---|---|
release.fetch.http | fetch artifacts |
gpg.keys | import the hadoop KEYS |
gpg.verify | verify the signature of the retrieved artifacts |
release.dir.check | verify release dir exists |
release.src.untar | untar retrieved artifacts |
release.src.build | build the source; call release.src.untar first |
release.src.test | build and test the source; call release.src.untar first |
release.bin.untar | untar the binary file |
release.bin.commands | execute a series of commands against the untarred binaries |
release.site.untar | untar the downloaded site artifact |
release.site.validate | perform minimal validation of the site. |
release.arm.untar | untar the ARM binary file |
release.arm.commands | execute commands against the arm binaries |
release.lean.tar | create a release of the x86 binary tar without the AWS SDK |
set check.native.binaries
to false to skip native binary checks on platforms without them
Downloads under downloads/incoming
ant release.fetch.http
ant gpg.keys gpg.verify
This will import all the KEYS from https://downloads.apache.org/hadoop/common/KEYS, then verify the signature of each downloaded file.
If you don't yet trust the key of whoever signed the release then
Refresh the keys from the OpenPGP server, to see if they've been signed by others.
gpg --refresh-keys
Perform whatever key verification you can and sign the key that level -ideally push up the signature to the servers.
This puts the built artifacts into the local maven repo so do not do this while building/testing downstream projects and call ant mvn-purge
after
ant release.src.untar release.src.build
This build does not attempt to build the native binaries. The Pnative
profile can be enabled (or any other maven arguments) in by declaring them in the property source.compile.maven.args
source.compile.maven.args=-Pnative
These are added at the end of the hard-coded arguments (clean install -DskipTests
)
Testing is also possible through the target release.src.test
ant release.src.test
Again, the options set in source.compile.maven.args
are passed down.
These targets are simply invoking maven in the source subdirectory of downloads/untar/source
, for example downloads/untar/source/hadoop-3.4.1-src
ant release.site.untar release.site.validate
Validation is pretty minimal; it just looks for the existence of index.html files in the site root and under api/.
Untars the (already downloaded) binary tar to bin/hadoop fs -ls $BUCKET/
ant release.bin.untar
Once expanded, the binary commands can be tested
ant release.bin.commands
This will fail on a platform where the native binaries don't load, unless the hadoop checknative
command has been disabled.
This can be done in build.properties
check.native.binaries=false
ant release.bin.commands -Dcheck.native.binaries=false
If check.native.binaries
is false, the bin/hadoop checknative
is still executed, with the outcome printed (reporting a failure if the binaries are not present).
The and build itself is successful.
# untars the `-aarch64.tar.gz` binary ant release.arm.untar ant release.arm.commands
The way to do this is to clone this hadoop-release-support
repository to the remote server and run the validation commands there.
git clone https://github.com/apache/hadoop-release-support.git
A lot of the targets build maven projects from the staged maven artifacts.
For this to work:
build.properties
fileSome of these builds/tests are slow, but they can all be executed in parallel unless you are actually trying to transitively build components, such as run spark tests with the parquet artifact you build with the RC. If you find yourself doing this: you've just become a CI system without the automation.
First, purge your maven repository of all hadoop-
JAR files of the pending release version
ant mvn-purge
Download the artifacts from maven staging repositories and compile/test a minimal application
mvn clean ant mvn-test
Note: setting up on linux needs a number of dependencies installed (at least on an arm64 ubuntu system):
sudo apt-get install libssl-dev openssl subversion maven gpgv openjdk-17-jdk libjsch-java ant ant-optional ant-contrib cmake # add to .bashrc export JAVA_HOME=/usr/lib/jvm/java-17-openjdk-arm64/
To see the dependencies of the maven project:
ant mvn-validate-dependencies
This saves the output to the file target/mvndeps.txt
and explicitly checks for some known “forbidden” artifacts that must not be exported as transitive dependencies.
Review this to make sure there are no unexpected artifacts coming in,
ant cloudstore.build
Note: this does not include the AWS V1 SDK -Pextra
profile.
Ideally, you should run the tests, or even better, run them before the RC is up for review.
Building the libraries. Do this only if you aren't running the tests.
ant gcs.build
Validates hadoop client artifacts; the cloud tests cover hadoop cloud storage clients.
ant spark.build
To view the hadoop-cloud
dependencies
ant spark.hadoop-cloud.dependencies
Review this to look for conflict.
To run the hadoop-cloud
tests
ant spark.hadoop-cloud.test
A full spark test run takes so long that CI infrastructure should be used.
Then followup cloud integration tests if you are set up to build. Spark itself does not include any integration tests of the object store connectors. This independent module tests the s3a, gcs and abfs connectors, and associated committers, through the spark RDD and SQL APIs.
To clean build Apache Parquet:
ant parquet.build
There's no profile for using ASF staging as a source for artifacts. Run this after the spark build so the files are already present.
To clean build Apache Parquet and then run the tests in the parquet-hadoop
module:
ant parquet.test
Edit src/text/announcement.txt
to have an up-to-date description of the release.
The release.site.announcement
target will generate these annoucements. Execute the target and then review the generated files in target/
ant release.site.announcement
The announcement must be generated before the next stage, so make sure the common body of the site and email annoucement is up-to-date: src/text/core-announcement.txt
Set hadoop.site.dir
to be the path of the local clone of the ASF site repository https://gitbox.apache.org/repos/asf/hadoop-site.git
hadoop.site.dir=/Users/stevel/hadoop/release/hadoop-site
Prepare the site; this also demand-generates the release announcement.
The site .tar.gz distributable is used for the site; this must already have been downloaded. It must be untarred and copied under the SCM-managed ${hadoop.site.dir}
repository, linked up and then committed.
ant release.site.untar ant release.site.docs
In the hadoop site dir content/docs subdir
# update git pull # review current status ls -l # symlink current rm current3 ln -s r.3.3.5 current3 # symlink stable rm stable3 ln -s r3.3.5 stable3 # review new status ls -l
Run hugo
to build the site and verify. Follow the doc https://cwiki.apache.org/confluence/display/HADOOP/How+to+generate+and+push+ASF+web+site NOTE: On some setups hugo server
command to test locally may change the links to localhost. In that case, create a commit first before running hugo server
command and verify the links on https://hadoop.apache.org/ after pushing the changes to the remote asf-site.
Finally, commit
git add . git status git commit -S -m "HADOOP-18470. Release Hadoop 3.3.5" git push
svn move
# check that the source and dest URLs are good ant staging-init # do the promotion ant stage-move-to-production
This does a sequence of
Both commits use the generated message production.commit.msg
.
The svn-init
command prints this out.
current
refTODO: document/automate.
https://dist.apache.org/repos/dist/release/hadoop/common
Check that release URL in your browser.
do this at https://repository.apache.org/#stagingRepositories
to verify this is visible search for hadoop-common -verify the latest version is in the production repository.
Go to JIRA and
Release links.
The ant print-tag-command
target prints the command needed to create and sign a tag.
ant print-tag-command
Use the “tagging the final release” commands printed
For safety, purge your maven repo of all versions of the release, so as to guarantee that everything comes from the production store.
ant mvn-purge
There are a lot of files, and if your shell has a prompt which shoes the git repo state, scanning can take a long time. Disable it, such as for fish:
set -e __fish_git_prompt_showdirtystate
asf-staging
Many projects have a profile to use a staging repository, especially the ASF one.
Not all do -these builds are likely to fail. Here is a profile, asf-staging
which can be used to enable this. The paths to the repository can be changed too, if desired.
Some of the maven builds invoked rely on this profile (e.g. avro). For some unknown reason the parquet build doesn't seem to cope.
<profile> <id>asf-staging</id> <properties> <!-- override point for ASF staging/snapshot repos --> <asf.staging>https://repository.apache.org/content/groups/staging/</asf.staging> <asf.snapshots>https://repository.apache.org/content/repositories/snapshots/</asf.snapshots> </properties> <pluginRepositories> <pluginRepository> <id>ASF Staging</id> <url>${asf.staging}</url> </pluginRepository> <pluginRepository> <id>ASF Snapshots</id> <url>${asf.snapshots}</url> <snapshots> <enabled>true</enabled> </snapshots> <releases> <enabled>false</enabled> </releases> </pluginRepository> </pluginRepositories> <repositories> <repository> <id>ASF Staging</id> <url>${asf.staging}</url> <snapshots> <enabled>true</enabled> </snapshots> <releases> <enabled>true</enabled> </releases> </repository> <repository> <id>ASF Snapshots</id> <url>${asf.snapshots}</url> <snapshots> <enabled>true</enabled> </snapshots> <releases> <enabled>true</enabled> </releases> </repository> </repositories> </profile>
Drop the staged artifacts from nexus https://repository.apache.org/#stagingRepositories
Delete the tag. Print out the delete command and then copy/paste it into a terminal in the hadoop repository
ant print-tag-command
Remove downloaded files and maven artifactgs
ant clean mvn-purge
svn rm
the RC subdirsvn commit -m "rollback RC"
ant stage-svn-rollback # and get the log ant stage-svn-log
See releasing Hadoop-thirdparty
There are lots of opportunities to contribute to the module
During the release phase of a Hadoop release: whatever is needed to ship!
This repo works on Commit-then-Review; that is: no need to wait for review by others before committing. This is critical for rapid evolution during the release process. Just expect to be required to justify changes after the fact.