blob: 5a36dc4a5211e539484f5f4cc2803e9b52b2d899 [file] [log] [blame]
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
Source: oozie
Section: misc
Priority: extra
Maintainer: Bigtop <dev@bigtop.apache.org>
Build-Depends: debhelper (>= 7.0.50~)
Depends: zip, unzip
Standards-Version: 3.8.0
Homepage: http://oozie.apache.org/
Package: oozie-client
Architecture: all
Depends: bigtop-utils (>= 0.7)
Description: Client for Oozie Workflow Engine
Oozie client is a command line client utility that allows remote
administration and monitoring of worflows. Using this client utility
you can submit worflows, start/suspend/resume/kill workflows and
find out their status at any instance. Apart from such operations,
you can also change the status of the entire system, get vesion
information. This client utility also allows you to validate
any worflows before they are deployed to the Oozie server.
Package: oozie
Architecture: all
Depends: oozie-client (= ${source:Version}), hadoop-client, bigtop-tomcat
Description: Oozie is a system that runs workflows of Hadoop jobs.
Oozie is a system that runs workflows of Hadoop jobs.
Oozie workflows are actions arranged in a control dependency DAG (Direct
Acyclic Graph).
.
Oozie coordinator functionality allows to start workflows at regular
frequencies and when data becomes available in HDFS.
.
An Oozie workflow may contain the following types of actions nodes:
map-reduce, map-reduce streaming, map-reduce pipes, pig, file-system,
sub-workflows, java, hive, sqoop and ssh (deprecated).
.
Flow control operations within the workflow can be done using decision,
fork and join nodes. Cycles in workflows are not supported.
.
Actions and decisions can be parameterized with job properties, actions
output (i.e. Hadoop counters) and HDFS file information (file exists,
file size, etc). Formal parameters are expressed in the workflow definition
as ${VARIABLE NAME} variables.
.
A Workflow application is an HDFS directory that contains the workflow
definition (an XML file), all the necessary files to run all the actions:
JAR files for Map/Reduce jobs, shells for streaming Map/Reduce jobs, native
libraries, Pig scripts, and other resource files.
.
Running workflow jobs is done via command line tools, a WebServices API
or a Java API.
.
Monitoring the system and workflow jobs can be done via a web console, the
command line tools, the WebServices API and the Java API.
.
Oozie is a transactional system and it has built in automatic and manual
retry capabilities.
.
In case of workflow job failure, the workflow job can be rerun skipping
previously completed actions, the workflow application can be patched before
being rerun.