blob: 04a5a6560f6d68da75ce7e538393205515a7665b [file] [log] [blame]
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
Source: pig-udf-datafu
Section: misc
Priority: extra
Maintainer: Bigtop <dev@bigtop.apache.org>
Build-Depends: debhelper (>= 7.0.50~)
Standards-Version: 3.8.0
Homepage: https://github.com/linkedin/datafu
Package: pig-udf-datafu
Architecture: all
Depends: pig
Description: A collection of user-defined functions for Hadoop and Pig.
DataFu is a collection of user-defined functions for working with large-scale
data in Hadoop and Pig. This library was born out of the need for a stable,
well-tested library of UDFs for data mining and statistics. It is used
at LinkedIn in many of our off-line workflows for data derived products like
"People You May Know" and "Skills".
.
It contains functions for: PageRank, Quantiles (median), variance, Sessionization,
Convenience bag functions (e.g., set operations, enumerating bags, etc),
Convenience utility functions (e.g., assertions, easier writing of EvalFuncs)
and more...