commit | fa80240f72a6551c2ee567d471afa499fd1d1efe | [log] [tgz] |
---|---|---|
author | Rahul Iyer <riyer@apache.org> | Fri Feb 24 14:32:32 2017 -0800 |
committer | Rahul Iyer <riyer@apache.org> | Mon Feb 27 13:13:18 2017 -0800 |
tree | 958ea8833fa84f52a7ddeadaba7410124d0e2f6b | |
parent | 0b8415e7eec5c9ebb83fbf22923c69a99b0056ef [diff] |
Build: Fix module sort order for PGXN installation JIRA: MADLIB-1024 PGXN installation involves creating a single extension sql file that contains all the SQL commands run during MADlib deployment. The modules added into this extension file are to be placed in the right order, taking dependencies into account. MADlib has a function that compares a given file path with topologically sorted modules to decide the order of concatenation to extension file. This comparison is faulty since the module name is searched for in the whole path, leading to false positive with modules that have another module name as substring. The specific bug was reported as 'svec_util' being flagged in same order as 'svec'. This commit fixes this issue taking advantage of the file path names being of the form '.../modules/<module_name>/...', hence comparing the whole module name. Closes #106
MADlib® is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data.
See the project webpage MADlib Home
for links to the latest binary and source packages. For installation and contribution guides, please see MADlib Wiki
The latest documentation of MADlib modules can be found at MADlib Docs
.
The following block-diagram gives a high-level overview of MADlib's architecture.
MADlib incorporates material from the following third-party components
argparse 1.2.1
“provides an easy, declarative interface for creating command line tools”Boost 1.47.0 (or newer)
“provides peer-reviewed portable C++ source libraries”Eigen 3.2.2
“is a C++ template library for linear algebra”PyYAML 3.10
“is a YAML parser and emitter for Python”PyXB 1.2.4
“is a Python library for XML Schema Bindings”License information regarding MADlib and included third-party libraries can be found inside the license
directory.
Changes between MADlib versions are described in the ReleaseNotes.txt
file.
MAD Skills : New Analysis Practices for Big Data (VLDB 2009)
Hybrid In-Database Inference for Declarative Information Extraction (SIGMOD 2011)
Towards a Unified Architecture for In-Database Analytics (SIGMOD 2012)
The MADlib Analytics Library or MAD Skills, the SQL (VLDB 2012)