Introduction

The DML (Declarative Machine Learning) language provides built-in functions that offer both low-level and high-level functionality to support a wide range of use cases.

Builtins are either implemented at the compiler level or as DML scripts that are loaded at compile time.

Built-In Construction Functions

Some built-in functions generate objects for us: they create matrices, tensors, lists, and other non-primitive objects.

tensor-Function

The tensor-function creates a tensor from the given data and dimensions.

Usage

tensor(data, dims, byRow = TRUE)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
data | Matrix[?], Tensor[?], Scalar[?] | required | The data with which the tensor should be filled. See data-Argument.
dims | Matrix[Integer], Tensor[Integer], Scalar[String], List[Integer] | required | The dimensions of the tensor. See dims-Argument.
byRow | Boolean | TRUE | NOT USED. Will probably be removed or replaced.

Note that this function is highly unstable; it will be reworked, and its signature and functionality might change.

Returns

Type | Description
---- | -----------
Tensor[?] | The generated Tensor. Will support more datatypes than Double.

data-Argument

The data-argument can be a Matrix of any datatype, whose elements are taken and placed into the tensor until it is filled. If a Tensor is given, the same procedure applies. The Matrix or Tensor is traversed by starting with every dimension index at 0 and incrementing the lowest (innermost) dimension index until a complete pass over that dimension is made, after which the next higher dimension index is increased. This continues until the tensor is completely filled.

If data is a Scalar, the whole tensor is filled with that value.
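
For illustration, a minimal sketch of this fill order (hypothetical values, assuming the behaviour described above):

M = matrix("1 2 3 4 5 6", 2, 3); # six source values, read row by row
T = tensor(M, "3 2");            # refilled into a 3x2 tensor: [[1,2],[3,4],[5,6]]
print(toString(T))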

dims-Argument

The dimensions of the tensor can be given as a vector represented by a Matrix, Tensor, String, or List. Dimensions given as a String are expected to be separated by spaces.

Example

print("Dimension matrix:");
d = matrix("2 3 4", 1, 3);
print(toString(d, decimal=1))

print("Tensor A: Fillvalue=3, dims=2 3 4");
A = tensor(3, d); # fill with value, dimensions given by matrix
print(toString(A))

print("Tensor B: Reshape A, dims=4 2 3");
B = tensor(A, "4 2 3"); # reshape tensor, dimensions given by string
print(toString(B))

print("Tensor C: Reshape dimension matrix, dims=1 3");
C = tensor(d, list(1, 3)); # values given by matrix, dimensions given by list
print(toString(C, decimal=1))

print("Tensor D: Values=tst, dims=Tensor C");
D = tensor("tst", C); # fill with string, dimensions given by tensor
print(toString(D))

Note that reshape construction is not yet supported for SPARK execution.

DML-Bodied Built-In Functions

DML-bodied built-in functions are written as DML-Scripts and executed as such when called.

gridSearch-Function

The gridSearch-function is used to find the optimal hyper-parameters of a model, i.e., those which result in the most accurate predictions. The training and evaluation functions are passed by name.

Usage

gridSearch(X, y, train, predict, params, paramValues, verbose)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Input Matrix of vectors.
y | Matrix[Double] | required | Input Matrix of vectors.
train | String | required | Specified training function.
predict | String | required | Evaluation based function.
params | List[String] | required | List of parameters
paramValues | List[Unknown] | required | Range of values for the parameters
verbose | Boolean | TRUE | If TRUE print messages are activated

Returns

Type | Description
---- | -----------
Matrix[Double] | Parameter combination
Frame[Unknown] | Best results model

Example

X = rand (rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
params = list("reg", "tol", "maxi")
paramRanges = list(10^seq(0,-4), 10^seq(-5,-9), 10^seq(1,3))
[B, opt] = gridSearch(X=X, y=y, train="lm", predict="lmPredict", params=params, paramValues=paramRanges, verbose = TRUE)

KMeans-Function

The kmeans-function implements the KMeans clustering algorithm.

Usage

kmeans(X = X, k = 20, runs = 10, max_iter = 5000, eps = 0.000001, is_verbose = FALSE, avg_sample_size_per_centroid = 50)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | The input matrix to run KMeans on.
k | Int | 10 | Number of centroids
runs | Int | 10 | Number of runs (with different initial centroids)
max_iter | Int | 100 | Maximum number of iterations allowed
eps | Double | 0.000001 | Tolerance (epsilon) for the WCSS change ratio
is_verbose | Boolean | FALSE | If TRUE, print per-iteration statistics

Returns

Type | Description
---- | -----------
Matrix[Double] | The mapping of records to centroids
Matrix[Double] | The output matrix with the centroids
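
Example

A minimal usage sketch; the two outputs correspond to the rows listed under Returns above:

X = rand(rows = 100, cols = 3)
[out1, out2] = kmeans(X = X, k = 5, runs = 10, max_iter = 100, eps = 0.000001, is_verbose = FALSE)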

lm-Function

The lm-function solves linear regression using either the direct solve method or the conjugate gradient algorithm depending on the input size of the matrices (See lmDS-function and lmCG-function respectively).

Usage

lm(X, y, icpt = 0, reg = 1e-7, tol = 1e-7, maxi = 0, verbose = TRUE)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.
y | Matrix[Double] | required | 1-column matrix of response values.
icpt | Integer | 0 | Intercept presence, shifting and rescaling the columns of X (Details)
reg | Double | 1e-7 | Regularization constant (lambda) for L2-regularization. Set to nonzero for highly dependent/sparse/numerous features.
tol | Double | 1e-7 | Tolerance (epsilon); the conjugate gradient procedure terminates early if the L2 norm of the beta-residual is less than tolerance * its initial norm
maxi | Integer | 0 | Maximum number of conjugate gradient iterations. 0 = no maximum
verbose | Boolean | TRUE | If TRUE print messages are activated

Note that if the number of features is small enough (rows of X/y < 2000), the lmDS-function is called internally and the parameters tol and maxi are ignored.

Returns

Type | Description
---- | -----------
Matrix[Double] | 1-column matrix of weights.

icpt-Argument

The icpt-argument can be set to 3 modes:

  • 0 = no intercept, no shifting, no rescaling
  • 1 = add intercept, but neither shift nor rescale X
  • 2 = add intercept, shift & rescale X columns to mean = 0, variance = 1

Example

X = rand (rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
lm(X = X, y = y)
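
To add an intercept and standardize the feature columns (icpt mode 2), for example:

B = lm(X = X, y = y, icpt = 2)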

lmDS-Function

The lmDS-function solves linear regression by directly solving the linear system.

Usage

lmDS(X, y, icpt = 0, reg = 1e-7, verbose = TRUE)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.
y | Matrix[Double] | required | 1-column matrix of response values.
icpt | Integer | 0 | Intercept presence, shifting and rescaling the columns of X (Details)
reg | Double | 1e-7 | Regularization constant (lambda) for L2-regularization. Set to nonzero for highly dependent/sparse/numerous features.
verbose | Boolean | TRUE | If TRUE print messages are activated

Returns

Type | Description
---- | -----------
Matrix[Double] | 1-column matrix of weights.

Example

X = rand (rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
lmDS(X = X, y = y)

lmCG-Function

The lmCG-function solves linear regression using the conjugate gradient algorithm.

Usage

lmCG(X, y, icpt = 0, reg = 1e-7, tol = 1e-7, maxi = 0, verbose = TRUE)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.
y | Matrix[Double] | required | 1-column matrix of response values.
icpt | Integer | 0 | Intercept presence, shifting and rescaling the columns of X (Details)
reg | Double | 1e-7 | Regularization constant (lambda) for L2-regularization. Set to nonzero for highly dependent/sparse/numerous features.
tol | Double | 1e-7 | Tolerance (epsilon); the conjugate gradient procedure terminates early if the L2 norm of the beta-residual is less than tolerance * its initial norm
maxi | Integer | 0 | Maximum number of conjugate gradient iterations. 0 = no maximum
verbose | Boolean | TRUE | If TRUE print messages are activated

Returns

Type | Description
---- | -----------
Matrix[Double] | 1-column matrix of weights.

Example

X = rand (rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
lmCG(X = X, y = y, maxi = 10)

lmpredict-Function

The lmpredict-function predicts the class of a feature vector.

Usage

lmpredict(X, w)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vector(s).
w | Matrix[Double] | required | 1-column matrix of weights.
icpt | Integer | 0 | Intercept presence, shifting and rescaling of X (Details)

Returns

Type | Description
---- | -----------
Matrix[Double] | 1-column matrix of classes.

Example

X = rand (rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
w = lm(X = X, y = y)
yp = lmpredict(X, w)

scale-Function

The scale-function is a generic function whose default method centers and/or scales the columns of a numeric matrix.

Usage

scale(X, center=TRUE, scale=TRUE)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.
center | Boolean | required | Either a logical or a numerical value.
scale | Boolean | required | Either a logical or a numerical value.

Returns

Type | Description
---- | -----------
Matrix[Double] | Matrix of scaled and/or centered values.

Example

X = rand(rows = 20, cols = 10)
center=TRUE;
scale=TRUE;
Y = scale(X, center, scale)
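
With both flags set to TRUE, the result corresponds (as a sketch using standard builtins, not necessarily the exact implementation) to centering each column by its mean and scaling by its standard deviation:

Y2 = (X - colMeans(X)) / colSds(X)   # column-wise center and scale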

sigmoid-Function

The sigmoid-function is an activation function, also known as a squashing function, which limits its output to the range between 0 and 1; this makes it useful for predicting probabilities.

Usage

sigmoid(X)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.

Returns

Type | Description
---- | -----------
Matrix[Double] | Matrix of element-wise sigmoid values.

Example

X = rand (rows = 20, cols = 10)
Y = sigmoid(X)
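
The call above applies the logistic function element-wise and is conceptually equivalent to:

Y2 = 1 / (1 + exp(-X))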

steplm-Function

The steplm-function (stepwise linear regression) implements a classical forward feature selection method. This method iteratively runs what-if scenarios and greedily selects the next best feature until the Akaike information criterion (AIC) no longer improves. Each configuration trains a regression model via lm, which in turn calls either the closed-form lmDS or the iterative lmCG.

Usage

steplm(X, y, icpt);

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.
y | Matrix[Double] | required | 1-column matrix of response values.
icpt | Integer | 0 | Intercept presence, shifting and rescaling the columns of X (Details)
reg | Double | 1e-7 | Regularization constant (lambda) for L2-regularization. Set to nonzero for highly dependent/sparse/numerous features.
tol | Double | 1e-7 | Tolerance (epsilon); the conjugate gradient procedure terminates early if the L2 norm of the beta-residual is less than tolerance * its initial norm
maxi | Integer | 0 | Maximum number of conjugate gradient iterations. 0 = no maximum
verbose | Boolean | TRUE | If TRUE print messages are activated

Returns

Type | Description
---- | -----------
Matrix[Double] | Matrix of regression parameters (the betas), whose size depends on the icpt input value. (C in the example)
Matrix[Double] | Matrix of selected features ordered as computed by the algorithm. (S in the example)

icpt-Argument

The icpt-argument can be set to 2 modes:

  • 0 = no intercept, no shifting, no rescaling
  • 1 = add intercept, but neither shift nor rescale X

selected-Output

If the best AIC is achieved without any features, the matrix of selected features contains 0. Moreover, in this case no further statistics will be produced.

Example

X = rand (rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
[C, S] = steplm(X = X, y = y, icpt = 1);

slicefinder-Function

The slicefinder-function returns the top-k worst performing subsets of the data as evaluated by the given model.

Usage

slicefinder(X, W, y, k, paq, S);

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Recoded dataset as a matrix
W | Matrix[Double] | required | Trained model
y | Matrix[Double] | required | 1-column matrix of response values.
k | Integer | 1 | Number of subsets required
paq | Integer | 1 | Number of values wanted for each column; if paq = 1 it is off
S | Integer | 2 | Number of subsets to combine (currently only 1 and 2 are supported)

Returns

Type | Description
---- | -----------
Matrix[Double] | Matrix containing the information of the top-k slices (relative error, standard error, value0, value1, col_number(sort), rows, cols, range_row, range_cols, value00, value01, col_number2(sort), rows2, cols2, range_row2, range_cols2)

Example

X = rand (rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
w = lm(X = X, y = y)
ress = slicefinder(X = X, W = w, y = y, k = 5, paq = 1, S = 2);

normalize-Function

The normalize-function normalizes the values of a matrix by rescaling the data to a common scale while preserving the differences in the ranges of values. The output is a matrix of values in the range [0, 1].

Usage

normalize(X); 

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.

Returns

Type | Description
---- | -----------
Matrix[Double] | Matrix of normalized values.

Example

X = rand(rows = 50, cols = 10)
y = X %*% rand(rows = ncol(X), cols = 1)
y = normalize(X = X)
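
Assuming column-wise min-max scaling (a sketch, not necessarily the exact implementation), the output corresponds to:

Y2 = (X - colMins(X)) / (colMaxs(X) - colMins(X))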

gnmf-Function

The gnmf-function does Gaussian Non-Negative Matrix Factorization. In this, a matrix X is factorized into two matrices W and H, such that all three matrices have no negative elements. This non-negativity makes the resulting matrices easier to inspect.

Usage

gnmf(X, rnk, eps = 10^-8, maxi = 10)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of feature vectors.
rnk | Integer | required | Number of components into which matrix X is to be factored.
eps | Double | 10^-8 | Tolerance
maxi | Integer | 10 | Maximum number of conjugate gradient iterations.

Returns

Type | Description
---- | -----------
Matrix[Double] | List of pattern matrices, one for each repetition.
Matrix[Double] | List of amplitude matrices, one for each repetition.

Example

X = rand(rows = 50, cols = 10)
W = rand(rows = nrow(X), cols = 2, min = -0.05, max = 0.05);
H = rand(rows = 2, cols = ncol(X), min = -0.05, max = 0.05);
[W, H] = gnmf(X = X, rnk = 2, eps = 10^-8, maxi = 10)

naivebayes-Function

The naivebayes-function computes the class conditional probabilities and class priors.

Usage

naivebayes(D, C, laplace, verbose)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
D | Matrix[Double] | required | One dimensional column matrix with N rows.
C | Matrix[Double] | required | One dimensional column matrix with N rows.
laplace | Double | 1 | Any Double value.
verbose | Boolean | TRUE | Boolean value.

Returns

Type | Description
---- | -----------
Matrix[Double] | Class priors, one dimensional column matrix with N rows.
Matrix[Double] | Class conditional probabilities, one dimensional column matrix with N rows.

Example

D = rand(rows = 10, cols = 1, min = 10, max = 20)
C = rand(rows = 10, cols = 1, min = 10, max = 20)
[prior, classConditionals] = naivebayes(D, C, laplace = 1, verbose = TRUE)

outlier-Function

The outlier-function takes a matrix dataset as input and determines which point(s) have the largest difference from the mean.

Usage

outlier(X, opposite)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Matrix of the recoded dataset for outlier evaluation
opposite | Boolean | required | TRUE (1) to evaluate outliers from the upper quartile range, FALSE (0) to evaluate outliers from the lower quartile range

Returns

Type | Description
---- | -----------
Matrix[Double] | Matrix indicating outlier values

Example

X = rand (rows = 50, cols = 10)
outlier(X=X, opposite=1)

toOneHot-Function

The toOneHot-function encodes an unordered categorical vector into multiple binarized vectors (one-hot encoding).

Usage

toOneHot(X, numClasses)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Matrix[Double] | required | Vector with N integer entries between 1 and numClasses.
numClasses | Integer | required | Number of columns; must be greater than or equal to the largest value in X.

Returns

Type | Description
---- | -----------
Matrix[Double] | One-hot-encoded matrix with shape (N, numClasses).

Example

numClasses = 5
X = round(rand(rows = 10, cols = 1, min = 1, max = numClasses))
y = toOneHot(X,numClasses)
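
Assuming X is a column vector of integer class labels, the encoding is conceptually equivalent to a contingency table of row indices against labels:

Y2 = table(seq(1, nrow(X)), X, nrow(X), numClasses)   # 1 at position (i, X[i]), 0 elsewhere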

msvm-Function

The msvm-function implements a built-in multiclass SVM with squared slack variables. It learns one-against-the-rest binary classifiers by calling the l2SVM function.

Usage

msvm(X, Y, intercept, epsilon, lambda, maxIterations, verbose)

Arguments

Name | Type | Default | Description
---- | ---- | ------- | -----------
X | Double | --- | Matrix X of feature vectors.
Y | Double | --- | Matrix Y of class labels.
intercept | Boolean | False | No intercept (if set to TRUE, a constant bias column is added to X)
num_classes | Integer | 10 | Number of classes.
epsilon | Double | 0.001 | Procedure terminates early if the reduction in objective function value is less than epsilon (tolerance) times the initial objective function value.
lambda | Double | 1.0 | Regularization parameter (lambda) for L2 regularization
maxIterations | Integer | 100 | Maximum number of conjugate gradient iterations
verbose | Boolean | False | Set to TRUE to print while training.

Returns

Name | Type | Default | Description
---- | ---- | ------- | -----------
model | Double | --- | Model matrix.

Example

X = rand(rows = 50, cols = 10)
y = round(X %*% rand(rows=ncol(X), cols=1))
model = msvm(X = X, Y = y, intercept = FALSE, epsilon = 0.005, lambda = 1.0, maxIterations = 100, verbose = FALSE)