Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

These Aggregate functions are aggregate functions as available on in stream views and pipes. Aggregate functions serve the purpose of aggregating data and giving it more eaningfull meaning. They are usually a common feature in any data based environment. In database management, for instance aggregate functions are used where multiple rows are grouped together to form a singe value or to find out more significant meaning, represented by a single value on a set of values.Aggregate functions are also available on pipes and views in PhixFlow. The following composes a set of available functions (all aggregate function below expect a single argument - field name):aggregate data, often grouped by other attributes, to give totals, max, min, avg values etc. for each key value, e.g. for each customer, region.

FunctionDescription
average
Average

This function returns the arithmetic average value of

the set of numbers. It takes one argument (a name of a field) of numeric type: Long, Float or Integer. It is only available if field on which it is applied, is of such type.maximum

an attribute from records with the same key value.

In detail, it returns the sum of all values for a key value, divided by the number of values that are not null.

It is only available for numeric attributes.

CountThis function returns the
maximum numeric value out of the set of numeric, dates as well as string and values. It takes one argument (a name of a field) of type, either: Long, Float, Integer or a String. In case of a String, it will return a "maximum string", that is a string that would be the last string out of the whole set of strings, if they were sorted in alphabetical order. In case of a date - it will return the latest date.minimum
number of non-null values of an attribute from records with the same key value.
DistinctThis function returns the number of unique values of an attribute from records with the same key value.
Maximum

This function returns the

minimum numeric

maximum value

out

of

the set of numeric, dates as well as string values. It takes one argument (a name of a field) of type, either: Long, Float, Integer or a String. In case of a String, it will return a "minimum string", that is a string that would be the first string out of the whole set of strings, if they were sorted in alphabetical order. In case of a date - it will return the earliest date.sumThis function returns the sum calculated by suming all the values in a given set of values. It is applicable only (and only then it is available) if the name of the field passed to this function represents numeric field of type: Long, Float or Integer.percentThis function expects a name of numeric field to calculate percentage of sum of numerical values of each group in relation to a whole set of grouped values. It is therefore necessary to use this function in conjunction with grouping attribute, as otherwise only default grouping would be applied and this would present no meaningful result, as it would always default to 100% (each record would have been in its own group). If however we assume having a file with 2 records, like so

an attribute from records with the same key value.

It is available for numeric, string and date/time attributes.

Strings are ordered alphabetically, with the shortest strings first.

Minimum

This function returns the minimum value of an attribute from records with the same key value.

It is available for numeric, string and date/time attributes.

 Strings are ordered alphabetically, with the shortest strings first.

Percent

This function can be applied to a numeric field, to return the percentage of the sum of values for each key value in relation to the total sum of values.

You should use this function in conjunction with one or more grouping attributes - each combination of values from these grouping attributes gives a key value.

For example, you have a stream with 2 fields:

NumberLabel
12A
14B
13C
10A
5A
2

B

and

we also assume grouping by "Label", our percent function would return (if applied o "Number" attribute), the following result in a view (assuming we also selected Label attribute to be shown in our view and rounding to 2 decimal places)

you create a grid view:

  • dragging in the attribute Label - which automatically becomes a grouping attribute when one or more of the other attributes are aggregated
  • dragging in the attribute Number, and applying the aggregate function Percent
  • setting the rounding on the stream view attribute Number to 2 decimal places

This will give the view:

NumberLabel
48.22A
28.57B
23.21C
The above are percentages of values assigned to given grouping parameters (A, B, C) as a percentage portion of the whole, computed as


Sum

This function returns the sum of all

such

values

.distinctReturns the number of unique values for the given attribute.countCounts the number of elements

of an attribute from records with the same key value.

It is only available for numeric attributes.