Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Excerpt
hiddentrue

cache(cache key, expression1, expression2,..., expressionN) Build and use a set of values across the records in a stream setrecordset.

Function: cache()

Build and use a set of values across the records in a stream setrecordset.

The first parameter is the cache key expression. This is evaluated and the result used as the key value for the cache. Before the subsequent expressions are evaluated the key value is used to retrieve the current value from the cache associated with that key. This retrieved value is then assigned to the internal variable _cacheValue. If there was no value in the cache for this key then _cacheValue is set to _NULL.

This cache key expression is followed by a number of additional expressions. These are evaluated in turn, and the value returned by the final expression is placed into the cache against the original key value.

The final cache value is returned by the expression.

Please note that the internal variable _cacheValue should not be used outside of the cache function.

Tip

When you use this function, PhixFlow log messages include the cache size.

...


Synchronisation

The cache function "synchronises" itself across each call. In PhixFlow, to make best use of modern computer architectures with parallel processing capabilities, when you run analysis the work to be done is divided up into parcels. These may be processed in any order. However, to make sure that the results from the cache function remain consistent, PhixFlow ensures that calls to the cache function are "synchronised" across all these parcels of work.

It is only necessary to be concerned about synchronisation if the same key will be used across multiple parcels of work. However, this consideration can become complex, so as a general rule, try and use a single call to the cache function in the stream table configuration.

Example

You have a calculate stream table processing, say, 10,000 records - and you want to accumulate totals based on a key value, of which there are 100 across the data set. For each key value, many records will be added to the total. To do this, you should use a call to the cache function such as:

...

ArgumentTypeDescription
cache keyExpressionAny valid Expression
expressionNExpressionAny valid Expression

Examples

Example Setup

Code Block
$result = cache("accountRef", _cacheValue + 1)

...

Return the cache value for key value "accountRef".

Code Block
$result = cache("accountRef", _cacheValue + 1)


Example Generating UIDs

This example is for the case where a user wants to generate UIDs in a specific way.

In this example, the table receives both updates for existing records and new records. This table is used to correctly set/lookup the various ID fields.

If 3 records came into this table that had the same value for a specific attribute (e.g. another ID field such as "BarcodeNumber"), then we do not want to generate a new ID for each record, instead, we want to generate a single ID and use this value for all three records. The configuration is as follows:


Code Block
do(
	if($newRecord,
	// then
		do(
            $recordID = cache(_out.BarcodeNumber,
				if( _cacheValue == _NULL,
				// then	
					$recordID = nextValue('RecordID'),
				// else
					_cacheValue
                )
            )
                
		),
	// else
		$recordID = in.RecordID
    ),
    $recordID
)


This expression above uses the cache function to hold a newly generated ID (nextValue('RecordID') in memory for each unique _out.BarcodeNumber field. So the 1st record that is processed gets the new ID generated by the sequence, and then any subsequent records that are processed will find a match for their BarcodeNumber in the cache, and take that ID that was generated for the 1st BarcodeNumber.


See Also