By the end of this chapter you will be able to:
So far your model has merged data from two files, then grouped and sorted the records. You now have a stream that has the latest data about the TV package that each customer has.
You now want all the channels that the customer has ordered. You will look up the data about the channels provided by each package in database table. To make sure that PhixFlow finds all the channels, you will use an output multiplier.
Load data from a database table
First, load the database table SOURCE_PACKAGE_CHANNELS
into a stream. Your model will then have the list of channels available in each package.
Remember, the Introduction to Modelling course explains about loading data from databases; see section 3 Creating Datasources and Database Collectors.
- In the repository browser, expand the list of Datasources.
- Drag the datasource called
Rating and Billing
into your model. - In your model, hover your mouse pointer over
Rating and Billing
and clickError rendering macro 'excerpt-include' : No link could be created for '_show_tables'.. PhixFlow displays a list of the tables available in this database. You can resize the list to make it easier to browse. - Right-click on
SOURCE_PACKAGE_CHANNELS
and selectError rendering macro 'excerpt-include' : No link could be created for '_create_collector'.. PhixFlow adds a new database collector to the model, and gives it the same name as the source data table,SOURCE_PACKAGE_CHANNELS
. In the settings → Basic Settings section → Statement Expression, you can see the database query is:select * from SOURCE_PACKAGE_CHANNELS
. - Hover your mouse pointer over the new database collector, and click Error rendering macro 'excerpt-include' : No link could be created for '_add_stream_with_attributes'.. PhixFlow adds a new stream to your model and also gives it the name
. In the settings → Attributes section, you can see that the data has two attributes (columns): Package and Channel.SOURCE_PACKAGE_CHANNELS
To load the data into the stream, run analysis on this section of the model and check the data has loaded.
Snapshot of your model and the data you have loaded:
Take a moment to look at this data. For each package, there are multiple channels included. For example, the News package includes Channel 1, Channel 2, Science 2 and NewsExtra.
Add another stream
Update your model to add a stream. This stream will take all the data that we have merged, sorted and grouped in the model so far.
- Hover your mouse pointer over the
Latest Package Update
stream and clickError rendering macro 'excerpt-include' : No link could be created for '_add_stream'.. - In Basic Settings, set the Name to
Latest Package All Channels
. Drag all the attributes from
Latest Package Update
to the settings forLatest Package All Channels
.- Click Apply and Close to save and close the settings tab.
- Close the attributes list.
Supplement the stream with look-up data
You now need to add to the data in the Latest Package All Channels
stream. You will use a look-up pipe to add the channels provided by a package. PhixFlow needs to know how the data in the source stream matches the data in the receiving stream.
- In the model, add a pipe from
SOURCE_PACKAGE_CHANNELS
toLatest Package All Channels
. PhixFlow opens the settings for the new pipe. - In the Basic Settings section, set:
- Name:
find_channels
- Type: Look-up
- Name:
- In the Order/Index section, click Error rendering macro 'excerpt-include' : No link could be created for '_stream_attributes'.. PhixFlow opens two lists of attributes: one list for the source stream and another for the receiving stream.
- From the source stream list, on the left, drag
PACKAGE
into the Order/Index grid.
This tells PhixFlow about the key column in the source stream. Notice that PhixFlow has added a line to the Order/Index grid. - From the receiving stream list, on the right, drag
Package
over the top of the line in the Order/Index grid.
This tells PhixFlow about the matching key column in the receiving stream. Notice that PhixFlow has now added the Index Expression_outPackage
. However, we do not want PhixFlow to use_outPackage
. because output multipliers are calculated before the output is generated. We need it to use the data on the input. - In the Order/Index section, double-click the new line that PhixFlow has added. PhixFlow opens the settings for the
PACKAGE
attribute. Set:- Basic Settings → Index Expression from
_out.Package
toin.Package
. - Click Apply and Close to save and close the settings tab.
- Basic Settings → Index Expression from
- Close the attribute list.
- In the pipe settings, click Apply and Close to save and close the settings tab.
- In the model toolbar, click Apply to save your model.
At this point, if you run your model (I think) it will find each customer, e.g. C010, and will read that the customer has the News package. It will then look-up the channel for this to find News: Channel 1. But having done one look-up for the customer it will move onto the next customer.
This means that the lookup will match records in the reference data (SOURCE_PACKAGE_CHANNELS
) where PACKAGE
= Package
from the input pipe in
We must do this, rather than using the value of Package
from the output because we are GOING TO USE the lookup to drive an output multiplier – and output multipliers are calculated before the output is generated
Set an output multiplier
An output multiplier tells PhixFlow to process the same record multiple times, each time producing a new record. This is called breaking-out a record. In this case we want PhixFlow to keep processing the same customer record until it has found all the channels. This means our single input record:
C010 | News |
becomes multiple output records, like this:
C010 | News | Channel 1 |
C010 | News | Channel 2 |
C010 | News | Channel 3 |
C010 | News | NewsExtra |
The output multiplier needs to know:
- which pipe it will use -
find_channels
. - what data you want to add - the channels.
The stream also needs to create the new column for the channel information.
- In the model, double-click
Latest Package All Channels
to open its settings tab. - To create the output multiplier, expand the Multipliers and Filters section and set:
- Output Multiplier:
find_channels.CHANNEL
- Output Multiplier:
- To create a new column for the channel data, expand the Attributes section. Click Error rendering macro 'excerpt-include' : No link could be created for '_add_object'.to add a new attribute. PhixFlow opens a new attribute settings tab. Set:
- Name:
Channel
- Expression:
_outputMultiplier
- Name:
- Click Apply and Close to save and close the settings tab.
- In the model toolbar, click Apply to save your model.
You can run the model and view the stream data.
Troubleshooting: What do I do if there is no data.