PhixFlow Help
3 Combining Data Sets
By the end of this chapter you will be able to:
You will also learn how to use an internal variable, _pipeName.
In this exercise you will produce a single list of updates to customers' Cable TV packages. Customers can update their packages both through a customer care system and in shops. You will bring all these updates together into a single list.
Read in data sets from files
For this exercise you need two files from the zip file you downloaded in 2 Using Excel Templates, step 5. The files contain updates from the two sources: the customer care service and the shop.
<path>\train\inputData\ChannelPackages\CustomerCareUpdates\CustomerCareUpdates.txt
<path>\train\inputData\ChannelPackages\ShopUpdates\ShopUpdates.txt
- Create a new model with the Name
Channel Package Check
. - Add a file collector to your model and load the data:
- From the model toolbar, drag a into the model. File Collector
- PhixFlow displays the settings tab for the new file collector. Set the Name to
Customer Care Updates
. - In the model, hover your mouse pointer over the
Channel Package Check
file collector to display the context toolbar and click . Upload File - Navigate to the directory
<path>\train\inputData\ChannelPackages\CustomerCareUpdates
. - Select the file
CustomerCareUpdates.txt
, click Open, then click . - PhixFlow adds a new stream to the model, called
CustomerCareUpdate
. - In the model, hover over the new
CustomerCareUpdates
stream and click Run Analysis - Check the data is loaded. There should be 8 lines of data, with columns for Customer Ref, Sales Date and Package.
- Now the data is loaded into the stream, set the stream to . Static
- Add another file collector to your model and repeat the process described in step 2. This time:
- Set the file collector Name to
Shop Updates
. - Upload
<path>\train\inputData\ChannelPackages\ShopUpdates\ShopUpdates.txt
. - PhixFlow loads 7 rows of data to this stream.
- Remember to set the steam to . Static
- Set the file collector Name to
Snapshot of the Channel Package Check
model so far:
Combine the data sets
- Create a new stream in your model and populate its attributes.
- Hover over
CustomerCareUpdates
and click Add Stream - In the settings, set the Name to
Combined Updates
. - In the model, show the stream attributes for
CustomerCareUpdates
. - Select all the attributes and drag them into the properties for the
Combined Updates
stream. Drop them into the Attributes section. - Click . OK
- Hover over
- To save your model, in the model toolbar click . Save
- Connect the
ShopUpdates
stream to theCombined Updates
stream.- Hover over
ShopUpdates
and click . Connector Click
Combined Updates
to link the pipe to the stream.When you connect streams, PhixFlow automatically adds a reference to the input pipe name. This appears in the attribute expression. To merge this data successfully, you must update the attribute expressions to remove the
in.
prefix; see step 4 below.
- Hover over
- Fix all the attribute expressions.
- Double-click on
Combined Updates
to open its settings tab. For each attribute: - In the Attributes section, double-click on an attribute to open its settings.
- In the Basic Settings section, edit the Expression to remove
in.
and click OK
- Double-click on
- In the
Combined Updates
stream settings tab, click to save and close. OK - In the model, hover your mouse pointer over
Combined Updates
and click . Run Analysis - When the stream has run, check that the stream has:
- the columns CustomerRef, Sales Date and Package
- 15 rows
- all cells have data.
The stream default view:
Snapshot of the Chanel Package Check model so far:
Record the source for each update
You will now update your settings to record the source for each update, in the set of combined updates. To do this:
- To update the name of the pipe from the
CustomerCareUpdates
stream to theCombined Updates
stream:- In the model, click on the pipe to open its settings tab.
- Set Name to
CC
. - Click to save and close the pipe settings. OK
- Update the name of the pipe from the
ShopUpdates
file collector to theShopUpdates
stream.- In the model, click on the pipe to open its settings tab.
- Set Name to Shop
- Click to save and close pipe settings. OK
- Double-click the stream
Combined Updates
to open its settings tab. - In the Attributes section, click add a new attribute and set: Add
- Name:
Source
. Expression:
if (_pipeName == "CC", "Customer Care" , "Shop" )
- Click to save and close the attribute settings. OK
- Name:
- To make
Source
the first attribute in the stream, drag it to the top of the attributes list. - Click to save and close the stream settings. OK
- Run analysis on
Combined Updates
. - Check the output data set and make sure that the source has been recorded on each record correctly.
Related content
Please let us know if we could improve this page feedback@phixflow.com