PhixFlow Help


It is often useful to merge two or more streams of data, either to add attributes to records from another source or to identify duplicate and missing records. The examples below show how to do this manually, to illustrate the process, and how to do it using a built-in PhixFlow feature that adds some useful extra attributes to the merged data.

Example 1: Manually merging customers and accounts

In this first example, one stream contains customer names and references, and a second stream contains address details from accounts. The data can be merged because the Customer reference and the Account ID are the same identifier.

Step-by-step guide

 

  1. Add a new merge stream to hold the merged data. 
  2. Connect the source streams to the merge stream using pull pipes, e.g. in and in2
  3. Drag all the non-key attributes from the source streams into the destination stream, i.e. all attributes except the one(s) that will be used to define the key of the records
  4. Hover over the destination stream and select Quick Configure Options->Configure Input
  5. For each of the source streams displayed, select Show the List of Attributes and drag the key attribute into the box below
  6. Save the changes
  7. Add a new attribute to the destination stream to hold the key. In the attribute expression, type _key
  8. Run the destination stream. The results will contain one record for each unique key in the source streams, with attributes from all source streams
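
The effect of the steps above can be illustrated outside PhixFlow with a minimal Python sketch. This is not PhixFlow's implementation; the sample records, the field names (CustomerRef, AccountID, Name, Address) and the function merge_by_key are hypothetical and chosen only for illustration. It shows how records from two sources are grouped by a shared key so that the result holds one record per unique key, carrying the non-key attributes of every source.

```python
from collections import defaultdict

# Hypothetical sample data standing in for the two source streams.
customers = [
    {"CustomerRef": "C1", "Name": "Alice"},
    {"CustomerRef": "C2", "Name": "Bob"},
]
accounts = [
    {"AccountID": "C1", "Address": "1 High Street"},
    {"AccountID": "C3", "Address": "9 Mill Lane"},
]


def merge_by_key(inputs):
    """Merge several record lists on a key.

    inputs is a list of (records, key_field) pairs. The result has one
    dict per unique key, holding the non-key attributes of every source.
    Assumption: if a key occurs more than once, later values overwrite
    earlier ones. This is a simplification for the sketch only.
    """
    merged = defaultdict(dict)
    for records, key_field in inputs:
        for record in records:
            attrs = merged[record[key_field]]  # creates the entry if new
            for name, value in record.items():
                if name != key_field:
                    attrs[name] = value
    return [{"Key": key, **attrs} for key, attrs in sorted(merged.items())]


result = merge_by_key([(customers, "CustomerRef"), (accounts, "AccountID")])
for row in result:
    print(row)
```

In this sketch, key C1 appears in both sources and so carries both a Name and an Address, while C2 has no Address and C3 has no Name.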

Example 2: Merging customers and accounts using the Merge feature

In this example, additional attributes are created that identify customers whose address couldn't be found, addresses with no corresponding customer, and duplicate records, i.e. customers with multiple records in the address file.

Step-by-step guide

 

  1. Select the source streams by right-clicking on them whilst holding down Shift
  2. Right-click on the model background and select Merge Selected Streams
  3. For each of the source streams displayed, select Show the List of Attributes and drag the key attribute into the box below, as in Example 1
  4. Save the changes and select All attributes in the automatic stream configuration screen
  5. The stream will now be configured with additional attributes as shown below:

 

gridRowColor = Highlights rows in red if there are multiple records / duplicates

in_Count & in_2_Count = Reports the number of records matching the key in each of the source streams

in_HasData & in_2_HasData = Reports whether or not a matching record was found in each stream
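
To make the meaning of these attributes concrete, here is a minimal Python sketch that derives equivalent values for two inputs. It is an illustration under stated assumptions, not how PhixFlow computes them: the function merge_summary, the sample data and the field names CustomerRef and AccountID are hypothetical, and representing gridRowColor as the string "red" or None is an assumption made for this sketch.

```python
from collections import Counter

# Hypothetical sample data; the address source has a duplicate for C1.
customers = [
    {"CustomerRef": "C1", "Name": "Alice"},
    {"CustomerRef": "C2", "Name": "Bob"},
]
accounts = [
    {"AccountID": "C1", "Address": "1 High Street"},
    {"AccountID": "C1", "Address": "2 Low Road"},
    {"AccountID": "C3", "Address": "9 Mill Lane"},
]


def merge_summary(in_records, in_key, in_2_records, in_2_key):
    """Return one summary row per unique key across both inputs."""
    in_counts = Counter(r[in_key] for r in in_records)
    in_2_counts = Counter(r[in_2_key] for r in in_2_records)
    rows = []
    for key in sorted(set(in_counts) | set(in_2_counts)):
        in_count = in_counts[key]      # Counter gives 0 for a missing key
        in_2_count = in_2_counts[key]
        rows.append({
            "Key": key,
            "in_Count": in_count,
            "in_2_Count": in_2_count,
            "in_HasData": in_count > 0,
            "in_2_HasData": in_2_count > 0,
            # Assumed representation: flag the row when either input
            # holds more than one record for this key.
            "gridRowColor": "red" if in_count > 1 or in_2_count > 1 else None,
        })
    return rows


summary = merge_summary(customers, "CustomerRef", accounts, "AccountID")
for row in summary:
    print(row)
```

Read this way, a row where in_2_HasData is false is a customer with no address, a row where in_HasData is false is an address with no customer, and a count above 1 marks a duplicate.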
