PhixFlow Help

Enriching Data Sets

By the end of this chapter you will be able to:

  • Create Lookup Pipes
  • Enrich a data set using a Lookup Pipe

Copy data from merge

First, you will create a new Stream to copy over all data from your merge:

  • Hover over the Stream Customer All Details and press Add a new Stream
  • Call this Stream: Customer Details with Region
  • Copy all attributes into your new Stream:
    • Hover over Customer All Details and in the hover menu press Show Stream Attributes
    • Select all attributes in the list:
      • Select the top attribute
      • Hold down Shift and select the bottom attribute in the list
    • Drag the selected attributes onto the Stream Customer Details with Region
  • Run your new Stream and view its data to verify that the data has been copied from the merge Stream
  • Remember to save your model layout

Here you have copied data from a merge Stream into a new Stream, but this method works for any type of input Stream. Copying data into a new Stream is a common starting point for a modelling step.

Create lookup into reference data

Now you will add a Lookup Pipe from RegionsByCounty to enrich the data in Customer Details with Region by finding the region for each customer address:
  • Hover over RegionsByCounty and press Add a new Connector
  • Move the pointer over Customer Details with Region and click to select it; this attaches the end of the pipe to the selected Stream
  • In the Pipe Setup form that pops up:
    • Enter the Name: region
    • Type: Look-up
    • Data To Read: leave as the default – Latest
    • Press 
    • In the matching attributes form that pops up create a match:
      • Drag the attribute County from the left-hand box into the list of matching attributes
      • Drag the attribute in_County from the right-hand box into the list of matching attributes, dropping it on top of the County entry you created above
        • This means that the lookup will match records in the reference data (RegionsByCounty) whose County equals in_County in the output data (Customer Details with Region)
      • Press 
    • A pipe will appear linking RegionsByCounty to Customer Details with Region. Note that the pipe is dashed, which indicates that it is a Lookup Pipe
    • When adding Lookup Pipes, remember that the arrow points in the direction that the data flows; here you are getting data from RegionsByCounty to add to Customer Details with Region
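
The matching rule set up above (County in the reference data equals in_County in the output data) can be pictured with a small Python sketch. This is only an analogy in plain Python, not PhixFlow's own expression language or implementation; the sample rows and the helper names build_lookup and find_match are hypothetical.

```python
# Illustrative analogy only: plain Python, not PhixFlow syntax.
# The sample reference rows below are invented for this sketch.
regions_by_county = [
    {"County": "Kent", "Region": "South East"},
    {"County": "Devon", "Region": "South West"},
]


def build_lookup(reference_rows, key):
    """Index the reference rows by the matching attribute (here: County)."""
    return {row[key]: row for row in reference_rows}


def find_match(index, in_county):
    """Return the reference row whose County equals in_county, or None."""
    return index.get(in_county)


index = build_lookup(regions_by_county, "County")
match = find_match(index, "Kent")        # a matching reference row
no_match = find_match(index, "Essex")    # None: no such County in the reference data
```

As in the model, the data flows from the reference data towards the record being built: the output record supplies in_County, and the reference data supplies the matching row.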

Enrich a data set using a Lookup Pipe

Now you can add the region from RegionsByCounty to Customer Details with Region:

  • Hover over RegionsByCounty and in the hover menu press Show Stream Attributes
  • The list of Stream attributes will pop up
  • Drag the attribute Region from RegionsByCounty onto Customer Details with Region
  • Run Analysis on Customer Details with Region
  • Look at the data in this Stream and check that the Region attribute has been populated with the value looked up from RegionsByCounty
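
The final check, that Region has been populated for each record, can be sketched in the same way. Again this is a plain Python analogy rather than anything PhixFlow runs; the customer rows, the CustomerName attribute and the function names are assumptions made for illustration. Only County, in_County and Region come from the steps above.

```python
# Illustrative analogy only: plain Python, not PhixFlow syntax.
# All sample data is invented; CustomerName is a hypothetical attribute.
regions_by_county = [
    {"County": "Kent", "Region": "South East"},
    {"County": "Devon", "Region": "South West"},
]

customers = [
    {"CustomerName": "Customer 1", "in_County": "Kent"},
    {"CustomerName": "Customer 2", "in_County": "Devon"},
    {"CustomerName": "Customer 3", "in_County": "Essex"},  # no reference row
]


def enrich_with_region(customer_rows, reference_rows):
    """Copy each customer row and add the Region looked up by county."""
    region_by_county = {r["County"]: r["Region"] for r in reference_rows}
    return [
        {**row, "Region": region_by_county.get(row["in_County"])}
        for row in customer_rows
    ]


def rows_missing_region(rows):
    """Return the rows where the lookup found no Region."""
    return [row for row in rows if row["Region"] is None]


enriched = enrich_with_region(customers, regions_by_county)
missing = rows_missing_region(enriched)
```

Rows that end up in missing correspond to customers whose county has no entry in the reference data, which is the kind of gap worth looking for when you inspect the Stream data.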
