Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 6 Next »

By the end of this chapter you will be able to:

You will also learn how to limit a pipe to read a maximum number of data records.

In this exercise you will select the latest (most recent) sales record for each customer, from the set of combined updates you created in the previous exercise.

Create an aggregate stream

  1. In the model toolbar, find 
    Error rendering macro 'excerpt-include' : No link could be created for '_add_stream_aggregate'.
      and drag it into the model.
  2. Set the stream Name to Latest Package Update.
  3. Add a pipe from Combined Updates:
    1. Hover your mouse pointer over the stream Combined Updates.
    2. In the context toolbar, click 
      Error rendering macro 'excerpt-include' : No link could be created for '_add_pipe'.
      .
    3. Click the stream Latest Package Update to connect the pipe.
  4. Drag all attributes from Combined Updates to Latest Package Update.
    1. Hover your mouse pointer over the stream Combined Updates again.
    2. In the context toolbar, click 
      Error rendering macro 'excerpt-include' : No link could be created for '_stream_attributes'.
      . Phixflow lists the data attributes (column titles) for this stream.
    3. Select all the attributes and drag them into the Latest Package Update settings → Aggregate Attributes section.
    4. Click  OK to save and close the Latest Package Updates settings.
    5. Close the attributes pop-up window.
  5. In the Channel Package model toolbar, click  Save.

When you next run this model, the Latest Package Updates stream will pull in the all the data from the Combined Updates stream.

Group and order data on an input pipe

At the moment the data will come into the aggregate stream in any order. You are going to sort the data as it comes through the input pipe to the aggregate stream.

  • Group the data, so that all the records with the same Customer Ref number are adjacent in the table.
  • Sort the data by date, newest to oldest, using Sales Date. This means that the first record delivered by the input pipe, for each Customer Ref, is the most recent for that customer.
  1. Double click on the pipe from Combined Updates
  2. Go to the Sort/Group section
    1. Press  - Show Stream Attributes
    2. Drag in CustomerRef
    3. Drag in SalesDate
    4. You will leave CustomerRef as a grouping attribute
    5. You will update SalesDate to be a ordering attribute:
      1. Double click on SalesDate in the Sort/Group attributes list
      2. Set the Sort Direction to descending: Z-A
      3. Untick the Group flag
      4. Press 
    6. Set Maximum Number of Records per Group to: 1

This setting means that for each key value – in this case for each value of CustomerRef – only the first record will be returned; since you have also ordered by SalesDate descending, this will be the latest (most recent) record for each CustomerRef

  • Press  in the main pipe configuration form to save your changes

Test your new Stream

  • Run analysis on your new stream
  • Make sure that for all customers the most recent sales update has been selected
  • No labels