Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

It is often useful to merge two or more streams of data to add additional attributes for records from another source and / or to identify duplicates and missing records. The examples below show how to do this manually to understand the process as well as using a built in feature of Centerview CenterView that provides some useful additional attributes on the merged data.

...

  1. Add a new merge stream to hold the merged data. 
  2. Connect the source streams to the merge stream using pull pipes e.g in and in2
  3. Drag all the non-key attributes from the source streams into the destination stream i.e all attributes except the one(s) that will be used to define the key of the records.
  4. Hover over the destination stream and select Quick Configure Options->Configure Input
  5. For each of the source streams displayed, select Show the List of Attributes and drag the key attribute into the box below
  6. Save the changes
  7. Now add a new attribute to the destination stream for the key. In the attribute expression type _key
  8. Run the destination stream. The results will contain one record for each unique key in the source streams, with attributes from all source streams

Image RemovedImage Added

Image RemovedImage RemovedImage Added

Example 2: Merging customers and accounts using the Merge feature

...

  1. Select the source streams by right clicking on them whilst holding down Shift
  2. Right click on the model background and select Merge Selected Streams
  3. For each of the source streams displayed, select Show the List of Attributes and drag the key attribute into the box below as in example 1
  4. Save the changes and select All attributes in the automatic stream configuration screen
  5. The stream will now be configured with additional attributes as shown below:

 

  • gridRowColor = Highlights rows in red if there are multiple records / duplicates
  • in_Count & in_2_Count = Reports the number of records matching they key in each of the source streams
  • in_HasDate & in_2_HasData = Reports whether or not a matching record was found in each stream

...

Model Ref: 001

Filter by label (Content by label)
showLabelsfalse
max5
spacesHELP60
showSpacefalse
sortmodified
reversetrue
typepage
cqllabel = "kb-how-to-article" and type = "page" and space = "HELP60"
labelskb-how-to-article

...