Reltio Connect

 View Only
  • 1.  When using Kafka, can we aggregate the events Reltio creates into SQS?

    Reltio Employee
    Posted 09-18-2021 14:07
    More specifically, (merge of entities sends out create, change events)?

    #ReltioInsights
    #Kafka
    ​​​

    ------------------------------
    Chris Detzel
    Community Program Manager
    Reltio
    ------------------------------


  • 2.  RE: When using Kafka, can we aggregate the events Reltio creates into SQS?

    Reltio Employee
    Posted 09-18-2021 14:09

    I know that this is a challenge for customers right now because there isn't really a FIFO, a first in, first out necessarily available mechanism in the context of something like SQA. Which means that as you pick those messages up, you may receive them out of sequence.

    This on the outbound side for Reltio, sounds like a very similar problem to the one that we were just talking about the solution to on the inbound side. And really requires that, again, that you're going to buffer for a short period of time until you understand everything you think you need to know before you send that information wherever they need to go next beyond Reltio.

    Make sure you watch the Reltio Community Webinar

     



    ------------------------------
    Matthew Gagan
    ------------------------------



  • 3.  RE: When using Kafka, can we aggregate the events Reltio creates into SQS?

    Reltio Employee
    Posted 09-20-2021 08:33

    On the FIFO, so it's not just about SQS because the events themselves on the platform level are generated asynchronously. And that's due to Reltio's scalable nature of the platform because events are being processed by multiple nodes in parallel at the same time.

    But you can use Kafka specifically. Once you stream from SQS to Kafka, you can create a producer that will aggregate. And you can put storage key, use the timestamp that every event has as a storage key to sort aggregated values. And you can aggregate everything for the last five seconds, for example. In this way, you sort them in Kafka. And as a result, get it sorted. 



    ------------------------------
    Dmitry Blinov
    Sr. Product Manager
    Reltio
    ------------------------------



  • 4.  RE: When using Kafka, can we aggregate the events Reltio creates into SQS?

    Reltio Employee
    Posted 09-20-2021 08:34

    And I oversimplified my response there, but you're right. Reltio is an asynchronous platform. There is no guarantee that any event is going to complete in advance of any other, given the distributed nature of the way those components work internally.

    But that's a great solution also, that you propose there to leverage those timestamps, which will be down to the millisecond level in terms of their detail.



    ------------------------------
    Matthew Gagan
    ------------------------------



  • 5.  RE: When using Kafka, can we aggregate the events Reltio creates into SQS?

    Reltio Employee
    Posted 09-20-2021 08:35
    Would we be able to do this on SQS, the aggregation on the SQS itself? Or Kafka provides that ability that SQS does not have today?


    "We are struggling with that. When a merge happens with an entity, Reltio throws out 15 different events. The merge could happen between three or four organizations and in 15 events. Pushing them all the way to the system back is just too much traffic. We want to aggregate and then pass one single message, giving them a view of what happened, what exactly happened."



    ------------------------------
    Chris Detzel
    Community Program Manager
    Reltio
    ------------------------------



  • 6.  RE: When using Kafka, can we aggregate the events Reltio creates into SQS?

    Reltio Employee
    Posted 09-20-2021 08:37

    Kafka has this capability and SQS does not have this capability today. We will have to use Kafka for that.

    And this is very, very common question from our customers and partners. And sometimes, there is a perception that using FIFO, first in, first out is a solution here. But you need to realize the platform will input the first in not guaranteed that it's first.

    You have to sort by timestamp. And Kafka is the best solution right now. There is best practice I found in things like Stack Overflow, specifically for Kafka, how to create the producers that aggregates and source by timestamp. So that's already known use case for Kafka. And I would suggest you use that, yes.



    ------------------------------
    Dmitry Blinov
    Sr. Product Manager
    Reltio
    ------------------------------



  • 7.  RE: When using Kafka, can we aggregate the events Reltio creates into SQS?

    Reltio Employee
    Posted 09-20-2021 08:38

    It sounds like another opportunity missing from my original list that suggests that even with our native support for things like SQS right now, that a customer may wish pragmatically, to use Kafka as the downstream mechanism. Because now, your producer that is hopping the data from one mechanism to the other, can leverage exactly what it is that Dmitri is explaining to us here.

    Rather, seeing the event stream as a trigger, rather than necessarily as the place to get the answers to all the questions. It sounds like there's potentially a different way of thinking about how to use the event stream.



    ------------------------------
    Matthew Gagan
    ------------------------------