Large Joins

clyde772
Communicator

Hey Splunkers,

I am trying to join (or look up) two large data sets against each other.

For example, transaction data against 20 million customer records, so every transaction has to be looked up against those 20 million records.

I have tried many ways to tackle this but haven't found a good solution. Lookups didn't work because the customer info file comes to about 2 GB. For Splunk to be a big data engine, this seems to be a limitation.

I appreciate your expert advice. Cheers!


Paolo_Prigione
Builder

Just an idea: you could sort your lookup file and split it into several smaller files, then apply multiple lookups in the search.
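A rough sketch of the sort-and-split step, in Python, in case it helps. The function name `split_lookup`, the `customer_id` column, and the `customers_partNNN.csv` file naming are all made up for illustration, and it assumes the whole file fits in memory for sorting, which may not hold for a 2 GB file on a small machine.

```python
import csv
import os


def split_lookup(src_path, out_dir, key_field, rows_per_file):
    """Sort a CSV lookup by key_field and write it out in fixed-size chunks.

    Returns the list of chunk file paths, in key order.
    Assumption: the file is small enough to sort in memory.
    """
    with open(src_path, newline="") as f:
        reader = csv.DictReader(f)
        fieldnames = reader.fieldnames
        # Keys are compared as strings, so the order is lexicographic.
        rows = sorted(reader, key=lambda r: r[key_field])

    os.makedirs(out_dir, exist_ok=True)
    paths = []
    for start in range(0, len(rows), rows_per_file):
        # Hypothetical naming scheme: customers_part000.csv, customers_part001.csv, ...
        path = os.path.join(out_dir, f"customers_part{len(paths):03d}.csv")
        with open(path, "w", newline="") as out:
            writer = csv.DictWriter(out, fieldnames=fieldnames)
            writer.writeheader()  # every chunk keeps the original header row
            writer.writerows(rows[start:start + rows_per_file])
        paths.append(path)
    return paths
```

Each resulting file would then be set up as its own lookup, and the search would try them one after another.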

Otherwise, you could load your customer data into a Redis cache, then use the Redis lookup app to match it to transactions. The same approach might work with MySQL and the MySQL app, though it would be more complex.
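To give an idea of what the cache route looks like, here is a minimal Python sketch of loading customer records into a key-value store and matching a transaction against it. Everything named here is an assumption: `InMemoryStore` is a stand-in so the example runs without a server, the `customer:` key prefix and `customer_id` field are invented, and the Redis lookup app itself does the matching inside Splunk in its own way. A redis-py client exposes `hset(name, mapping=...)` and `hgetall(name)` with the same shape, though it returns bytes unless created with `decode_responses=True`.

```python
class InMemoryStore:
    """Hypothetical stand-in exposing the two hash calls used below."""

    def __init__(self):
        self._data = {}

    def hset(self, name, mapping):
        # Merge the given fields into the hash stored under `name`.
        self._data.setdefault(name, {}).update(mapping)

    def hgetall(self, name):
        # Return a copy of the hash, or an empty dict if the key is missing.
        return dict(self._data.get(name, {}))


def load_customers(store, customers, key_field="customer_id", prefix="customer:"):
    """Store each customer record as a hash keyed by its ID; return the count."""
    count = 0
    for record in customers:
        store.hset(prefix + record[key_field], mapping=record)
        count += 1
    return count


def enrich(store, transaction, key_field="customer_id", prefix="customer:"):
    """Return the transaction merged with the cached customer fields, if any."""
    customer = store.hgetall(prefix + transaction[key_field])
    # Transaction fields win if both sides define the same field name.
    return {**customer, **transaction}
```

The point of the design is that each transaction costs one keyed read, so the 20 million customer records never have to be loaded as a single lookup file.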

Let us know if you solved the problem!


lguinn2
Legend

More information is needed! Can you give a (sanitized) example of your data and the search that you need to do?
