I'd like to index a directory of 50,000 gzip files. The files range in size from 1 KB to 5 MB. Can Splunk monitor these files without first unpacking the gzips?

asked 29 Jan '10, 17:47

hulahoop ♦


2 Answers:

The good news: yes, Splunk can index gzip files as is. The bad news: Splunk monitors these files one at a time instead of in parallel. Because the uncompressed size of a gzip file cannot be predicted, Splunk processes these files in sequence for better control of disk allocation. For performance, that is not ideal with 50,000 files, so please consider uncompressing them before Splunk monitors them; that lets you take advantage of Splunk's multi-threaded file monitoring.
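If you go the uncompress-first route, the step can be scripted outside Splunk. What follows is only a minimal sketch, not anything Splunk ships: the function names `gunzip_one` and `gunzip_tree`, the directory arguments, and the worker count are all hypothetical, and it assumes every file sits directly in one directory and ends in `.gz`.

```python
import gzip
import shutil
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def gunzip_one(src: Path, dest_dir: Path) -> Path:
    """Decompress one .gz file into dest_dir and return the output path."""
    dest = dest_dir / src.name[:-3]  # drop the trailing ".gz"
    with gzip.open(src, "rb") as fin, open(dest, "wb") as fout:
        shutil.copyfileobj(fin, fout)  # stream in chunks so memory use stays flat
    return dest


def gunzip_tree(src_dir: str, dest_dir: str, workers: int = 4) -> list:
    """Decompress every *.gz file directly under src_dir into dest_dir.

    Hypothetical helper: the directory layout and worker count are assumptions.
    """
    out = Path(dest_dir)
    out.mkdir(parents=True, exist_ok=True)
    sources = sorted(Path(src_dir).glob("*.gz"))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as `sources`
        return list(pool.map(lambda p: gunzip_one(p, out), sources))
```

You would then point the Splunk monitor input at the output directory rather than at the directory of gzips. A process pool could be swapped in for the thread pool if decompression turns out to be CPU-bound on your machine.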

answered 29 Jan '10, 17:52

hulahoop ♦

edited 29 Jan '10, 23:14

I think we're actually a bit slower than uncompressing the files first (I'm unsure of the details), but it's not far off. Mostly, uncompressing that much data with the zlib algorithm just takes a lot of CPU.
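To get a rough feel for that CPU cost on your own hardware, you can time a decompression directly. This is only a sketch with a made-up payload: `time_gunzip` is a hypothetical helper, and the sample is repetitive synthetic text of about 5 MB (the largest file size mentioned in the question), so real logs will compress and decompress differently.

```python
import gzip
import time


def time_gunzip(payload: bytes) -> tuple:
    """Return (CPU seconds spent, decompressed byte count) for one gzip payload."""
    start = time.process_time()
    data = gzip.decompress(payload)
    return time.process_time() - start, len(data)


# Hypothetical sample: about 5 MB of repetitive, log-like text.
sample = b"2010-01-29 17:47:00 INFO example event\n" * 131072
cpu_seconds, size = time_gunzip(gzip.compress(sample))
print(f"{size} bytes decompressed in {cpu_seconds:.4f}s of CPU time")
```

Multiplying the per-file figure by the number of files gives only a ballpark, since it ignores disk I/O and whatever else the indexer is doing.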

answered 17 Apr '10, 08:12

jrodman ♦

Asked: 29 Jan '10, 17:47

Seen: 1,645 times

Last updated: 17 Apr '10, 08:12

Copyright © 2005-2012 Splunk, Inc. All rights reserved.