|
v4.3.1 on sles 11.1 the standard whitelist for data source /var/log will produce dupe indexing because by default on sles it rotates out the messages file to another file "messages-YYYYMMDD" and will bzip that on 2nd rotation (aka, delayed compress in logrotate, etc). default whitelist (.log|log$|messages|secure|auth|mesg$|cron$|acpid$|.out) default blacklist (lastlog) so, in my case i think changing whitelist to use ^messages$ would be better, and possibly changing some of the others like .log and log$ |
|
Splunk should not produce duplicate results because of file rotation. In the situation you reference, the CRC will match and the rotated file will be ignored. You shouldn't need to edit the whitelist in the situation you've mentioned. Details can be found here: http://docs.splunk.com/Documentation/Splunk/latest/Data/Howlogfilerotationishandled
actually, because the new file has been bzipped, Splunk will in fact re-index. A better solution really is blacklist gzip and bzip file formats.
(03 Apr '12, 09:51)
gkanapathy ♦
all good info. CRC will indeed skipped non bzip rotated file, but will indeed re-index the bzip's. so i think my Q is solved.
(03 Apr '12, 10:00)
cvajs
|
|
I think it would be better to blacklist bzip and gzip ( i thought gzip's got CRC check but bzip did not? is bz2 in the list of files for bzip? i typically blacklist \.(gz|gzip|bz|bz2|z|zip)$ so to avoid even a CRC check, etc. blacklist for /var/log for my sles 11.1 (lastlog|\.(gz|gzip|bz|bz2|z|zip))$ also, if the file was written to after the rename the CRC check would still match since it looks at only the 1st 256bytes. in this case it seems like events can be missed since Splunk will not see the new file as new, etc.
(03 Apr '12, 10:05)
cvajs
|