The Logs ingested to ELK is the flaws.cloud AWS CloudTrail dataset to identify unusual patterns and detect potential security threats.

CloudTrail Log Dataset Summary

The Log Dataset used in this lab is the Public dataset of Cloudtrail logs from flaws.cloud which can be accessed Here.

You can Read through the blog post Here. md5: 4f0481c6be700ffc7610dbbf8cee5578 Size: ~240MB of log data



Unfortunately, the File upload restriction to ELK is 100MB, so the FlawsReadyToIngest.json file can’t be uploaded. So, therefore, there is a need to split the data into multi-files for easy upload. A Python program has been created to separate the FlawsReadyToIngest.json into chunks of 50MB files.

Screenshot 2024-09-29 at 1.14.59 AM.png

# To execute the code, run the command 
python3 split_json.py -f -r ./FlawsReadyToIngest.json -w ./flaws_split_files/

Ingest the CloudTrail Log Files to ELK