Time well spent

Of the preceding listed tasks, popular opinion from the field (of data scientists) states that up to 60% of the time is spent performing the cleaning and transforming of data, while almost 20% (of the time) is spent on data collection.

The Splunk MLT was designed to extend the Splunk platform functions and provide a guided modeling environment for developers and data scientists.

To this end, Splunk literature indicates the following advantages for using Splunk and the Splunk MLT for machine learning projects:

  • Collect data: Inherently, Splunk is an excellent data collector and indexer and can make collecting data—even big or unformatted datamuch easier. Once setup, Splunk will automatically collect the latest data in real ...

Get Implementing Splunk 7 - Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.