Category Archives: Data Analytics

What is the future of Machine Learning (ML)

I owe everyone an update, but for a short time, I am very focused on the AI and ML space.  There will be more to come. The future of machine learning is an exciting and rapidly evolving field that holds great promise for a wide range of industries and applications. In recent years, we have… Read More »

What is Hadoop?

What is Hadoop? The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.  It is designed to scale from single servers to thousands of machines, each offering local computation and storage.  Rather than rely on hardware to deliver high… Read More »

What is Elasticsearch?

What is Elasticsearch? Elasticsearch is an open-source, RESTful, distributed search and analytics engine built on Apache Lucene.  Since its release in 2010, Elasticsearch has quickly become the most popular search engine and is commonly used for log analytics, full-text search, security intelligence, business analytics, and operational intelligence use cases. Why do you need Elastisearch? Customers… Read More »

Splunk 101

Welcome back to my Splunk series.  Let’s continue our journey with Splunk. Splunk 101 What data can Splunk ingest? First, let us take a view of Splunk at 1000 feet.                 One thing that Splunk strives for is can ingest any data.  Splunk software collects and indexes data… Read More »

What is Splunk?

I have decided to start writing again.  I will start with the three leading big data platforms and add more as time allows.  As you can see by the title, we will dig into Splunk today. What is Splunk? Splunk is a software technology used for monitoring, searching, analyzing, and visualizing machine-generated data in real… Read More »

Why Virtualize Splunk?

Why should you virtualize Splunk? I get asked that question all the time.  So let us first look at a typical Splunk installation on bare metal servers. What information do we even need to collect to size a Splunk? We need to start with the following questions: How much data are you going in ingest… Read More »