Elasticsearch, Logstash, and Kibana for centralized logging, search, and visualization

ELK Stack

The ELK Stack (Elasticsearch, Logstash, Kibana) is one of the most widely adopted open-source stacks for centralized log management, full-text search, and data visualization. Elastic also provides Beats, a family of lightweight data shippers; together with ELK they form the broader Elastic Stack.

Overview

Component	Role	Description
Elasticsearch	Search & Storage	Distributed search and analytics engine based on Apache Lucene
Logstash	Data Processing	Server-side data processing pipeline for ingestion and transformation
Kibana	Visualization	Web UI for searching, visualizing, and dashboarding Elasticsearch data
Beats	Data Shipping	Lightweight agents for shipping data from edge machines

Architecture

┌───────────┐   ┌───────────┐   ┌───────────┐
│   App 1   │   │   App 2   │   │   App 3   │
└─────┬─────┘   └─────┬─────┘   └─────┬─────┘
      │               │               │
      ▼               ▼               ▼
┌───────────┐   ┌───────────┐   ┌───────────┐
│ Filebeat  │   │ Filebeat  │   │Metricbeat │
└─────┬─────┘   └─────┬─────┘   └─────┬─────┘
      │               │               │
      └───────────────┼───────────────┘
                      ▼
               ┌─────────────┐
               │  Logstash   │  (Parse, Transform, Enrich)
               └──────┬──────┘
                      ▼
              ┌───────────────┐
              │ Elasticsearch │  (Index, Store, Search)
              └───────┬───────┘
                      ▼
               ┌─────────────┐
               │   Kibana    │  (Visualize, Dashboard, Alert)
               └─────────────┘
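
The middle of this flow can be sketched in plain Python to make the data shapes concrete. This is only an illustration of the parse, enrich, and index steps, not how Logstash itself is implemented: the log format, the `parse_line` and `to_bulk_body` helpers, the `_parsefailure` tag, and the `logs-demo` index name are all assumptions made for the example. The output follows the newline-delimited JSON layout that the Elasticsearch `_bulk` endpoint accepts.

```python
import json
import re

# Assumed log format for this sketch: "<timestamp> <LEVEL> <message>"
LINE_RE = re.compile(r"^(?P<timestamp>\S+)\s+(?P<level>[A-Z]+)\s+(?P<message>.*)$")


def parse_line(line: str, host: str) -> dict:
    """Parse one raw log line into a structured event (the Logstash stage)."""
    text = line.strip()
    match = LINE_RE.match(text)
    if match is None:
        # Keep unparseable lines rather than dropping them; the tag name is hypothetical.
        return {"message": text, "host": host, "tags": ["_parsefailure"]}
    event = match.groupdict()
    event["host"] = host  # enrichment: record which machine shipped the line
    return event


def to_bulk_body(events: list[dict], index: str) -> str:
    """Serialize events as an action line plus a document line per event."""
    lines = []
    for event in events:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(event))
    return "\n".join(lines) + "\n"  # a bulk body must end with a newline


if __name__ == "__main__":
    raw = ["2024-01-01T00:00:00Z ERROR disk full", "not a structured line"]
    events = [parse_line(line, host="app-1") for line in raw]
    print(to_bulk_body(events, index="logs-demo"), end="")
```

In a real deployment this body would be sent to the cluster's `_bulk` endpoint with the `application/x-ndjson` content type; Logstash's Elasticsearch output handles that step for you.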

Learning Path

Read in this order if you're new — each page builds on the previous one.

1. Getting Started

Stand up the full stack with Docker Compose; ship one log line end-to-end

2. Elasticsearch

Indices, mappings, queries, cluster sizing, ILM

3. Logstash

Pipelines, grok patterns, performance tuning

4. Kibana

Discover, Lens, dashboards, alerting

Beats: Data Shippers

Beat	Purpose	Data Source
Filebeat	Log files	Application logs, system logs, container logs
Metricbeat	System metrics	CPU, memory, disk, network, container stats
Packetbeat	Network data	HTTP, DNS, MySQL, Redis protocol analysis
Heartbeat	Uptime monitoring	HTTP, TCP, ICMP health checks
Auditbeat	Audit data	File integrity, system calls, user activity

A working Filebeat config and complete pipeline live in Getting Started.
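
To give a feel for what a log shipper does between polls, here is a conceptual sketch of offset-based tailing: remember how many bytes of a file have been read and only emit complete lines appended since then. Filebeat tracks per-file read offsets in its registry, but this code is not Filebeat's implementation; `read_new_lines` is a hypothetical helper written for this page.

```python
def read_new_lines(path: str, offset: int) -> tuple[list[str], int]:
    """Return the complete lines appended since `offset`, plus the new offset."""
    with open(path, "rb") as f:
        f.seek(offset)
        data = f.read()
    # Consume only up to the last newline so a half-written line
    # is left in place and picked up whole on the next poll.
    end = data.rfind(b"\n") + 1
    lines = data[:end].decode("utf-8", errors="replace").splitlines()
    return lines, offset + end
```

A caller would persist the returned offset between polls, so a restart resumes where it left off instead of re-shipping the whole file. Log rotation and truncation are deliberately ignored here.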

Deployment Patterns

Small (Development / Small Team)

Single-node Elasticsearch
Logstash on the same host
Filebeat on application servers

Medium (Production)

3-node Elasticsearch cluster (all three nodes master-eligible and holding data, so a master can still be elected if one node fails)
Dedicated Logstash instances
Kafka/Redis as buffer between Beats and Logstash
Kibana behind reverse proxy with authentication
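
The node count in a production cluster matters because electing a master requires a majority of master-eligible nodes. The following is a simplified majority-vote model, assuming every master-eligible node has one vote; Elasticsearch's actual voting configuration treats even node counts with more nuance. The function names are hypothetical.

```python
def master_quorum(master_eligible: int) -> int:
    """Votes needed to elect a master: a strict majority of eligible nodes."""
    if master_eligible < 1:
        raise ValueError("need at least one master-eligible node")
    return master_eligible // 2 + 1


def tolerated_failures(master_eligible: int) -> int:
    """How many master-eligible nodes can fail while a majority remains."""
    return master_eligible - master_quorum(master_eligible)


if __name__ == "__main__":
    for nodes in (1, 3, 5):
        print(f"{nodes} master-eligible: quorum={master_quorum(nodes)}, "
              f"tolerates {tolerated_failures(nodes)} failure(s)")
```

Under this model a single master-eligible node tolerates no failures, while three tolerate one, which is why three is the usual minimum for production.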

Large (Enterprise)

Dedicated master, data, ingest, and coordinating nodes
Hot-warm-cold architecture for data lifecycle
Cross-cluster replication for disaster recovery
Kafka as durable message buffer
Multiple Logstash pipelines
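
A hot-warm-cold lifecycle is typically expressed as an ILM policy. The sketch below builds one as a Python dictionary and prints it as JSON, in the shape accepted by the `_ilm/policy/<name>` API. The phase timings, rollover thresholds, and the `build_ilm_policy` helper are illustrative assumptions, not recommendations; tune them to your own retention and volume requirements.

```python
import json


def build_ilm_policy(warm_after: str = "7d",
                     cold_after: str = "30d",
                     delete_after: str = "90d") -> dict:
    """Build an ILM policy body with hot, warm, cold, and delete phases."""
    return {
        "policy": {
            "phases": {
                "hot": {
                    # Roll over to a new index when a shard grows large or the index ages.
                    "actions": {
                        "rollover": {"max_primary_shard_size": "40gb", "max_age": "1d"}
                    }
                },
                "warm": {
                    "min_age": warm_after,
                    # Older indices are no longer written to; merge segments to save resources.
                    "actions": {"forcemerge": {"max_num_segments": 1}},
                },
                "cold": {
                    "min_age": cold_after,
                    # Recover cold indices last after a node restart.
                    "actions": {"set_priority": {"priority": 0}},
                },
                "delete": {
                    "min_age": delete_after,
                    "actions": {"delete": {}},
                },
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(build_ilm_policy(), indent=2))
```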

Best Practices

ELK Stack Guidelines

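Sizing: Set the Elasticsearch JVM heap to no more than 50% of available RAM, capped at about 31GB so the JVM keeps using compressed object pointers; leave the rest for the filesystem cache
Sharding: Aim for primary shards of roughly 20-40GB each; avoid over-sharding, since every shard adds heap and cluster-state overhead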
Index Lifecycle: Use ILM policies to manage hot/warm/cold/delete phases
Security: Enable TLS between nodes and authentication for production
Buffering: Use Kafka or Redis between Beats and Logstash for resilience
Monitoring: Use Elastic's built-in monitoring or Metricbeat to monitor the stack itself
Mapping: Define explicit index mappings instead of relying on dynamic mapping
Retention: Give every ILM policy a delete phase so data past its retention period is removed automatically
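
The sizing and sharding guidelines above reduce to simple arithmetic. The helpers below are a rough sketch of those two rules of thumb; the function names and the 30GB default target (the midpoint of the 20-40GB range) are assumptions for illustration, and real sizing should also account for query load and growth.

```python
import math

MAX_HEAP_GB = 31  # cap from the sizing guideline above


def recommended_heap_gb(ram_gb: float) -> float:
    """Half of the machine's RAM, capped at MAX_HEAP_GB."""
    if ram_gb <= 0:
        raise ValueError("ram_gb must be positive")
    return min(ram_gb / 2, MAX_HEAP_GB)


def primary_shard_count(index_size_gb: float, target_shard_gb: float = 30) -> int:
    """Primary shards needed so each holds about target_shard_gb (at least 1)."""
    if index_size_gb < 0 or target_shard_gb <= 0:
        raise ValueError("sizes must be non-negative and target must be positive")
    return max(1, math.ceil(index_size_gb / target_shard_gb))


if __name__ == "__main__":
    print(recommended_heap_gb(16))    # 16 GB host
    print(recommended_heap_gb(128))   # 128 GB host, heap is capped
    print(primary_shard_count(300))   # 300 GB index at the default target
```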

ELK vs Alternatives

Feature	ELK Stack	Grafana Loki	Datadog	Splunk
Cost	Free (self-hosted)	Free (self-hosted)	Per-GB ingested	Per-GB indexed
Full-text Search	★★★★★	★★☆☆☆	★★★★☆	★★★★★
Log Aggregation	★★★★★	★★★★★	★★★★★	★★★★★
Resource Usage	High	Low	N/A (SaaS)	High
Setup Complexity	Medium	Low	Low (SaaS)	Medium
Scalability	★★★★★	★★★★☆	★★★★★	★★★★★
Visualization	★★★★☆	★★★★★ (Grafana)	★★★★★	★★★★☆
APM Integration	★★★★☆	★★★☆☆	★★★★★	★★★★☆
