This repo contains guidelines and steps for setting up in-house production infrastructure and cloud services of open-source technologies from scratch.
- 
CDC Pipelines - High-performance coordination service for distributed applications:
- Distributed event streaming platform:
- Distributed framework to stream data into and out of Apache kafka:
- Distributed registry to store kafka-payload's schemas:
 
- 
Databases - SQL/RDBMS:
- NoSQL
- Document:
- Key-value:
- Graph:
- Time Series:
- Prometheus (NoSQL)
- Timescale (SQL)
 
 
 
- 
Distributed Workflow Management 
- 
Big Data - Distributed SQL Query Engine on any data storage:
- Distributed & Resilient Data Processing framework:
- SQL on HDFS:
 
- 
Search Engines 
- 
Centralized Logging - Elastic Stack
- Filebeat
- Elasticsearch-Ingest-Pipeline
- Kibana
 
 
- Elastic Stack
- 
Business Intelligence 
- 
Container Orchestration 
- 
Service Discovery, Health Checking & Configuration 
- 
Service Monitoring 
- 
SSL Certs 
- 
Load balancing & Reverse Proxying 
- 
VPN 
- 
Linux 
...
