Tags
Language
Tags
May 2024
Su Mo Tu We Th Fr Sa
28 29 30 1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31 1

Apache Druid for Data Engineers (Hands-On)

Posted By: lucky_aut
Apache Druid for Data Engineers (Hands-On)

Apache Druid for Data Engineers (Hands-On)
Published 1/2024
Duration: 2h30m | .MP4 1280x720, 30 fps(r) | AAC, 44100 Hz, 2ch | 909 MB
Genre: eLearning | Language: English

Learn everything about Apache Druid a modern real-time analytics database.

What you'll learn
Understanding of basic architecture of Apache Druid
Installing and Configuring Apache Druid
Apache Druid Design, Ingestion, Data management, Querying
Frequently asked Questions
Requirements
Basic knowledge of SQL is appreciated but if you don't have any knowledge on Database management its fine.
Linux as Operating System Required
8 GB RAM is required
Description
Druid is a high-performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.
Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics ("
OLAP
" queries) on large data sets. Most often, Druid powers use cases where real-time ingestion, fast query performance, and high uptime are important.
Druid is commonly used as the database backend for GUIs of analytical applications, or for highly-concurrent APIs that need fast aggregations. Druid works best with event-oriented data.
One of the most valuable technology skills is the ability to Real-time analytics databases handle analytics on large amounts of data by optimizing resources to enable compute-heavy workloads, and this course is specifically designed to bring you up to speed on one of the best technologies for this task,
Apache Duid
! The top technology companies like
Google, Facebook, Netflix, Airbnb, Amazon, NASA,
and more are all using
Apache Druid
!
Apache Druid Essentials: Unleashing Real-time Analytics and Scalable Data Exploration
Unlock the potential of real-time analytics and scalable data exploration with our comprehensive Apache Druid Essentials course. In this dynamic program, participants will delve into the world of Apache Druid, an open-source, high-performance analytics database designed for fast query response and seamless scalability.
Key Learning Objectives:
Introduction to Course
Real-time Analytics Databases
What is Apache Druid?
Key Features of Druid
Technology
Use cases
When to use Druid
When not to use Druid
List of Company using Apache Druid
Installation of Apache Druid
Start up Druid services
Open the web console
Load data
Query data
Overview of the Druid Web Console
Architecture of Druid
Druid Servers
External Dependencies
Storage Design
Datasources and Segments
Segment Identifiers
Segments
Introduction to Segments
Segment File Structure
Data Loading in Druid
Load Data from Local Files
Load Data from URI
Load Data from Kafka (Prerequisite Introduction to Kafka)
Installing Single Node Kafka Cluster
Change the following to avoid Zookeeper Issue conflict
Load Data from Kafka
Query Data Explain Plan
Aggregate data with rollup
Frequently Asked Questions
Who this course is for:
Database Engineer, Big Data Engineer, Data Engineer, Data Analyst, Data Scientist, Machine Learning Engineer


More Info