Amazon S3 Ingest Bucket |
Object storage bucket for staging raw data before loading into a data lake. |
Amazon Web Services (AWS) |
A cloud platform that provides scalable computing, storage, analytics, and machine learning services. |
AWS Athena |
Serverless interactive query service to analyze data in Amazon S3 using standard SQL. |
AWS Big Data Analytics |
Suite of services for processing and analyzing big data across storage, compute, and databases. |
AWS Data Lake Formation |
Service to set up and manage data lakes with indexing, security, and data governance. |
AWS Data Warehouse |
Fully-managed data warehousing service for analytics using standard SQL. |
AWS Glue Crawler |
Discovers data via classifiers and populates the AWS Glue Data Catalog. |
AWS Glue Data Catalog |
Central metadata store on AWS for datasets, schemas, and mappings. |
AWS Lambda |
Serverless compute to run code without managing infrastructure. |
AWS QuickSight |
Business intelligence service for easy visualizations and dashboards. |
AWS RDS |
Amazon Relational Database Service is a managed relational database service that handles database administration tasks like backup, patching, failure detection, and recovery. Including RDS MySQL, a managed relational database optimized for online transaction processing. |
AWS Redshift |
Petabyte-scale data warehouse for analytics and business intelligence. |
JDBC |
JDBC (Java Database Connectivity) is a standard API for connecting to traditional relational databases from Java. The JDBC was released as part of the Java Development Kit (JDK) in version 1.1 in 1997 and has since been part of every Java edition. |