cloud

What is Glue Data Catalog?

The central metadata repository in AWS Glue that stores database and table definitions, schemas, and connection information, serving as a managed Apache Hive Metastore shared across AWS analytics services.

Detailed Explanation

The Glue Data Catalog is the backbone of the AWS analytics ecosystem. Tables defined in the catalog are automatically available to Amazon Athena for SQL queries, Amazon EMR for Spark/Hive jobs, and Redshift Spectrum for federated queries — without any duplication of schema definitions. It supports resource-based policies and integrates with AWS Lake Formation for column-level and row-level security.
AWS GluemetadataAthenaEMRLake Formation