Astro glossary

The following table contains definitions for all of the key terms and concepts you'll come across on Astro. For a glossary of Apache Airflow terms, see Airflow glossary.

| Term | Definition |
| --- | --- |
| API token | An API token is an alphanumeric token that grants programmatic access to Astro for automated workflows. An API token can be scoped to an Organization or a Workspace. |
| Astro | Astro is a SaaS application that provides fully managed Apache Airflow environments for teams of all sizes. To get started, start a trial. |
| Astro alerts | Astro alerts are customizable notifications that can alert teams of disruptions on Astro Deployments. Astro alerts are configured in the Astro UI and integrate with tools like Slack and PagerDuty. Unlike Airflow alerts, Astro alerts require no changes to DAG code. |
| Astro CLI | The Astro CLI is an open source command line interface built by Astronomer. You can use the Astro CLI to run Apache Airflow locally or interact programmatically with Astronomer products. |
| Astro Hosted | Astro Hosted is a distribution of Astro where the infrastructure required to run Airflow is fully managed by Astronomer in Astronomer's cloud. It is recommended for most teams running Airflow. |
| Astro Hybrid | Astro Hybrid is a distribution of Astro where the infrastructure required to run Airflow is fully managed by Astronomer but hosted in your cloud and paid for by your organization. |
| Astro Hypervisor | The Astro Hypervisor is an Astronomer-managed component of the Astro platform that facilitates operating and optimizing your Deployments. |
| Astro project | An Astro project contains the set of files necessary to run Airflow either locally or on Astro, including dedicated folders for DAG files, plugins, and dependencies. Create a new Astro project by running astro dev init with the Astro CLI. |
| Astro Runtime | Astro Runtime is a Docker image for running Airflow that's built and maintained by Astronomer. Every Astro project and Deployment is configured with a version of Astro Runtime. Compared to the Apache Airflow Docker image, Astro Runtime additionally includes smart Airflow configurations, pre-installed packages, a security manager for role-based access control (RBAC), and expedited vulnerability fixes. |
| Astro UI | The Astro UI is the user interface for Astro. From the Astro UI, users can manage Organizations, Workspaces, and Deployments, as well as write DAGs in the Astro Cloud IDE. The Astro UI is available at https://cloud.astronomer.io. |
| Astronomer Software | Astronomer Software is Astronomer's commercial software offering for running Apache Airflow on Kubernetes in a private cloud or air-gapped environment. The infrastructure required to run the service is hosted and managed entirely by your organization instead of by Astronomer. It is only recommended for extremely security-conscious organizations running at a unique level of scale. |
| Cell | A cell is a UI-based abstraction of an Airflow task that serves as the building block for DAGs written with the Astro Cloud IDE. Cells can run Python code, SQL code, or an Airflow operator. |
| Cluster | An Astro cluster is a Kubernetes cluster that hosts the infrastructure required to run Deployments. Clusters can be standard or dedicated. |
| Control Plane | The Astro control plane is Astro's interface for managing Airflow environments running in the cloud. You interact with the control plane through the Astro UI or the Astro CLI. It provides end-to-end visibility, control, and management of users, teams, Workspaces, Deployments, metrics, and logs. |
| DAG Bundle Version | A DAG Bundle Version is a unique timestamp that the Astro CLI generates after a user completes a DAG-only deploy. It identifies the version of code that Astro is running. |
| Data lineage | Data lineage is the concept of tracking and observing data flowing through a data pipeline. Data lineage can be used to understand data sources, troubleshoot run failures, manage personally identifiable information (PII), and ensure compliance with data regulations. Astro includes data lineage features in the Astro UI. |
| Data plane | The data plane is a component of Astro Hybrid. It is a single-tenant foundation in a customer's cloud that runs multiple Airflow environments across clusters, regions, and clouds. |
| Deploy | A deploy is the process of pushing code to a Deployment on Astro. A code push can include either a complete Astro project or only DAG code. |
| Deployment | An Astro Deployment is an Airflow environment that is powered by all core Airflow components, including schedulers and workers. Astro users can deploy DAGs to a Deployment, and can have one or more Deployments within a Workspace. |
| Deployment file | A Deployment file is a YAML-formatted snapshot of a Deployment's current configuration. In addition to configurations, a Deployment file includes a Deployment's metadata and a timestamp to associate the configuration with a specific Deployment. Deployment files are used to manage Deployments as code. |
| Deployment template file | A Deployment template file is a YAML configuration that can be used to create or update a Deployment. Unlike a Deployment file, a Deployment template file doesn't contain metadata and can be applied to any new or existing Deployment. Deployment template files are used to manage Deployments as code. |
| Environment variable | An environment variable is a key-value pair that defines a configuration or value for a Deployment. |
| High availability (HA) | High availability (HA) is a feature on Astro for ensuring that the components of a Deployment continue to run even in the event of an outage. On Astro Hosted, HA can be enabled or disabled per Deployment. |
| Namespace | A namespace is a Kubernetes component that isolates Airflow environments within a Kubernetes cluster. Each Deployment uses a separate namespace to isolate resources. |
| Organization | An Organization is the highest management level on Astro. An Organization contains Workspaces, which are collections of Deployments, or Airflow environments, that are typically owned by a single team. |
| Pipeline | A pipeline is a notebook-style configuration for DAGs that is available in the Astro Cloud IDE. A pipeline can include traditional Airflow operators, as well as Python and SQL functions that are executed through the Astro Python SDK. |
| Standard cluster | A standard cluster is a cluster type available on Astro Hosted. It is multi-tenant and runs Deployments from multiple Organizations. |
| Worker Node | A worker node is a node used to run Airflow worker Pods, which are responsible for executing Airflow tasks in Deployments. |
| Worker Node Pool | A worker node pool is a Kubernetes node pool that's used to run worker nodes of the same type on Astro Hybrid. Each worker node pool has a worker type and a maximum node count. |
| Worker Queue | A worker queue is a set of configurations that apply to a group of workers in a Deployment running the Celery executor. Within a worker queue, users can configure worker type, worker size, and autoscaling behavior. |
| Worker Type | The worker type defines the quantity of resources a Celery worker can consume. On Astro Hosted, worker types are defined in terms of Astronomer units (A5, A10, A20). Each Ax type has a different configuration for memory, CPU, and default number of concurrent tasks per Celery worker. On Astro Hybrid, worker types are defined by the cloud node instance type that hosts Celery workers and KubernetesPodOperator Pods. |
| Workspace | A Workspace is a collection of Deployments that can be accessed by a specific group of users. Workspaces can be used to group Deployments that share a business use case or environment trait. |
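
Several of the terms above nest inside one another: an Organization contains Workspaces, a Workspace groups Deployments, and a Deployment can carry environment variables. The following Python sketch models only that containment so the relationships are easier to see. It is an illustration under stated assumptions, not Astro's data model or API: the class names, fields, the `all_deployments` helper, and the sample names such as `example-org` and `etl-prod` are all hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Deployment:
    """One Airflow environment (hypothetical model, not an Astro API object)."""
    name: str
    # Environment variables are key-value pairs scoped to a Deployment.
    env_vars: dict[str, str] = field(default_factory=dict)


@dataclass
class Workspace:
    """A collection of Deployments shared by a group of users."""
    name: str
    deployments: list[Deployment] = field(default_factory=list)


@dataclass
class Organization:
    """The highest management level; it contains Workspaces."""
    name: str
    workspaces: list[Workspace] = field(default_factory=list)

    def all_deployments(self) -> list[Deployment]:
        # Flatten Organization -> Workspace -> Deployment into one list.
        return [d for w in self.workspaces for d in w.deployments]


# Sample data invented for illustration only.
org = Organization(
    name="example-org",
    workspaces=[
        Workspace(
            name="data-eng",
            deployments=[
                Deployment("etl-prod", {"ENVIRONMENT": "prod"}),
                Deployment("etl-dev"),
            ],
        ),
        Workspace(name="ml", deployments=[Deployment("training")]),
    ],
)

names = [d.name for d in org.all_deployments()]
```

Here `names` lists every Deployment in the Organization regardless of which Workspace it belongs to, which mirrors how a Workspace is a grouping layer rather than a separate runtime environment.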
