Apache Airflow Powered by GlobalSolutions
Apache Airflow
Powered by GS
Apache Airflow is an open-source tool to programmatically author, schedule, and monitor workflows. It is one of the most robust platforms used by Data Engineers for orchestrating workflows or pipelines. You can easily visualize your data pipelines dependencies, progress, logs, code, trigger tasks, and success status.With Airflow, users can author workflows as Directed Acyclic Graphs (DAGs) of tasks.Airflows rich user interface makes it easy to visualize pipelines running in production, monitor progress, and troubleshoot issues when needed. It connects with multiple data sources and can send an alert via email or Slack when a task completes or fails. Airflow is distributed, scalable, and flexible, making it well-suited to handle the orchestration of complex business logic. It is highly scalable and extensible, making it a good choice for automating complex data pipelines.

Airflow is a distributed system that consists of the following components:
Why Subscribe to our offering in AWS Marketplace
How to Access our AMIs from AWS Marketplace
Installation Location
Category Packages Version Used Location
Apache Airflow Server 2.9.1 /home/ubuntu/airflow





How to Connect to the Application (from Local instance):

To access the Apache Airflow web interface from your local machine:


Steps to create a DAG and sample DAG for refence: The sample dag that we created is in the /home/ubuntu/airflow/dag . You can see a python file with the name my_first_dag.py where you can see the first dag. Go on to the airflow console and search for sample_dag and you can see it running state. If you need help on creating new Dags we can help but that will be a separate engagement from us.


Support

Please contact us at support@theglobalsolutions.net for any questions on this offering in AWS Marketplace.


Connect with us
copyright - © 2016, The Globalsolutions LLC. or its affiliates. All rights reserved.

>