Aidena

Data Pipelines

ETL and data orchestration tools for feeding AI systems

18 tools

Airbyte

Freemium

Open-source data integration platform with 300+ connectors for syncing data from any source to any destination. Used in ...

Data Pipelines

Apache Airflow

Open Source

Open-source platform to programmatically author, schedule, and monitor workflows using Python DAGs. Differentiates with ...

Data Pipelines

Apache Kafka

Open Source

Open-source distributed event streaming platform for high-performance data pipelines, streaming analytics, and data inte...

Data Pipelines

Crawlee

Open Source

Web scraping and browser automation library. Handles anti-bot protections. TypeScript and Python.

Data Pipelines

Dagster

Freemium

Data orchestration platform built around the concept of software-defined assets. Designed for ML and analytics teams, it...

Data Pipelines

dbt (data build tool)

Freemium

SQL-first transformation tool that enables data analysts and engineers to transform, test, and document data using modul...

Data Pipelines

Embedchain

Open Source

RAG framework by Mem0. Create AI apps over any data in minutes. Supports 20+ data source types.

Data Pipelines

Flyte

Open Source

Open-source workflow orchestration platform for building and scaling AI/ML pipelines and data workflows. Differentiates ...

Data Pipelines

Jina Reader

Freemium

Converts any URL to LLM-friendly text. Simple API: prefix the target URL with https://r.jina.ai/. Free tier available.

Data Pipelines
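
The URL-prefix convention in the Jina Reader entry above can be sketched in a few lines of Python. This is a minimal illustration, not an official client: the helper names `reader_url` and `fetch_as_text` are hypothetical, the HTTPS form of the prefix is an assumption, and rate limits or API keys on the free tier are not handled.

```python
from urllib.parse import urlsplit
from urllib.request import urlopen

# Assumption: the reader is reached by prefixing the target URL with this base.
READER_PREFIX = "https://r.jina.ai/"


def reader_url(target_url: str) -> str:
    """Return the target URL with the reader prefix prepended."""
    parts = urlsplit(target_url)
    # Only absolute http(s) URLs make sense as input here.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {target_url!r}")
    return READER_PREFIX + target_url


def fetch_as_text(target_url: str, timeout: float = 30.0) -> str:
    """Fetch the reader's text rendering of a page (performs a network call)."""
    with urlopen(reader_url(target_url), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `reader_url("https://example.com/docs")` only builds the prefixed URL; `fetch_as_text` is the part that actually contacts the service, so it is kept separate for pipelines that want to queue URLs before fetching.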

Kedro

Open Source

Open-source Python framework for creating reproducible, maintainable, and modular data science and ML pipelines. Develop...

Data Pipelines

Kubeflow

Open Source

Open-source ML platform on Kubernetes. Provides Pipelines (DAG orchestration), Notebooks, Model Training Operator, KServ...

Data Pipelines

LlamaParse

Freemium

LlamaIndex's document parser. Handles complex PDFs with tables, charts, and mixed layouts for RAG.

Data Pipelines

Mage AI

Freemium

Open-source data pipeline tool built for ML engineers with a block-based notebook interface. Designed to make building, ...

Data Pipelines

MegaParse

Open Source

Universal document parser. Supports PDF, DOCX, PPTX, and more. Integrates with LangChain and LlamaIndex.

Data Pipelines

Metaflow

Open Source

Python framework for building and managing ML, AI, and data science workflows with built-in versioning and orchestration...

Data Pipelines

Prefect

Freemium

Python-native workflow orchestration platform that turns functions into observable workflows using decorators. Different...

Data Pipelines

R2R (SciPhi)

Open Source

Production-ready RAG engine. Ingestion, search, and generation in one system with knowledge graph support.

Data Pipelines

ZenML

Freemium

Open-source MLOps framework for building portable, production-ready ML pipelines. Abstracts infrastructure complexity so...

Data Pipelines