Luigi

Luigi is a Python package by Spotify for building complex pipelines of batch jobs with dependency resolution, workflow management, and Hadoop support.

Luigi is a lightweight, reliable Python framework that helps you build complex pipelines of batch jobs with automatic dependency resolution. Created by Spotify, Luigi handles the plumbing of long-running data processes so you can focus on the tasks themselves.

Key Features:

  • Dependency Resolution — Automatically manages task dependencies and execution order
  • Workflow Visualization — Built-in UI for monitoring pipeline progress and failures
  • Hadoop Integration — Native support for HDFS, Hive, Pig, and Spark jobs
  • Atomic File Operations — Ensures data pipeline integrity with atomic file system abstractions
  • Flexible Tasks — Chain any task type from database dumps to ML training to web scraping
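
To make the first and fourth features concrete, here is a minimal plain-Python sketch of the underlying ideas. It does not use Luigi and is not Luigi's API: `Task`, `build`, `atomic_write`, `Extract`, and `Sort` are hypothetical names invented for this illustration. The method names `requires()`, `output()`, and `run()` echo the ones Luigi tasks declare, but everything else is an assumption. The sketch treats a task as complete when its output file exists, runs dependencies first, and writes each output through a temporary file so a crash never leaves a half-written result.

```python
import os
import tempfile


class Task:
    """Hypothetical stand-in for a pipeline task (not Luigi's base class)."""

    def requires(self):
        return []  # upstream tasks that must finish first

    def output(self):
        raise NotImplementedError  # path of the file this task produces

    def run(self):
        raise NotImplementedError  # the actual work

    def complete(self):
        # A task counts as done once its output file exists.
        return os.path.exists(self.output())


def atomic_write(path, text):
    """Write to a temp file in the target directory, then rename into place."""
    directory = os.path.dirname(path) or "."
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "w") as handle:
            handle.write(text)
        os.replace(tmp_path, path)  # atomic rename on the same filesystem
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


def build(task, log=None):
    """Run dependencies depth-first, skipping tasks that are already complete.

    Returns the names of the tasks that actually ran. No cycle detection.
    """
    log = [] if log is None else log
    if task.complete():
        return log
    for dependency in task.requires():
        build(dependency, log)
    task.run()
    log.append(type(task).__name__)
    return log


class Extract(Task):
    def __init__(self, workdir):
        self.workdir = workdir

    def output(self):
        return os.path.join(self.workdir, "raw.txt")

    def run(self):
        atomic_write(self.output(), "3\n1\n2\n")


class Sort(Task):
    def __init__(self, workdir):
        self.workdir = workdir

    def requires(self):
        return [Extract(self.workdir)]

    def output(self):
        return os.path.join(self.workdir, "sorted.txt")

    def run(self):
        with open(self.requires()[0].output()) as handle:
            lines = sorted(handle.read().split())
        atomic_write(self.output(), "\n".join(lines) + "\n")
```

Calling `build(Sort(workdir))` on an empty directory runs `Extract` and then `Sort`. Calling it again runs nothing, because both output files already exist. A real scheduler adds much more on top of this, such as failure handling, parallel workers, and a monitoring UI.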

Whether you're a data engineer, part of an analytics team, or a batch processing specialist, Luigi provides a simple framework for orchestrating reliable data pipelines.

  • Stars: 18.6K
  • Forks: 2.4K
  • Last commit: 2 months ago
  • License: Apache-2.0
  • Language: Python