MMIRAGE Documentation

MMIRAGE logo

MMIRAGEModular Multimodal Intelligent Reformatting and Augmentation Generation Engine — is an advanced platform for large-scale dataset processing using generative models, including vision-language models (VLMs).


🚀 Getting Started

Install MMIRAGE and run your first pipeline in minutes.

Installation
📖 Configuration Reference

Full YAML configuration reference for all parameters.

Configuration Reference
🖥️ CLI Reference

All mmirage subcommands, flags, and examples.

CLI Reference
📦 API Reference

Auto-generated documentation for every public module.

API Reference

Key Features

  • Multimodal support — process text and images with vision-language models.

  • YAML-driven — configure every aspect of a pipeline via a single file using Jinja2 templating and JMESPath queries.

  • Scalable — native sharding with multi-node SLURM support.

  • Modular — pluggable processors, loaders, and writers.

  • Automatic retry — configurable shard-level retry with budget tracking.

  • Structured output — produce plain text or validated JSON.