MLflow 3.12.0 has been released! 3.12.0 focuses on improving our LLM observability workflows, making tracing more accessible, feature-rich, and performant. Here are some of the major feature highlights:
Multimodal Tracing. Users can now store multimodal content in tracing spans as artifact attachments instead of inline binary data. We've also patched the UI to support the new mlflow-attachment:// style URI, with rich rendering available for PDFs, audio, and images. This feature works out of the box with autologging, but manual attachment management is also possible. Visit the documentation page to learn more.
Codex, Gemini, Qwen coding agent tracing support. Similar to our Claude Code tracing integration, we've now added support for the Codex, Gemini, and Qwen coding agent platforms as well! For intructions on how to get started, check out the documentation (Codex, Gemini CLI, Qwen Code).
Gateway guardrails. You can now set guardrails on your gateway endpoints to prevent unsafe or non-compliant model inputs and outputs. Try it out in the MLflow UI, and visit the documentation page to learn more!
Trace table pagination. The traces tab is now paginated, rather than fetching all traces up to a limit of 1000. This improves initial load time, and makes the page feel more responsive overall.
Star us on GitHub - Show your support for the project