Prompt Engineering Python Packages

mlflow-skinny

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

42.4M 27K 6K

mlflow

39.2M 27K 6K

mlflow-tracing

20.9M 27K 6K

banks

LLM prompt language based on Jinja. Banks provides tools and functions to build prompt text and chat messages from generic blueprints. It allows attaching metadata to prompts to ease their management, and versioning is a first-class citizen. Banks provides ways to store prompts on disk along with their metadata.

7.4M 126 20

partial-json-parser

Parse partial JSON generated by an LLM

6.1M 134 9

opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

3.8M 20K 2K

arize-phoenix

AI Observability & Evaluation

2.2M 10K 956

outlines

Structured Outputs

2M 14K 754

arize-phoenix-otel

AI Observability & Evaluation

1.8M 10K 956

headroom-ai

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

1.3M 57K 4K

arize-phoenix-client

AI Observability & Evaluation

977K 10K 956

arize-phoenix-evals

AI Observability & Evaluation

750K 10K 956

promptflow-devkit

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

287K 11K 1K

promptflow-core

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

285K 11K 1K

promptflow-tracing

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

279K 11K 1K

llm-guard

The Security Toolkit for LLM Interactions

234K 3K 414

judgeval

The Continuous-Improvement Stack for Agents. Our environment data and evals power agent improvement and monitoring.

147K 1K 93

promptflow

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

117K 11K 1K

ell-ai

A language model programming library.

80K 6K 344

promptflow-tools

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

64K 11K 1K

pdd-cli

Prompt Driven Development (PDD): The Last Programming Language™. Prompt files are source; code is generated output.

61K 783 67

promptflow-azure

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

60K 11K 1K

vincio

The context engineering platform for AI applications — compile prompts, memory, retrieval, tools, schemas & policies into optimized, validated, observable context packets.

47K 2 0

langchain-decorators

Syntactic sugar 🍭 for LangChain

45K 234 12