PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Prompt Engineering Python Packages

Python packages with the GitHub topic prompt-engineering. Sorted by relevance, with stars and monthly downloads.
mlflow
mlflow-skinny

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

38.2M 26K 6K
mlflow
mlflow

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

37.1M 26K 6K
mlflow
mlflow-tracing

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

17M 26K 6K
promplate
partial-json-parser

Parse partial JSON generated by LLM

6.7M 130 9
comet-ml
opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

6.2M 19K 1K
masci
banks

LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. It allows attaching metadata to prompts to ease their management, and versioning is first-class citizen. Banks provides ways to store prompts on disk along with their metadata.

4.7M 127 20
Arize-ai
arize-phoenix

AI Observability & Evaluation

2.4M 10K 882
Arize-ai
arize-phoenix-otel

AI Observability & Evaluation

1.8M 10K 882
dottxt-ai
outlines

Structured Outputs

1.8M 14K 697
Arize-ai
arize-phoenix-client

AI Observability & Evaluation

952K 10K 882
Arize-ai
arize-phoenix-evals

AI Observability & Evaluation

765K 10K 882
JudgmentLabs
judgeval

The Continuous-Improvement Stack for Agents. Our environment data and evals power agent improvement and monitoring.

479K 1K 93
microsoft
promptflow-devkit

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

479K 11K 1K
microsoft
promptflow-core

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

476K 11K 1K
microsoft
promptflow-tracing

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

475K 11K 1K
microsoft
promptflow

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

372K 11K 1K
protectai
llm-guard

The Security Toolkit for LLM Interactions

285K 3K 391
chopratejas
headroom-ai

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

176K 2K 160
microsoft
promptflow-tools

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

162K 11K 1K
microsoft
promptflow-azure

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

76K 11K 1K
MadcowD
ell-ai

A language model programming library.

58K 6K 346
agenta-ai
agenta

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

51K 4K 520
ju-bezdek
langchain-decorators

syntactic sugar 🍭 for langchain

47K 234 12
comet-ml
opik-optimizer

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

42K 19K 1K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery