Evaluations

Puzzlet Docs

Puzzlet is the Git-based prompt engineering platform that lets application developers and domain experts collaborate on GenAI products. Puzzlet enables companies to manage, evaluate, and improve their full-stack LLM applications, with version control, type safety, and local development built in.

Overview

Start building awesome GenAI products in under 5 minutes.

Quickstart

The main concepts to help you get started with Puzzlet

Core Concepts

Learn how to configure Puzzlet for your application needs

Configure models and their user interfaces in Puzzlet

Model Schemas

Set up a webhook endpoint to test your Puzzlet inference in your application

Test Webhook

Learn how Puzzlet manages prompts in your application

Learn the core syntax and features of AgentMark

AgentMark Syntax

Learn how to migrate your existing prompts to Puzzlet

Migrating Prompts to Puzzlet

Learn how to develop and test Puzzlet prompts

Development

Learn how to create and use reusable components in Puzzlet

Components

Learn how to get structured JSON responses from your prompts

JSON Output

Learn how to extend prompts with tools and create multi-step agents

Tools and Agents

Monitor and debug your prompts with Puzzlet

Monitor and debug your prompts using OpenTelemetry

Traces and Logs

Analytics provide a high-level overview of how your application uses generative AI, including metrics such as cost, tokens, requests, latency, and top models.

Metrics

Datasets

Learn how to configure CI/CD for your Puzzlet implementation

CI/CD

Learn how to integrate your own models with Puzzlet

Custom Models

Puzzlet allows you to collaborate on prompts while maintaining type safety in a production environment.

Type Safety

Learn how to use Puzzlet Studio for local development

Puzzlet Studio

Create a PR directly through the hosted platform instead of committing to your target branch.

PR Workflows

Follow the instructions below to install AgentMark in your app.

Getting Started

Learn how to add model providers to your AgentMark project.

Model Providers

Learn how to migrate your existing LLM application to AgentMark

Migration Guide

Learn how AgentMark processes and transforms prompts

Architecture

AgentMark provides a flexible way to create prompts using Markdown and JSX. This section covers the core concepts and features of prompting with AgentMark.

Configure model parameters using standard settings.

Model Settings

Access and use variables in your prompts using props.

Props

AgentMark supports importing and reusing components across your prompts.

Reusable Components

Conditionals allow you to create dynamic prompts that adapt based on props or other conditions.

Conditionals

AgentMark supports iterating over arrays using the `<ForEach>` tag.

Loops

Transform values in your prompts using filter functions

Filter Functions

Define structured output using JSON Schema

Object Schema

Tools and agents allow you to extend your prompts with external capabilities.

Execute prompts and get responses from language models

Core API

Observability

AgentMark provides robust type safety through JSON Schema definitions in your prompt files. This ensures type checking for both inputs and outputs, making your prompts more reliable and maintainable.
