Skip to content
Potato annotation tool

Annotate anything — text, agents, and beyond

Potato is an open-source platform for annotating text, audio, images, and qualitative data — from classification and span labeling to codebook-driven coding. It's also the first annotation tool purpose-built for evaluating AI agents: import traces from any framework, visualize agent actions interactively, and configure it all in YAML.

Featured at·HCOMP 2024 (Best Demo)·EMNLP 2022

Get started in 4 steps

From installation to annotation in minutes.

1.
pip install potato-annotation
# Install Potato
2.
# Create your config.yaml
# Configure your task
3.
potato start config.yaml
# Launch the server
4.
# Open localhost:8000
# Start annotating!
Get started in 4 steps

Everything you need for annotation

Powerful features designed for researchers and teams.

Agent Evaluation

Annotate multi-agent teams on a clickable interaction graph, evaluate computer-use, voice, and video agents, calibrate LLM judges against human labels, edit trajectories into SFT and DPO training data, and rank models in an arena — all via YAML.

30+ Annotation Types

Radio, multiselect, likert, slider, text, span, best-worst scaling, pairwise, number, multirate, video, image, audio, bounding box, polygon, event, and taxonomy annotation, plus qualitative coding (QDA Mode) with a living codebook, in-vivo codes, memos, and cases.

AI-Powered

LLM integration with OpenAI, Claude, Gemini for intelligent hints, keyword highlighting, label suggestions, MACE competence estimation, option highlighting, and diversity ordering.

Multimedia Support

Annotate audio with waveforms, images with bounding boxes and polygons, and video with playback controls.

Active Learning

5 query strategies — uncertainty sampling, diversity-based selection, BADGE, BALD, and hybrid ensemble — plus LLM cold-start for intelligent instance selection before any labels exist.

Zero Code Setup

Configure everything in YAML. No programming required to create sophisticated annotation interfaces.

Built for every domain

Text, images, audio, or video — Potato has you covered.

Natural Language Processing
Computer Vision
Audio & Speech
Conversational AI
Multimodal
Biomedical & Social Science
Agent Evaluation

350+

Browse 350+ annotation designs

Ready-to-use configurations from the community.

Ready to start annotating?

Join researchers and teams worldwide using Potato.