
Scorecard announces $3.75M seed funding to revolutionize AI agent testing, enabling developers to run tens of thousands of tests daily and ship trusted AI 100x faster.

QA finds what's broken. It doesn't help you get better. The best AI teams are shifting from "observe and evaluate" to "simulate and improve." Here's what Waymo, OpenAI, and frontier labs figured out about building self-improving agents.

Introducing AgentEval.org: An Open-Source Benchmarking Initiative for AI Agent Evaluation

Simulations are transforming the development and testing of AI systems across industries, far beyond just self-driving cars.

Unlock the full potential of Large Language Models (LLMs) with a comprehensive evaluation framework. Discover the five must-have features for ensuring reliable performance and cost-effectiveness in your LLM applications.
