AI evaluation tools - AI tools

  • Arize
    Arize Unified Observability and Evaluation Platform for AI

    Arize is a comprehensive platform designed to accelerate the development and improve the production of AI applications and agents.

    • Freemium
    • From 50$
  • Future AGI
    Future AGI World’s first comprehensive evaluation and optimization platform to help enterprises achieve 99% accuracy in AI applications across software and hardware.

    Future AGI is a comprehensive evaluation and optimization platform designed to help enterprises build, evaluate, and improve AI applications, aiming for high accuracy across software and hardware.

    • Freemium
    • From 50$
  • LastMile AI
    LastMile AI Ship generative AI apps to production with confidence.

    LastMile AI empowers developers to seamlessly transition generative AI applications from prototype to production with a robust developer platform.

    • Contact for Pricing
    • API
  • Braintrust
    Braintrust The end-to-end platform for building world-class AI apps.

    Braintrust provides an end-to-end platform for developing, evaluating, and monitoring Large Language Model (LLM) applications. It helps teams build robust AI products through iterative workflows and real-time analysis.

    • Freemium
    • From 249$
  • Freeplay
    Freeplay The All-in-One Platform for AI Experimentation, Evaluation, and Observability

    Freeplay provides comprehensive tools for AI teams to run experiments, evaluate model performance, and monitor production, streamlining the development process.

    • Paid
    • From 500$
  • AI Monitor
    AI Monitor Don’t Remain Blind in the Age of AI!

    AI Monitor is a Generative Engine Optimization (GEO) platform helping brands track visibility and reputation across AI platforms like ChatGPT and Google AI Overviews.

    • Contact for Pricing
  • Okareo
    Okareo Error Discovery and Evaluation for AI Agents

    Okareo provides error discovery and evaluation tools for AI agents, enabling faster iteration, increased accuracy, and optimized performance through advanced monitoring and fine-tuning.

    • Freemium
    • From 199$
  • Evidently AI
    Evidently AI Collaborative AI observability platform for evaluating, testing, and monitoring AI-powered products

    Evidently AI is a comprehensive AI observability platform that helps teams evaluate, test, and monitor LLM and ML models in production, offering data drift detection, quality assessment, and performance monitoring capabilities.

    • Freemium
    • From 50$
  • AI Parabellum
    AI Parabellum #1 AI Tools Directory

    AI Parabellum is the #1 AI Tools Directory, curating hand-tested AI tools and SaaS solutions. Discover, compare, and implement innovative AI technologies tailored to your needs.

    • Free
  • Mureka.ai
    Mureka.ai Find the Best AI Tools Instantly

    Mureka.ai is a comprehensive directory helping users discover, filter, and compare the latest AI tools across various categories for different needs.

    • Free
  • Lisapet.ai
    Lisapet.ai AI Prompt testing suite for product teams

    Lisapet.ai is an AI development platform designed to help product teams prototype, test, and deploy AI features efficiently by automating prompt testing.

    • Paid
    • From 9$
  • Gentrace
    Gentrace Intuitive evals for intelligent applications

    Gentrace is an LLM evaluation platform designed for AI teams to test and automate evaluations of generative AI products and agents. It facilitates collaborative development and ensures high-quality LLM applications.

    • Usage Based
  • Relari
    Relari Trusting your AI should not be hard

    Relari offers a contract-based development toolkit to define, inspect, and verify AI agent behavior using natural language, ensuring robustness and reliability.

    • Freemium
    • From 1000$
  • ToolList.ai
    ToolList.ai Your Ultimate Directory of Artificial Intelligence Tools

    ToolList.ai is a comprehensive directory for discovering, submitting, and exploring innovative AI tools across various categories. It provides a platform for users and developers to connect and stay updated on AI advancements.

    • Free
  • AI Score My Site
    AI Score My Site Discover your website's AI search engine readiness

    AI Score My Site is a specialized tool that evaluates websites for AI search engine optimization and provides actionable insights for improving AI discoverability and ranking potential.

    • Free
  • AIDetect
    AIDetect The Most Powerful Free AI Detector

    AIDetect is a comprehensive AI detection platform that offers high-accuracy identification of AI-generated content from various sources like ChatGPT, Google Gemini, and Claude Opus, along with AI text humanization capabilities.

    • Freemium
    • From 10$
  • The AI Digest
    The AI Digest Interactive AI Explainers and Demos to Understand AI's Future

    The AI Digest offers interactive explainers and demos showcasing current AI capabilities, helping users understand AI advancements and plan for the future.

    • Free
  • Theee.ai
    Theee.ai Access Over 50,000 AI Tools for Free on a One-Stop Platform

    Theee.ai is a comprehensive platform that aggregates over 50,000 AI tools in one place, eliminating the need to trial multiple AI sites separately.

    • Free
  • Autoblocks
    Autoblocks Improve your LLM Product Accuracy with Expert-Driven Testing & Evaluation

    Autoblocks is a collaborative testing and evaluation platform for LLM-based products that automatically improves through user and expert feedback, offering comprehensive tools for monitoring, debugging, and quality assurance.

    • Freemium
    • From 1750$
  • AI NavHub
    AI NavHub Discover the top AI tools of 2025 with the AI NavHub Tools Directory!

    AI NavHub is a comprehensive directory listing various AI tools across categories like writing, image generation, video, business, and more, helping users discover and access relevant AI solutions.

    • Free
  • Langtrace
    Langtrace Transform AI Prototypes into Enterprise-Grade Products

    Langtrace is an open-source observability and evaluations platform designed to help developers monitor, evaluate, and enhance AI agents for enterprise deployment.

    • Freemium
    • From 31$
  • Is It AI?
    Is It AI? AI Detection Made Simple

    Is It AI? offers quick and accurate AI content detectors for identifying AI-generated images and text. Improve trust and verify authenticity with advanced detection tools.

    • Freemium
    • From 8$
  • Passed.AI
    Passed.AI Beyond AI Detection: Guide Students to use AI Appropriately

    Passed.AI is an educational tool that helps educators guide students in responsible AI usage through comprehensive document auditing, AI detection, and plagiarism checking capabilities.

    • Free Trial
    • From 10$
  • Humanloop
    Humanloop The LLM evals platform for enterprises to ship and scale AI with confidence

    Humanloop is an enterprise-grade platform that provides tools for LLM evaluation, prompt management, and AI observability, enabling teams to develop, evaluate, and deploy trustworthy AI applications.

    • Freemium
  • BenchLLM
    BenchLLM The best way to evaluate LLM-powered apps

    BenchLLM is a tool for evaluating LLM-powered applications. It allows users to build test suites, generate quality reports, and choose between automated, interactive, or custom evaluation strategies.

    • Other
  • scite.ai
    scite.ai AI for Research

    Discover, evaluate, and understand scientific articles through Smart Citations.

    • Paid
  • Maihem
    Maihem Enterprise-grade quality control for every step of your AI workflow.

    Maihem empowers technology leaders and engineering teams to test, troubleshoot, and monitor any (agentic) AI workflow at scale. It offers industry-leading AI testing and red-teaming capabilities.

    • Contact for Pricing
  • teammately.ai
    teammately.ai The AI Agent for AI Engineers that autonomously builds AI Products, Models and Agents

    Teammately is an autonomous AI agent that self-iterates AI products, models, and agents to meet specific objectives, operating beyond human-only capabilities through scientific methodology and comprehensive testing.

    • Freemium
  • HoneyHive
    HoneyHive AI Observability and Evaluation Platform for Building Reliable AI Products

    HoneyHive is a comprehensive platform that provides AI observability, evaluation, and prompt management tools to help teams build and monitor reliable AI applications.

    • Freemium
  • Popular AI
    Popular AI Find the Most Popular AI tools for every task

    Popular AI is a comprehensive directory featuring the best AI tools available online, organized by category to help users find solutions to boost productivity.

    • Free
  • Didn't find tool you were looking for?

    Be as detailed as possible for better results
    EliteAi.tools logo

    Elite AI Tools

    EliteAi.tools is the premier AI tools directory, exclusively featuring high-quality, useful, and thoroughly tested tools. Discover the perfect AI tool for your task using our AI-powered search engine.

    Subscribe to our newsletter

    Subscribe to our weekly newsletter and stay updated with the latest high-quality AI tools delivered straight to your inbox.

    © 2025 EliteAi.tools. All Rights Reserved.