LangGraph Agent Evaluation Runner

Instructions:

This space uses a LangGraph agent with multimodal, search, math, and YouTube tools powered by OpenRouter.

  1. Log in to your Hugging Face account using the button below.
  2. Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see the score.

Agent Capabilities:

  • 🎨 Multimodal: Analyze images, extract text (OCR), process audio transcripts
  • 🔍 Search: Web search using multiple providers (DuckDuckGo, Tavily, SerpAPI)
  • 🧮 Math: Basic arithmetic, complex calculations, percentages, factorials
  • 📺 YouTube: Extract captions, get video information

Note: Processing all questions may take some time as the agent carefully analyzes each question and uses appropriate tools.

Questions and Agent Answers

Questions and Agent Answers