LangGraph Agent Evaluation Runner
Instructions:
This space uses a LangGraph agent with multimodal, search, math, and YouTube tools powered by OpenRouter.
- Log in to your Hugging Face account using the button below.
- Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see the score.
Agent Capabilities:
- 🎨 Multimodal: Analyze images, extract text (OCR), process audio transcripts
- 🔍 Search: Web search using multiple providers (DuckDuckGo, Tavily, SerpAPI)
- 🧮 Math: Basic arithmetic, complex calculations, percentages, factorials
- 📺 YouTube: Extract captions, get video information
Note: Processing all questions may take some time as the agent carefully analyzes each question and uses appropriate tools.
Questions and Agent Answers