About IB-Bench
Measuring what matters for AI in investment banking.
Mission
Generic benchmarks test general reasoning. Finance benchmarks test textbook knowledge. Neither tells you whether a model can actually do the job.
IB-Bench evaluates LLMs on the work junior analysts actually do: parsing complex filings, building and debugging financial models, and extracting critical data from documents. If you're building AI for finance, these are the tasks that matter.
Why IB-Bench?
- Real tasks: Built from actual analyst workflows, not synthetic problems
- Difficulty-weighted: Hard tasks count more, because that's where the value lies
- Quality data: Materials sourced from industry professionals
- Open source: Full methodology, prompts, and rubrics on GitHub
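Difficulty weighting can be sketched roughly as follows; the tier names and weight values here are illustrative assumptions, not IB-Bench's published rubric (see the GitHub repo for the actual methodology).

```python
# Minimal sketch of difficulty-weighted scoring. The tiers and weights
# below are hypothetical, chosen only to illustrate the idea.

def weighted_score(results):
    """Aggregate (difficulty, passed) pairs, counting harder tasks more.

    `results` is a list of (difficulty, passed) tuples, where passed is 0 or 1.
    """
    # Hypothetical weights: a hard task contributes 4x an easy one.
    weights = {"easy": 1.0, "medium": 2.0, "hard": 4.0}
    total = sum(weights[difficulty] for difficulty, _ in results)
    earned = sum(weights[difficulty] * passed for difficulty, passed in results)
    return earned / total

# Two hard passes outweigh two easy failures: score is 8/10 = 0.8,
# versus 0.5 under an unweighted average.
print(weighted_score([("easy", 0), ("easy", 0), ("hard", 1), ("hard", 1)]))
```

The effect is that a model that only solves easy tasks scores poorly, while one that handles the hard tasks is rewarded accordingly.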
Contact
Questions, feedback, or want to contribute tasks? Reach out on GitHub or X.