About IB-bench
Measuring what matters for AI in investment banking.
Mission
IB-bench evaluates LLMs on the work junior analysts actually do: parsing complex filings, building and debugging financial models, and extracting critical data from documents. If you're building AI for finance, these are the tasks that matter.
Why IB-bench?
- Real tasks: Built from actual analyst workflows, not synthetic problems
- Multi-format: Tests LLMs on PDFs, spreadsheets and web content
- Quality data: Materials sourced from industry professionals
- Open source: Full methodology, prompts, and rubrics on GitHub
Contact
Questions, feedback, or want to contribute tasks? Reach out on GitHub or X.