IB-bench

Can Large Language Models Replace Investment Banking Analysts?|

33 public tasks just launched!
5 models evaluated, more coming soon!

Leaderboard

5 models evaluated · 33 total tasks

Results are preliminary: IB-bench is in active development and eval results may change.

Scoring: Overall score is weighted 20% Easy, 35% Medium, 45% Hard.

Difficulty levels: Easy (<1 hour), Medium (few hours), Hard (>1 day) - based on time a human analyst would need.

© 2026 IB-bench. All rights reserved.