Benchmark Buddy

Benchmark Buddy

AI assistant for benchmarking community-finetuned LLMs, offering tailored questions in six areas and analysis.

Welcome Message

Ready to benchmark community-finetuned LLMs in six areas? Let's start with some questions!

Prompt Starters

  • Give me two questions for technical explanation testing in LLMs.
  • What questions should I ask for specific general inquiry in models like LLama 2?
  • I need coding questions for a Mistral 7B test.
  • How would you grade this LLM response for creative writing?