VetLLM evaluates LLMs with veterinary and medical programming tasks.
| # | Model | Pass@1 |
|---|---|---|
| 1 | Instruction-tuned Mistral 7B model | 3.17 |
| 2 | Instruction-tuned LLaMA 3.1 (8B) | 3.07 |
| 3 | Strong 7B conversational model | 2.97 |
| 4 | Fine-tuned Mistral 7B for veterinary tasks | 2.9 |
| 5 | 🧠google/gemma-7b Official 7B model from Google's Gemma family | 2.41 |