At Valere, we understand that selecting the right Large Language Model (LLM) is crucial for achieving the best results in AI-driven tasks. Each LLM has its own strengths and weaknesses, and choosing the right one can significantly impact your project's success. While there are numerous metrics available to compare these models, user experience often tells a different story.
On this page
So, How Do You Choose the Right LLM?
The most reliable source of information we’ve found is the LMSYS Chatbot Arena Leaderboard via Substack. This leaderboard is unique because it relies on scores assigned by over 800,000 human users who compare models in A/B tests. Here are the current top models as of May 25, 2024:
LMSYS Chatbot Arena Leaderboard 5/24/2024
Chat GPT-4o (1287 points)Overview:
The top-performing model released on May 10, 2024. It will soon feature interactive tables, charts, and new multimodal capabilities available through the API.
Price: $20/month
Get access: https://chat.openai.com/
Chat GPT-4 Turbo (1262 points)
Overview: The second-best model available for free since March 2024 via Microsoft Copilot in Bing. Microsoft is expected to replace it with Chat GPT-4o as it’s 50% more cost-effective.
Price: Free
Get access: https://www.bing.com/chat
Google Gemini 1.5 (1248 points)
Overview: Available through the Gemini Advanced plan. The free version offers Gemini Pro, which scores slightly lower (1208).
Price: $20/month
Get access: https://gemini.google.com/app
Claude 3 Opus (1246 points)
Overview: Excellent for philosophical discussions but less effective in structuring data. Limited availability in some countries, often requiring a US phone number and VPN for access.
Price: $20/month
Get access: https://claude.ai/
Llama3 70b (1203 points)
Overview: An open-source model from Meta, known for its speed. Available for free on the Groq website, along with other open-source models.
Price: Free
Get access here: https://groq.com/
*Note: While we commonly refer to these models as LLMs (Large Language Models), many are actually LMMs (Large Multimodal Models). The LMSYS Chatbot Arena Leaderboard focuses on text responses to text questions.*
Choosing the right LLM can elevate your projects and streamline your workflows, ensuring you stay ahead in the competitive landscape. At Valere, we strive to provide you with the best tools and insights to make informed decisions in the rapidly evolving field of AI.
Share