A human-ranked ELO-style rating for LLMs exists

Today I learned that Huggingface operates the LMSYS Chatbot Arena Leaderboard. It’s a comprehensive list of the most important LLMs available and how they rank based on user preferences. By going to the Chatbot Arena you can pose a question, have it answered by two randomly selected LLMs and then vote which performed better.