Blows everything out of the water on LM Arena.
https://arena.ai/leaderboard/text-to-image
242 Elo points clear of the next best model, with a 93% win rate against randomly matched opponents (96% against nano banana), while Gemini 3.1 (second best) sits at 67%. That's quite the leap.
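For a sense of scale, here's a rough back-of-the-envelope, assuming the standard logistic Elo formula (note the 93% figure is across all opponents, not just the runner-up):

    # Head-to-head win probability implied by a 242-point Elo gap
    # under the standard logistic Elo model: P = 1 / (1 + 10^(-diff/400))
    p = 1 / (1 + 10 ** (-242 / 400))
    print(round(p, 3))  # ~0.801, i.e. ~80% expected wins vs. the runner-up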
How is text-to-image even scored? It seems like a subjective measurement...
Users get two completions for their prompt and pick the one they prefer. From those pairwise preferences you can fit a Bradley-Terry model to get Elo-style scores per model.
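If you're curious what that fitting step looks like, here's a minimal sketch in Python (the model names and win counts below are made up, and LM Arena's real pipeline likely also handles ties and reports confidence intervals):

    import math

    # Hypothetical pairwise results: wins[a][b] = times model a beat model b.
    wins = {
        "model_a": {"model_b": 70, "model_c": 60},
        "model_b": {"model_a": 30, "model_c": 55},
        "model_c": {"model_a": 40, "model_b": 45},
    }
    models = list(wins)

    # Bradley-Terry assumes P(a beats b) = p_a / (p_a + p_b).
    # Fit the strengths p with the classic minorization-maximization update.
    p = {m: 1.0 for m in models}
    for _ in range(200):
        new_p = {}
        for a in models:
            total_wins = sum(wins[a].values())
            denom = sum(
                (wins[a].get(b, 0) + wins[b].get(a, 0)) / (p[a] + p[b])
                for b in models if b != a
            )
            new_p[a] = total_wins / denom
        scale = len(models) / sum(new_p.values())  # rescaling doesn't change the probabilities
        p = {m: v * scale for m, v in new_p.items()}

    # Convert strengths to an Elo-like scale: a 400-point gap ~ 10:1 odds.
    mean_log = sum(math.log10(v) for v in p.values()) / len(models)
    elo = {m: 1000 + 400 * (math.log10(v) - mean_log) for m, v in p.items()}
    for m, rating in sorted(elo.items(), key=lambda kv: -kv[1]):
        print(f"{m}: {rating:.0f}")

Since multiplying all the strengths by a constant changes nothing, only rating differences are meaningful, which is why leaderboards anchor the absolute scale arbitrarily.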
LM Arena is a particularly bad comparison site, too. The prompts users submit there are usually incredibly generic, like "A digital render of a sleek, futuristic motorcycle racing through a neon-lit cityscape."
I actually built GenAI Showdown a while back because I was deeply unsatisfied with LM Arena and other purported comparison tables, which either (A) relied solely on visual fidelity (a far less interesting benchmark than prompt adherence, IMHO) and/or (B) relied on extremely simplistic and banal prompts.
[dupe] https://news.ycombinator.com/item?id=47853000