Skip to main content
Latent Space
Latent Space
Blog Profile
Admin
Blog Profile
Jaesol Shin

Jaesol Shin

Towards observable, reliable, scalable AI

GitHub

Categories

All Posts 30 Research 1 AI 21 Development 3 DB 5

Tags

AI11 Benchmark8 PostgreSQL8 RAG7 LLM6 Agent5 zsh4 Claude Code4 OpenAI4 GraphRAG4 LightRAG4 Harness4 Multi-account3 Dotfiles3 Configuration3 Developer Workflow3 API3 vLLM2 Productivity2 GPT-52 DeepSeek2 GraphDB2 RCTE2 Neo4j2 Apache AGE2 Recursive CTE2 Memory2 Skills2

Archive

2026 30

#GPT

한국어

May 15, 2026 · AI

10 OpenAI Models Through Quick Benchmarks — The Model Isn't as Smart as You Pay

I ran 30 trials per configuration across GPT-4, GPT-5, and o-series models using three reasoning problems. gpt-5-nano on minimal scored 4.4%. o1 scored lower than gpt-4o.

#OpenAI #GPT #model comparison #AI
Jaesol Shin

Jaesol Shin

Towards observable, reliable, scalable AI

GitHub

Categories

All Posts 30 Research 1 AI 21 Development 3 DB 5

Tags

AI11 Benchmark8 PostgreSQL8 RAG7 LLM6 Agent5 zsh4 Claude Code4 OpenAI4 GraphRAG4 LightRAG4 Harness4 Multi-account3 Dotfiles3 Configuration3 Developer Workflow3 API3 vLLM2 Productivity2 GPT-52 DeepSeek2 GraphDB2 RCTE2 Neo4j2 Apache AGE2 Recursive CTE2 Memory2 Skills2

Archive

2026 30