Latent Space
Jaesol Shin

Towards observable, reliable, scalable AI

GitHub

Categories

All Posts (26), Research (1), AI (17), Development (3), DB (5)

Tags

AI (8), PostgreSQL (8), Benchmark (7), RAG (6), LLM (4), zsh (4), Claude Code (4), OpenAI (4), GraphRAG (4), LightRAG (4), Multi-account (3), Dotfiles (3), Configuration (3), Developer Workflow (3), API (3), Productivity (2), GPT-5 (2), DeepSeek (2), GraphDB (2), RCTE (2), Neo4j (2), Apache AGE (2), Recursive CTE (2), Markdown (2), PDF (2), Agent (2), Self-Improvement (2), POMDP (2)

Archive

2026 (26)

#MLX

May 20, 2026 · AI

Apple Silicon LLM Inference — Five Backends Compared

Benchmarking Qwen3.5-9B on Apple Silicon across MLX, llama.cpp, Ollama, omlx, and vLLM Metal — single-request throughput, prefill scaling, decode vs input length, and concurrency response

#LLM #Apple Silicon #MLX #llama.cpp
Jaesol Shin
