Skip to main content
Latent Space
Latent Space
Blog Profile
Admin
Blog Profile
Jaesol Shin

Jaesol Shin

Towards observable, reliable, scalable AI

GitHub

Categories

All Posts 26 Research 1 AI 17 Development 3 DB 5

Tags

AI8 PostgreSQL8 Benchmark7 RAG6 LLM4 zsh4 Claude Code4 OpenAI4 GraphRAG4 LightRAG4 Multi-account3 Dotfiles3 Configuration3 Developer Workflow3 API3 Productivity2 GPT-52 DeepSeek2 GraphDB2 RCTE2 Neo4j2 Apache AGE2 Recursive CTE2 Markdown2 PDF2 Agent2 Self-Improvement2 POMDP2

Archive

2026 26

#LLM

한국어

May 22, 2026 · AI

Weights, Prompts, Codes as Parameters

Weights, prompts, and code as parameters at different layers of a learnable policy space

#AI #Agent #LLM #POMDP

May 20, 2026 · AI

Apple Silicon LLM Inference — Five Backends Compared

Benchmarking Qwen3.5-9B on Apple Silicon across MLX, llama.cpp, Ollama, omlx, and vLLM Metal — single-request throughput, prefill scaling, decode vs input length, and concurrency response

#LLM #Apple Silicon #MLX #llama.cpp

May 20, 2026 · AI

Long-Context Evaluation — NIAH and Lost in the Middle

NIAH limits, the Lost in the Middle effect, alternative benchmarks, and measured recall across four reasoning-effort modes

#LLM #Long-context #NIAH #Benchmark

May 20, 2026 · AI

NVIDIA NIM API — Free Inference for GLM, Kimi, Nemotron, and Gemma 4

NVIDIA's build.nvidia.com offers 100+ models on H100 infrastructure for free. Plug it directly into Claude Code, Cursor, or any OpenAI-compatible coding agent.

#NVIDIA #NIM #AI #API
Jaesol Shin

Jaesol Shin

Towards observable, reliable, scalable AI

GitHub

Categories

All Posts 26 Research 1 AI 17 Development 3 DB 5

Tags

AI8 PostgreSQL8 Benchmark7 RAG6 LLM4 zsh4 Claude Code4 OpenAI4 GraphRAG4 LightRAG4 Multi-account3 Dotfiles3 Configuration3 Developer Workflow3 API3 Productivity2 GPT-52 DeepSeek2 GraphDB2 RCTE2 Neo4j2 Apache AGE2 Recursive CTE2 Markdown2 PDF2 Agent2 Self-Improvement2 POMDP2

Archive

2026 26