株式会社オブライト

Articles tagged "ローカルLLM"

10 articles

NVIDIA DGX Spark in 2026 — A Two-Stage Workflow for Code Migrations Where "Confidential Analysis Stays Local, Cloud LLMs Only Touch Sanitized Code"

An overview of NVIDIA DGX Spark (GB10 Grace Blackwell Superchip, 128GB unified memory, up to 1 PFLOP at FP4, $4,699) and a concrete two-stage workflow for confidential code-migration projects: analyze and sanitize locally, then hand a clean, PII-free representation to cloud frontier LLMs for the actual migration. Practical answers to the "executives won't approve cloud AI even with opt-out" problem.

NVIDIADGX SparkローカルLLM

NousResearch Hermes Complete Guide — Hermes 4.3 36B, Function Calling & Hermes Agent [2026]

Complete guide to NousResearch Hermes 4.3 36B (512K context) and the Hermes Agent framework. Covers Function Calling implementation, Ollama setup, hardware requirements, and benchmarks including RefusalBench dominance — updated for 2026.

HermesNousResearchFunction Calling

Local LLM Landscape April 2026 — Top 10 Open-Source Models Comprehensive Comparison [Ollama Guide]

Comprehensive comparison of the top 10 local LLMs as of April 2026. Covers SWE-bench scores, Japanese language performance, VRAM requirements, Ollama commands, and licensing for Gemma 4, Llama 4, Qwen 3.5, GLM-5.1, Kimi K2.5, MiniMax M2.5, and more.

ローカルLLMオープンソースAI2026年

Qwen 3.5 27B Dense & 35B-A3B MoE Complete Guide — DFlash Acceleration Breaks 24GB GPU Limits [2026]

Compare Qwen 3.5 27B Dense vs 35B-A3B MoE, check 24GB GPU requirements, learn DFlash 2–3x acceleration, and follow step-by-step Ollama setup instructions.

Qwen 3.527B Dense35B-A3B MoE

Gemma 4 E4B Complete Guide — 4.5B Parameter Multimodal Model for Edge Deployment [2026]

Gemma 4 E4B is Google's 4.5B parameter edge AI model released in April 2026. This guide covers local deployment on Apple Silicon and Raspberry Pi, multimodal features, quantization settings, and benchmark comparisons.

Gemma 4Gemma 4 E4BエッジAI

Claude Alternative Local LLM Comparison 2026 — Qwen 3.5, Mistral Small 4, DeepSeek R1 & Gemma 4 Reviewed

Following Anthropic Claude restrictions, comprehensive comparison of local LLMs including Qwen 3.5-9B, Mistral Small 4, DeepSeek R1, Gemma 4, and Llama 4. Detailed analysis of Japanese performance, hardware requirements, and use-case recommendations.

ローカルLLMQwen 3.5Mistral Small 4

AI API Cost Optimization in the Pay-Per-Use Era — Smart Strategies for Claude, GPT, Gemini & Local LLMs [2026]

Comprehensive guide to AI API cost optimization in the pay-per-use era. Covers Claude, GPT, Gemini pricing comparisons, 5 reduction techniques including prompt caching, batch APIs, local LLM hybrid operations, monthly cost simulations, and ROI calculation methods.

AI APIコスト最適化従量課金

Hybrid AI Strategy Guide — Achieving 50% Cost Reduction with Cloud API + Local LLM [2026]

A practical guide to reducing AI operational costs by over 50% with a hybrid AI strategy combining cloud APIs and local LLMs. Learn optimal architecture design and implementation steps using local models like Qwen 3.5 and DeepSeek R1 with Claude, GPT, and Gemini.

ハイブリッドAIローカルLLMコスト削減

Gemma 4 Complete Guide — Features, System Requirements & Ollama Setup [2026]

Complete guide to Google Gemma 4 (released April 2, 2026): 4 model variants (E2B/E4B/26B MoE/31B Dense), Apache 2.0 license, system requirements, multimodal capabilities, AIME 89% benchmark, 140+ languages, and step-by-step Ollama installation and setup instructions.

Gemma 4OllamaGoogle

Gemma 4 vs Llama 4 vs Qwen 3.5 Comparison — 2026 Local LLM Selection Guide

Comprehensive comparison of Gemma 4, Llama 4, and Qwen 3.5 local LLMs. Detailed analysis of benchmark performance, licensing, Japanese support, hardware requirements, and use case selection criteria.

Gemma 4Llama 4Qwen 3.5