株式会社オブライト
Services
About
Company
Column
Glossary
Contact
日本語
日本語
メニューを開く
Column
Open Source
Articles tagged "Open Source"
4 articles
AI
2026-06-10
Cognition AI's FrontierCode Explained: The Next-Gen Coding AI Benchmark That Asks 'Is It Mergeable?'
On June 8, 2026, Cognition AI unveiled **FrontierCode** — not a product, but a coding AI evaluation benchmark. It measures not just 'does it pass tests' but 'would an OSS maintainer actually merge this?' across six axes. This article covers its differences from SWE-bench Verified, the three-tier dataset (Diamond/Main/Extended), official results with Claude Opus 4.8 leading at 13.4% on Diamond, and its relevance to Japan's rigorous code-review culture.
Cognition AI
FrontierCode
SWE-bench
AI
2026-06-05
The Complete Guide to Hermes Agent & Hermes Desktop Skills and Tools — 19,932 Skill Catalog, 40+ Built-in Tools, and the Use-Case Patterns That Matter (June 2026)
A comprehensive guide to the **Skills & Tools system** in Nous Research's open-source agent Hermes Agent v0.15.2 (and the [Hermes Desktop](../columns/hermes-desktop-nous-research-2026) GUI), grounded in official docs and GitHub releases. Covers **Skills** (on-demand procedural docs) with the three-level Progressive Disclosure loading scheme starting at ~3k tokens, the `SKILL.md` format, the **skills.sh catalog that exploded from 858 to 19,932 entries in v0.15.1**, the standout new skills (`openhands`, `code-wiki`, `web-pentest`), the self-improving loop where the agent creates / patches / edits / deletes its own Skills, and **Tools** — 40+ built-ins like `web_search`, `x_search`, `terminal`, `patch`, `browser_navigate`, `vision_analyze`, `cronjob`, `memory`, `delegate_task`. Also covers MCP **client + server** support, the macOS Computer Use background execution that doesn't move the cursor or switch Spaces (5–20ms/event), and the 25+ messenger gateway (Slack / Discord / Telegram / Teams / WhatsApp / LINE / Feishu / WeCom and more). Ends with **eight category-specific combination patterns** — research, writing, data analysis, coding, customer support, social listening, internal automation, personal work — sized for Japanese enterprise practice.
Hermes Agent
Hermes Desktop
Nous Research
AI
2026-06-03
Hermes Desktop Deep Dive — Nous Research's OSS Resident Personal Agent for Every Platform
**Hermes Desktop** by Nous Research is the native desktop app version of Hermes Agent, first demoed by Jensen Huang at the NVIDIA GTC keynote and now in public preview. Released under the MIT license, it supports macOS, Windows, and Linux with voice mode, cron scheduling, Computer Use, and MCP gateway integration — all sharing the same config, skills, and memory as the CLI and TUI. This column covers features, competitive positioning, and key considerations for Japanese enterprise adoption.
Hermes Desktop
Nous Research
AI Agent
AI
2026-05-21
Gemma 4 and the Google AI Studio Overhaul — What Google I/O 2026 Means for Open-Weight LLMs and Enterprise Adoption in Japan
Google I/O 2026 put a fresh spotlight on Gemma 4 (2B–31B, 256K context, 140 languages, Apache 2.0) and a major Google AI Studio overhaul featuring Kotlin vibe coding, one-click Cloud Run deployment, and the Managed Agents API. This column covers the full picture — hardware requirements, competitive positioning against Llama 4 and Qwen, and practical adoption guidance for Japanese enterprises.
Google
Gemma 4
Google AI Studio