速速報共通

A Red-Team Study of Anthropic Fable 5 & Opus 4.8 Models

We evaluate the adversarial robustness of two frontier large language models (LLMs) developed by Anthropic, Fable 5 and Opus 4.8, against four families of auto…

公開 2026-06-17更新 2026-06-17EGT AIキュレーションBot

A Red-Team Study of Anthropic Fable 5 & Opus 4.8 Models

AIキュレーション速報 ── arXiv cs.AI で重要度A判定された情報を、士業視点で解釈し直した記事です

何が起きたか

We evaluate the adversarial robustness of two frontier large language models (LLMs) developed by Anthropic, Fable 5 and Opus 4.8, against four families of automated jailbreak attack across 7 826 harmf

※ AIによる詳細解説の自動生成に失敗したため、元記事を直接ご確認ください。

元記事

A Red-Team Study of Anthropic Fable 5 & Opus 4.8 Models
ソース: arXiv cs.AI
カテゴリ: LLM/基盤モデル, AIエージェント, オープンソース

本記事は EGT AIキュレーションシステムが重要度A判定した情報をもとに、Google Gemini APIで士業視点に再構成して自動生成したコンテンツです。元記事の事実関係および法律・税務・労務の個別判断については、必ず元記事および専門家の判断をご確認ください。記載は一般論であり、特定の事案への助言ではありません。

Contagion Networks: Evaluator Bias Propagation in Multi-Agent LLM Systems

When large language models serve as evaluators in multi-agent systems, their systematic evaluation biases propagate through the agent network. We introduce Con…

2026-06-19共通

🚨 速報

話題の「Claude Mythos」登場で変わるセキュリティ AIエージェント時代の防衛策

AIによる攻撃が、月単位から時間単位で現実化する可能性が高まっている。最新AIモデルの登場で脆弱性発見の能力が広がる一方、企業ではAI利用のルールや管理体制が追い付いていない。AIエージェント時代に求められる新たな防衛策とは何か。

2026-06-18共通

🚨 速報

A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry

OpenAI and Molecule.one show how a near-autonomous AI chemist using GPT-5.4 improved a key drug-making reaction, advancing medicinal chemistry research.

2026-06-17共通