Journal of Artificial Intelligence Resarch: Volume 37の論文一覧

Journal of Artificial Intelligence Resarch Vol. 37 (2010)に記載されている内容を一覧にまとめ、機械翻訳を交えて日本語化し掲載します。

1 論文
2 参考文献
3 関連情報

論文

Text Relatedness Based on a Word Thesaurus

Text Relatedness Based on a Word Thesaurus / 単語シソーラスに基づくテキストの関連性

The computation of relatedness between two fragments of text in an automated manner requires taking into account a wide range of factors pertaining to the meaning the two fragments convey, and the pairwise relations between their words. Without doubt, a measure of relatedness between text segments must take into account both the lexical and the semantic relatedness between words. Such a measure that captures well both aspects of text relatedness may help in many tasks, such as text retrieval, classification and clustering. In this paper we present a new approach for measuring the semantic relatedness between words based on their implicit semantic links. The approach exploits only a word thesaurus in order to devise implicit semantic links between words. Based on this approach, we introduce Omiotis, a new measure of semantic relatedness between texts which capitalizes on the word-to-word semantic relatedness measure (SR) and extends it to measure the relatedness between texts. We gradually validate our method: we first evaluate the performance of the semantic relatedness measure between individual words, covering word-to-word similarity and relatedness, synonym identification and word analogy; then, we proceed with evaluating the performance of our method in measuring text-to-text semantic relatedness in two tasks, namely sentence-to-sentence similarity and paraphrase recognition. Experimental evaluation shows that the proposed method outperforms every lexicon-based method of semantic relatedness in the selected tasks and the used data sets, and competes well against corpus-based and hybrid approaches.

2つのテキスト断片間の関連性を自動的に計算するには、2つの断片が伝える意味や、それらの単語間の対の関係など、さまざまな要因を考慮する必要があります。テキストセグメント間の関連性の尺度は、単語間の語彙的関連性と意味的関連性の両方を考慮する必要があることは間違いない。テキストの関連性の両面を適切に捉えるこのような尺度は、テキスト検索、分類、クラスタリングなど、多くのタスクに役立つ可能性があります。本論文では、単語間の暗黙的な意味的リンクに基づいて意味的関連性を測定する新しいアプローチを提示します。このアプローチでは、単語間の暗黙的な意味的リンクを考案するために、単語シソーラスのみを活用します。このアプローチに基づいて、単語間の意味的関連性の尺度(SR)を活用し、それをテキスト間の関連性の測定に拡張した、テキスト間の意味的関連性の新しい尺度であるOmiotisを紹介します。段階的に手法を検証します。最初に、単語間の類似性と関連性、同義語の識別、単語の類推を含む、個々の単語間の意味的関連性の尺度のパフォーマンスを評価します。次に、文間の類似性と言い換え認識という2つのタスクでテキスト間の意味的関連性を測定する際の手法のパフォーマンスを評価します。実験的評価により、提案された手法は、選択されたタスクと使用されたデータセットにおいて、意味的関連性のあらゆる語彙ベースの手法よりも優れており、コーパスベースの手法やハイブリッド手法と十分に競合することが示されました。

From Frequency to Meaning: Vector Space Models of Semantics

From Frequency to Meaning: Vector Space Models of Semantics / 頻度から意味へ：意味論のベクトル空間モデル

Computers understand very little of the meaning of human language. This profoundly limits our ability to give instructions to computers, the ability of computers to explain their actions to us, and the ability of computers to analyse and process text. Vector space models (VSMs) of semantics are beginning to address these limits. This paper surveys the use of VSMs for semantic processing of text. We organize the literature on VSMs according to the structure of the matrix in a VSM. There are currently three broad classes of VSMs, based on term-document, word-context, and pair-pattern matrices, yielding three classes of applications. We survey a broad range of applications in these three categories and we take a detailed look at a specific open source project in each category. Our goal in this survey is to show the breadth of applications of VSMs for semantics, to provide a new perspective on VSMs for those who are already familiar with the area, and to provide pointers into the literature for those who are less familiar with the field.

コンピュータは人間の言語の意味をほとんど理解していません。そのため、コンピュータに指示を与える能力、コンピュータが自らの動作を人間に説明する能力、そしてテキストを分析・処理する能力は大きく制限されています。意味論のベクトル空間モデル（VSM）は、こうした限界に対処し始めています。本稿では、テキストの意味処理におけるVSMの利用について調査します。VSMに関する文献を、VSMのマトリックス構造に基づいて整理します。現在、VSMには、用語-文書、単語-文脈、ペア-パターンのマトリックスに基づく3つの大まかなクラスがあり、それぞれに3つの応用クラスがあります。本稿では、これら3つのカテゴリにおける幅広い応用を調査し、各カテゴリのオープンソースプロジェクトを詳細に考察します。本調査の目的は、意味論におけるVSMの応用範囲の広さを示し、既にこの分野に精通している人々にVSMに関する新たな視点を提供し、この分野にあまり精通していない人々には文献への指針を提供することです。

On Action Theory Change

On Action Theory Change / 行為理論の変化について

As historically acknowledged in the Reasoning about Actions and Change community, intuitiveness of a logical domain description cannot be fully automated. Moreover, like any other logical theory, action theories may also evolve, and thus knowledge engineers need revision methods to help in accommodating new incoming information about the behavior of actions in an adequate manner. The present work is about changing action domain descriptions in multimodal logic. Its contribution is threefold: first we revisit the semantics of action theory contraction proposed in previous work, giving more robust operators that express minimal change based on a notion of distance between Kripke-models. Second we give algorithms for syntactical action theory contraction and establish their correctness with respect to our semantics for those action theories that satisfy a principle of modularity investigated in previous work. Since modularity can be ensured for every action theory and, as we show here, needs to be computed at most once during the evolution of a domain description, it does not represent a limitation at all to the method here studied. Finally we state AGM-like postulates for action theory contraction and assess the behavior of our operators with respect to them. Moreover, we also address the revision counterpart of action theory change, showing that it benefits from our semantics for contraction.

行為と変化に関する推論コミュニティにおいて歴史的に認められているように、論理ドメイン記述の直観性は完全に自動化することはできません。さらに、他の論理理論と同様に、行為理論も進化する可能性があり、そのため知識エンジニアは、行為の振る舞いに関する新たな情報を適切に取り入れるための修正手法を必要とします。本研究は、マルチモーダル論理における行為ドメイン記述の変更に関するものです。その貢献は3つあります。まず、先行研究で提案された行為理論縮約の意味論を再検討し、クリプキモデル間の距離の概念に基づいて最小変化を表現する、より堅牢な演算子を提示します。次に、統語的行為理論縮約のためのアルゴリズムを提示し、先行研究で検討されたモジュール性原理を満たす行為理論について、その意味論に対する正しさを確立します。モジュール性はすべての行為理論に対して保証可能であり、ここで示すように、ドメイン記述の進化中に最大で一度しか計算する必要がないため、ここで検討する手法には全く制限はありません。最後に、行為理論の縮約に関するAGMのような公理を述べ、それらに対する演算子の挙動を評価します。さらに、行為理論の変化の修正版についても取り上げ、それが縮約に関する意味論から恩恵を受けることを示す。

Context-based Word Acquisition for Situated Dialogue in a Virtual World

Context-based Word Acquisition for Situated Dialogue in a Virtual World / 仮想世界における状況依存対話のための文脈に基づく単語獲得

To tackle the vocabulary problem in conversational systems, previous work has applied unsupervised learning approaches on co-occurring speech and eye gaze during interaction to automatically acquire new words. Although these approaches have shown promise, several issues related to human language behavior and human-machine conversation have not been addressed. First, psycholinguistic studies have shown certain temporal regularities between human eye movement and language production. While these regularities can potentially guide the acquisition process, they have not been incorporated in the previous unsupervised approaches. Second, conversational systems generally have an existing knowledge base about the domain and vocabulary. While the existing knowledge can potentially help bootstrap and constrain the acquired new words, it has not been incorporated in the previous models. Third, eye gaze could serve different functions in human-machine conversation. Some gaze streams may not be closely coupled with speech stream, and thus are potentially detrimental to word acquisition. Automated recognition of closely-coupled speech-gaze streams based on conversation context is important. To address these issues, we developed new approaches that incorporate user language behavior, domain knowledge, and conversation context in word acquisition. We evaluated these approaches in the context of situated dialogue in a virtual world. Our experimental results have shown that incorporating the above three types of contextual information significantly improves word acquisition performance.

会話システムにおける語彙問題に対処するために、これまでの研究では、対話中の共起音声と視線に教師なし学習アプローチを適用し、新しい単語を自動的に獲得してきた。これらのアプローチは有望性を示しているものの、人間の言語行動や人間と機械の会話に関連するいくつかの問題は解決されていない。第一に、心理言語学的研究は、人間の眼球運動と言語生成の間に一定の時間的規則性があることを示しています。これらの規則性は獲得プロセスを導く可能性があるものの、これまでの教師なしアプローチには組み込まれていなかった。第二に、会話システムは一般的に、ドメインと語彙に関する既存の知識ベースを持っています。既存の知識は、獲得した新しい単語のブートストラップと制約に役立つ可能性があるものの、これまでのモデルには組み込まれていなかった。第三に、視線は人間と機械の会話において異なる機能を果たす可能性があります。一部の視線ストリームは音声ストリームと密接に結合しておらず、単語獲得に悪影響を及ぼす可能性があります。会話の文脈に基づいて、密接に結合した音声・視線ストリームを自動的に認識することが重要です。これらの問題に対処するため、ユーザーの言語行動、ドメイン知識、会話の文脈を単語獲得に組み込む新しいアプローチを開発しました。これらのアプローチを、仮想世界における状況依存対話の文脈で評価しました。実験結果から、上記の3種類の文脈情報を組み込むことで、単語獲得のパフォーマンスが大幅に向上することが示されました。

Join-Graph Propagation Algorithms

Join-Graph Propagation Algorithms / 結合グラフ伝播アルゴリズム

The paper investigates parameterized approximate message-passing schemes that are based on bounded inference and are inspired by Pearls belief propagation algorithm (BP). We start with the bounded inference mini-clustering algorithm and then move to the iterative scheme called Iterative Join-Graph Propagation (IJGP), that combines both iteration and bounded inference. Algorithm IJGP belongs to the class of Generalized Belief Propagation algorithms, a framework that allowed connections with approximate algorithms from statistical physics and is shown empirically to surpass the performance of mini-clustering and belief propagation, as well as a number of other state-of-the-art algorithms on several classes of networks. We also provide insight into the accuracy of iterative BP and IJGP by relating these algorithms to well known classes of constraint propagation schemes.

本論文では、パールのビリーフプロパゲーションアルゴリズム（BP）に着想を得た、境界付き推論に基づくパラメータ化された近似メッセージパッシング方式について考察します。まず境界付き推論ミニクラスタリングアルゴリズムについて述べ、次に反復と境界付き推論を組み合わせた反復方式である反復結合グラフ伝播（IJGP）について考察します。IJGPアルゴリズムは、統計物理学の近似アルゴリズムとの連携を可能にするフレームワークである一般化ビリーフプロパゲーションアルゴリズムのクラスに属し、ミニクラスタリングやビリーフプロパゲーション、そして様々なネットワーククラスにおける他の最先端アルゴリズムの性能を上回ることが実証されています。また、反復BPとIJGPアルゴリズムを既知の制約伝播方式のクラスと関連付けることで、これらのアルゴリズムの精度に関する知見も提供します。

An Investigation into Mathematical Programming for Finite Horizon Decentralized POMDPs

An Investigation into Mathematical Programming for Finite Horizon Decentralized POMDPs / 有限時間領域分散型POMDPのための数理計画法の調査

Decentralized planning in uncertain environments is a complex task generally dealt with by using a decision-theoretic approach, mainly through the framework of Decentralized Partially Observable Markov Decision Processes (DEC-POMDPs). Although DEC-POMDPS are a general and powerful modeling tool, solving them is a task with an overwhelming complexity that can be doubly exponential. In this paper, we study an alternate formulation of DEC-POMDPs relying on a sequence-form representation of policies. From this formulation, we show how to derive Mixed Integer Linear Programming (MILP) problems that, once solved, give exact optimal solutions to the DEC-POMDPs. We show that these MILPs can be derived either by using some combinatorial characteristics of the optimal solutions of the DEC-POMDPs or by using concepts borrowed from game theory. Through an experimental validation on classical test problems from the DEC-POMDP literature, we compare our approach to existing algorithms. Results show that mathematical programming outperforms dynamic programming but is less efficient than forward search, except for some particular problems.The main contributions of this work are the use of mathematical programming for DEC-POMDPs and a better understanding of DEC-POMDPs and of their solutions. Besides, we argue that our alternate representation of DEC-POMDPs could be helpful for designing novel algorithms looking for approximate solutions to DEC-POMDPs.

不確実な環境における分散計画は、主に分散型部分観測マルコフ決定プロセス（DEC-POMDP）の枠組みを用いた意思決定理論的アプローチを用いて一般的に扱われる複雑なタスクです。DEC-POMDPは汎用的で強力なモデリングツールですが、その解決は二重指数関数的になる可能性のある圧倒的な複雑さを伴うタスクです。本稿では、ポリシーのシーケンス形式表現に依存するDEC-POMDPの代替定式化を検討します。この定式化から、解くとDEC-POMDPの正確な最適解を与える混合整数線形計画（MILP）問題を導出する方法を示します。これらのMILPは、DEC-POMDPの最適解のいくつかの組合せ特性を使用するか、ゲーム理論から借用した概念を使用することによって導出できることを示します。DEC-POMDPの文献からの古典的なテスト問題に対する実験的検証を通じて、既存のアルゴリズムと私たちのアプローチを比較します。結果は、数理計画法が動的計画法よりも優れているものの、いくつかの特定の問題を除いて、前向き探索よりも効率が悪いことを示しています。本研究の主な貢献は、DEC-POMDPに数理計画法を使用し、DEC-POMDPとその解をより深く理解したことです。さらに、DEC-POMDPの代替表現が、DEC-POMDPの近似解を探す新しいアルゴリズムの設計に役立つ可能性があると主張します。

Training a Multilingual Sportscaster: Using Perceptual Context to Learn Language

Training a Multilingual Sportscaster: Using Perceptual Context to Learn Language / 多言語スポーツキャスターの訓練：知覚的文脈を用いた言語学習

We present a novel framework for learning to interpret and generate language using only perceptual context as supervision. We demonstrate its capabilities by developing a system that learns to sportscast simulated robot soccer games in both English and Korean without any language-specific prior knowledge. Training employs only ambiguous supervision consisting of a stream of descriptive textual comments and a sequence of events extracted from the simulation trace. The system simultaneously establishes correspondences between individual comments and the events that they describe while building a translation model that supports both parsing and generation. We also present a novel algorithm for learning which events are worth describing. Human evaluations of the generated commentaries indicate they are of reasonable quality and in some cases even on par with those produced by humans for our limited domain.

知覚的コンテキストのみを教師として用いて、言語を解釈および生成することを学習するための新しいフレームワークを紹介します。言語固有の事前知識なしに、英語と韓国語の両方でシミュレートされたロボットサッカーの試合をスポーツキャストすることを学習するシステムを開発することにより、その機能を実証します。トレーニングでは、説明的なテキストコメントのストリームとシミュレーショントレースから抽出された一連のイベントで構成される曖昧な教師のみを使用します。システムは、構文解析と生成の両方をサポートする翻訳モデルを構築しながら、個々のコメントとそれらが説明するイベントとの対応を同時に確立します。また、どのイベントを記述する価値があるかを学習するための新しいアルゴリズムも紹介します。生成された解説に対する人間による評価では、それらの品質は妥当であり、場合によっては、私たちの限られた領域で人間が作成した解説と同等であることが示されています。

Predicting the Performance of IDA* using Conditional Distributions

Predicting the Performance of IDA* using Conditional Distributions / 条件付き分布を用いたIDA*のパフォーマンス予測

Korf, Reid, and Edelkamp introduced a formula to predict the number of nodes IDA* will expand on a single iteration for a given consistent heuristic, and experimentally demonstrated that it could make very accurate predictions. In this paper we show that, in addition to requiring the heuristic to be consistent, their formula’s predictions are accurate only at levels of the brute-force search tree where the heuristic values obey the unconditional distribution that they defined and then used in their formula. We then propose a new formula that works well without these requirements, i.e., it can make accurate predictions of IDA*’s performance for inconsistent heuristics and if the heuristic values in anylevel do not obey the unconditional distribution. In order to achieve this we introduce the conditional distribution of heuristic values which is a generalization of their unconditional heuristic distribution. We also provide extensions of our formula that handle individual start states and the augmentation of IDA* with bidirectional pathmax (BPMX), a technique for propagating heuristic values when inconsistent heuristics are used. Experimental results demonstrate the accuracy of our new method and all its variations.

Korf、Reid、およびEdelkampは、与えられた一貫性のあるヒューリスティックに対してIDA*が1回の反復で拡張するノードの数を予測する式を導入し、実験的に非常に正確な予測ができることを実証しました。本論文では、ヒューリスティックが一貫していることを要求することに加え、ヒューリスティックの値が彼らが定義して式で使用した無条件分布に従うブルートフォース検索ツリーのレベルでのみ、式の予測が正確であることを示します。次に、これらの要件がない場合でも適切に機能する新しい式を提案します。つまり、一貫性のないヒューリスティックの場合や、任意のレベルのヒューリスティック値が無条件分布に従わない場合でも、IDA*のパフォーマンスを正確に予測できます。これを実現するために、無条件ヒューリスティック分布の一般化であるヒューリスティック値の条件付き分布を導入します。また、個々の開始状態を扱うための公式の拡張と、矛盾するヒューリスティックが使用される場合にヒューリスティック値を伝播する手法である双方向パスマックス(BPMX)によるIDA*の拡張も提供します。実験結果は、この新しい手法とそのすべてのバリエーションの精度を実証します。

Reasoning About the Transfer of Control

Reasoning About the Transfer of Control / 制御権の移行に関する推論

We present DCL-PC: a logic for reasoning about how the abilities of agents and coalitions of agents are altered by transferring control from one agent to another. The logical foundation of DCL-PC is CL-PC, a logic for reasoning about cooperation in which the abilities of agents and coalitions of agents stem from a distribution of atomic Boolean variables to individual agents — the choices available to a coalition correspond to assignments to the variables the coalition controls. The basic modal constructs of DCL-PC are of the form `coalition C can cooperate to bring about phi’. DCL-PC extends CL-PC with dynamic logic modalities in which atomic programs are of the form `agent i gives control of variable p to agent j’; as usual in dynamic logic, these atomic programs may be combined using sequence, iteration, choice, and test operators to form complex programs. By combining such dynamic transfer programs with cooperation modalities, it becomes possible to reason about how the power of agents and coalitions is affected by the transfer of control. We give two alternative semantics for the logic: a `direct’ semantics, in which we capture the distributions of Boolean variables to agents; and a more conventional Kripke semantics. We prove that these semantics are equivalent, and then present an axiomatization for the logic. We investigate the computational complexity of model checking and satisfiability for DCL-PC, and show that both problems are PSPACE-complete (and hence no worse than the underlying logic CL-PC). Finally, we investigate the characterisation of control in DCL-PC. We distinguish between first-order control — the ability of an agent or coalition to control some state of affairs through the assignment of values to the variables under the control of the agent or coalition — and second-order control — the ability of an agent to exert control over the control that other agents have by transferring variables to other agents. We give a logical characterisation of second-order control.

DCL-PCは、エージェントおよびエージェントの連合の能力が、あるエージェントから別のエージェントに制御を移すことによってどのように変化するかを推論するための論理です。DCL-PCの論理的基盤はCL-PCです。CL-PCは、エージェントおよびエージェントの連合の能力が個々のエージェントへのアトミックなブール変数の分配に由来する、協力についての推論のための論理です。連合が利用できる選択肢は、連合が制御する変数への割り当てに対応します。DCL-PCの基本的な様相構成は、「連合Cは協力してphiをもたらすことができる」という形式です。DCL-PCは、アトミックプログラムが「エージェントiが変数pの制御をエージェントjに渡す」という形式である動的論理形式によってCL-PCを拡張します。動的論理ではよくあるように、これらの原子プログラムは、シーケンス、反復、選択、およびテスト演算子を使用して組み合わせることで、複雑なプログラムを作成できます。このような動的転送プログラムを協力様式と組み合わせることで、エージェントと連合の力が制御の転送によってどのように影響を受けるかを推論できるようになります。私たちは、この論理に対して2つの代替的な意味論を提示します。1つは、エージェントへのブール変数の分布を捉える「直接的な」意味論、もう1つは、より一般的なクリプキ意味論です。これらの意味論が同等であることを証明し、次にこの論理の公理化を提示します。DCL-PCのモデル検査と充足可能性の計算複雑性を調査し、両方の問題がPSPACE完全であることを示します（したがって、基礎となる論理CL-PCよりも悪くはありません）。最後に、DCL-PCにおける制御の特性を調査します。我々は、一次制御（エージェントまたは連合が、自身の制御下にある変数に値を割り当てることで、何らかの事態を制御する能力）と二次制御（エージェントが変数を他のエージェントに移転することで、他のエージェントが有する制御を制御する能力）を区別します。二次制御の論理的特徴付けを示す。

Multiattribute Auctions Based on Generalized Additive Independence

Multiattribute Auctions Based on Generalized Additive Independence / 一般化加法独立性に基づく多属性オークション

We develop multiattribute auctions that accommodate generalized additive independent (GAI) preferences. We propose an iterative auction mechanism that maintains prices on potentially overlapping GAI clusters of attributes, thus decreases elicitation and computational burden, and creates an open competition among suppliers over a multidimensional domain. Most significantly, the auction is guaranteed to achieve surplus which approximates optimal welfare up to a small additive factor, under reasonable equilibrium strategies of traders. The main departure of GAI auctions from previous literature is to accommodate non-additive trader preferences, hence allowing traders to condition their evaluation of specific attributes on the value of other attributes. At the same time, the GAI structure supports a compact representation of prices, enabling a tractable auction process. We perform a simulation study, demonstrating and quantifying the significant efficiency advantage of more expressive preference modeling. We draw random GAI-structured utility functions with various internal structures, generate additive functions that approximate the GAI utility, and compare the performance of the auctions using the two representations. We find that allowing traders to express existing dependencies among attributes improves the economic efficiency of multiattribute auctions.

一般化加法独立（GAI）選好を考慮した多属性オークションを開発します。本稿では、重複する可能性のあるGAI属性クラスターの価格を維持する反復オークション機構を提案します。これにより、誘導と計算負荷が軽減され、多次元領域において供給者間のオープンな競争が創出されます。最も重要な点は、このオークションは、トレーダーの合理的な均衡戦略の下で、小さな加法係数まで最適な福祉に近い余剰を達成することが保証されている点です。GAIオークションが従来の研究と大きく異なる点は、トレーダーの非加法的な選好を考慮し、トレーダーが特定の属性の評価を他の属性の価値に基づいて条件付けることができる点です。同時に、GAI構造は価格の簡潔な表現をサポートし、扱いやすいオークションプロセスを可能にします。本稿ではシミュレーション研究を行い、より表現力豊かな選好モデリングの顕著な効率性の利点を実証し、定量化します。様々な内部構造を持つランダムなGAI構造効用関数を描画し、GAI効用を近似する加法関数を生成し、2つの表現を用いたオークションのパフォーマンスを比較します。トレーダーが属性間の既存の依存関係を表現できるようにすることで、多属性オークションの経済効率が向上することがわかった。

Mechanisms for Multi-Unit Auctions

Mechanisms for Multi-Unit Auctions / 複数ユニットオークションのメカニズム

We present an incentive-compatible polynomial-time approximation scheme for multi-unit auctions with general k-minded player valuations. The mechanism fully optimizes over an appropriately chosen sub-range of possible allocations and then uses VCG payments over this sub-range. We show that obtaining a fully polynomial-time incentive-compatible approximation scheme, at least using VCG payments, is NP-hard. For the case of valuations given by black boxes, we give a polynomial-time incentive-compatible 2-approximation mechanism and show that no better is possible, at least using VCG payments.

一般k-マインドプレイヤー評価を持つマルチユニットオークションのための、インセンティブ両立型多項式時間近似スキームを提示します。このメカニズムは、適切に選択された可能な割り当てのサブレンジ全体にわたって完全に最適化し、その後、このサブレンジにおいてVCG支払いを使用します。少なくともVCG支払いを用いた場合、完全に多項式時間でインセンティブ両立型近似スキームを得ることはNP困難であることを示す。ブラックボックスによって評価が与えられるケースについては、多項式時間でインセンティブ両立型2近似メカニズムを提示し、少なくともVCG支払いを用いた場合、これより優れた近似スキームは不可能であることを示す。

Interactive Cost Configuration Over Decision Diagrams

Interactive Cost Configuration Over Decision Diagrams / 決定図における対話型コスト設定

In many AI domains such as product configuration, a user should interactively specify a solution that must satisfy a set of constraints. In such scenarios, offline compilation of feasible solutions into a tractable representation is an important approach to delivering efficient backtrack-free user interaction online. In particular,binary decision diagrams (BDDs) have been successfully used as a compilation target for product and service configuration. In this paper we discuss how to extend BDD-based configuration to scenarios involving cost functions which express user preferences.We first show that an efficient, robust and easy to implement extension is possible if the cost function is additive, and feasible solutions are represented using multi-valued decision diagrams (MDDs). We also discuss the effect on MDD size if the cost function is non-additive or if it is encoded explicitly into MDD. We then discuss interactive configuration in the presence of multiple cost functions. We prove that even in its simplest form, multiple-cost configuration is NP-hard in the input MDD. However, for solving two-cost configuration we develop a pseudo-polynomial scheme and a fully polynomial approximation scheme. The applicability of our approach is demonstrated through experiments over real-world configuration models and product-catalogue datasets. Response times are generally within a fraction of a second even for very large instances.

製品構成など、多くのAI分野において、ユーザーは一連の制約を満たす解を対話的に指定する必要があります。このようなシナリオでは、実行可能解を扱いやすい表現にオフラインでコンパイルすることが、オンラインで効率的かつバックトラックフリーなユーザーインタラクションを実現するための重要なアプローチとなります。特に、二項決定図（BDD）は、製品およびサービス構成のコンパイル対象として効果的に利用されてきました。本稿では、BDDベースの構成を、ユーザーの嗜好を表すコスト関数を含むシナリオに拡張する方法について考察します。まず、コスト関数が加法的で、実行可能解が多値決定図（MDD）を用いて表現されている場合、効率的で堅牢かつ実装が容易な拡張が可能であることを示します。また、コスト関数が非加法的である場合、またはコスト関数がMDDに明示的にエンコードされている場合のMDDサイズへの影響についても考察します。次に、複数のコスト関数が存在する場合の対話型構成について考察します。入力MDDにおいて、最も単純な形式であっても、複数コスト構成はNP困難であることを証明します。しかし、2コスト構成を解決するために、擬似多項式スキームと完全多項式近似スキームを開発しました。このアプローチの適用可能性は、実際の構成モデルと製品カタログデータセットでの実験によって実証されています。応答時間は通常、非常に大規模なインスタンスであっても数分の1秒以内です。

参考文献

Journal of Artificial Intelligence Resarch Vol. 37 (2010)

論文