SBIR Proposal Summary on The Agency

 

The Lexikos Agency implements a three-part model of English language processing by its contained intelligent agents.  Each such agent functionally integrates:

 

(A)    The Lexikos Transcriber (now existing) as its textual input module;

(B)    A web-based Generator for text paragraphs (and speech/GUI) outputs;

(C)    A Context module using text content to refine a collection of PLANS.

 

The Context module defines the logical DOMAIN (AKA microworld) the input and output paragraphs discuss.  Each agent within The Agency gets its own evolving instance, which models and uses a hierarchy of partial-order PLANS featuring:

 

(D)    At each node in the hierarchy, a MISSION, an AGENT (soldier, device, military unit…), and a dynamic STATE-SPACE (battle zone, supply depot, classroom…) in which the modeled AGENT seeks to achieve its MISSION.

 

(E)    Verb-centric OPs (events and processes) modeled under Lexikos’ INGLISH ontology.  Each OP cites expected STATE changes of its verb’s complements using PRECONDITION, EFFECT, and TENSE models.  Each OP may be declared an AGENT-specific primitive, or decomposed into a new sub-PLAN.

 

(F)    A PLANNING-LANGUAGE based on FOPC, defined in layers.  All agents share the core layer, kept simple by design.  A related planning algorithm tries to refine each partial PLAN using declared OPS plus “casual-link” sequencing constraints, aided by operator inputs.  Optionally, real-time perceptions can be supported in extension layers. (Many are possible).

 

Transcriber exploits the PLAN-base as a flexible but robust model of “what is being discussed”.  Generator uses it as AGENT to plan English outputs.  The integration of Agency emphasizes shared usage of these other system elements:

 

(G)    Vocabulary: STATE-SPACE literals, OP models, and other objects in Context help each agent link relevant meanings to TERMS in a core lexicon, which is otherwise too large.  (The Agency’s base vocabulary holds 8,000 core roots plus Roget’s Thesaurus; each agent independently extends it.)

 

(H)    Discourse model: a FOCUS mechanism tracks topics in Transcriber output charts, links them to PLAN elements, and relates them to abstract speech-acts via coherence relations.  They tell the agent what each operator input sought to accomplish, so Generator can properly respond to it.

 

(I)    Speaker and Listener models: an agent and the human operator typically play these fluent roles, but other bindings may occur inside PLANS.  Good models of Speaker/Listener goals and speech acts are vital for coherent dialogs, and even to handle anaphora.  Such models persist in Context.

 

(J)    Evolving visual maps of Context: session-based web I/O specific to each dialog lets the operator monitor agent data structures to confirm that what her agent “understands” is indeed what she intended.  This I/O channel also aids review of each PLAN as it develops or afterwards.

 

This functionality for The Agency is all packed into one portable Java Web App, to which operators may post English paragraphs and get replies.  As each dialog progresses, a PLAN-base in FOPC evolves as a side effect, to be serialized as desired by the operator into metadata (RDF, etc.) that saves all recent changes.


                 Appendix 1: Static Extensions Now Planned

 

The theme of an agent planning a hierarchy of sequenced tasks within a limited domain has been studied in AI classrooms for decades.  Combining it with English I/O is nearly as old, as I saw first from Winograd, the MIT teaching assistant in my ‘72 AI 1.01 course.  His SHRDLU design got me “hooked” on AI and NLP. 

 

Improved planning design patterns now exist.  Each needs special-case handling, especially in a Parser/Planner hybrid like The Agency.  They will extend its core PLANNING LANGUAGE – adding special-purpose English vocabulary and FOCP logic, plus related operator training.  To help extensions rapidly develop, we plan an “open source” model for their software R&D.  Targeted options include:

 

1)    Complex end-states within the basic OP model:  These involve competing core planning algorithms, each with specific refinements in goal decomposition, sub-task ordering, sequential constraints, conditional effects, disjunctive or negated goals, universal quantifiers, theorem-proving and other aspects of FOPC often mysterious to those not expert in logic programming.

 

2)    Time and space models: expertise in them arises more often in NLP work, where verb tenses and prepositions need detailed exploration.  Microworlds on both have developed able to support optional features for all OPs that need them, and pave the way for treating time and space as resources inside PLANs.

 

3)    Uncertainty: planning strategies using coercion, abstraction, aggregation, and “intel” GOALS can reduce this effect.  But for simple agent, any notion of uncertainty could be a flaw of overdesign, so we view it as an optional extension.  Typical PLANs should come from agent reasoning, not guessing.

 

4)    Belief networks: rules handling beliefs on (a)typical class members may be needed to support common NLP parsing heuristics.  More complex extensions based on Bayesian reasoning, or other statistics-linked logics can probably co-exist in some PLANs, but will demand special training and vocabulary.

 

5)    Utility Theory: complex preferences can be handled with models of money and economics; depletion of AGENT resources; and the costs of OPs based on simple formulae.  Such extensions seem unneeded in the PLANs for most simple AGENTS.  Complex AGENTS (e.g., teams) might use them, but details of such models seem likely to required AGENT-specific rules.  Open source coding allows that.

 

Extensions such as (1-5) will augment or replace the core planning mechanisms of The Agency, so that more powerful types of PLANs can be discussed and refined by an operator tapping the needed training and specialized technical vocabulary.

 

Even with all the above, however, any PLAN emerging from such a dialog would be static – essentially frozen at design time, long before the AGENT who carries it out starts work on the details.  Such plans may be helpful, but each can only be applied by passing the AGENT specific parameters, then watching it execute.

 

Interactions with the real world as AGENTS execute can make other types of PLAN more powerful.  The Agency seeks also to support these dynamic PLANs, which can exploit real-time inputs and optimize their AGENTS’ behaviors accordingly.

 

Extensions that address such dynamic affects are discussed separately, as they require a PLANNING process more interleaved with PLAN execution, able to monitor what happens and/or change PLANs based on unexpected events in each environment, to more flexibly optimize overall AGENT performance.


              Appendix 2: Dynamic Extensions for Active AGENT I/0

 

These extensions let a PLAN arise which incorporates future decisions made after the AGENT starts working on its tasks.  They may be based on perceptions of each STATE-SPACE after it is accessible; or on bulk data coming in from other AGENTs under fixed a communication PLAN; or on new linguistic rules and info from any operator of The Agency with the rights to add them via Transcriber inputs.

 

This dynamic mode is what most people envision for “intelligent agents”.  Key extensions to help handle it are below, simple and complex.  Either way, they effectively make The Agency a smart interpreter for very high-level scripts:

 

1)    Changeable AGENT Resources: with vocabulary for measures and fluents, each PLAN can include contingency sub-tasks chosen per real-time inputs. (Even Boolean inputs can help).  E.g., “If you still have adequate resources, move on to your next planned task; else first solve the resource problem.”

 

2)    Operator advice DURING execution (perhaps via PDA-like I/O channels):  Such extensions let qualitative human judgments on STATES replace measurements.  E.g., “The target structure appears empty; so now what do we do?”

 

3)    Deferred sub-PLANs: reliably built DURING execution of a MISSION, these let many tasks be planned only when needed (versus exhaustively, all in advance).  E.g. “Hey! I found one still operating.  What should I do with it?”

 

4)    Non-monotonic reasoning if assumptions later prove wrong.  (Must be used with care, as standard FOPC rules may do not relate well).  E.g, “Okay, I got to the radio as we planned, but it’s busted.  So now what?”

 

5)    Replanning based on new MISSIONS: they may adjust key elements, yet reuse most of a PLAN to save time and avoid starting from scratch.  (E.g. “We must abort.  The men need a quick exit or they’ll be trapped by that fire”.)

 

6)    Metadata communication with similar AGENTS, arranged by passing bulk, low-level XML files.  E.g. “Send unit #2 a dump of our current situation.”

 

7)    Linguistic communication with similar AGENTS.  E.g. “Ask unit #2 what help they can offer us on this unexpected problem.”

 

Important: such extensions are incremental, and must be.  To create them, one first needs the static, lower-level parts of a Transcriber, a Generator, and a Context to be extended, plus a compatible, pre-constructed PLAN-base defining the MISSION-specific vocabulary, facts, and rules required to quickly absorb real-time data into the current Context and react to it semi-intelligently.

 

Without such a baseline, scenarios like those above – and the practical data- fusion and embedded-training scenarios they can jointly support – become wishful thinking and pipe dreams.  If real MISSIONS are to include such scenarios, solid tools must arise first, able to help out on the bulk data entry of sub-PLANs. 

 

The Agency will provide that help, by refactoring the Lexikos Transcriber to add partial parsing and web I/O; then adding a Context based on FOPC, planning and INGLISH verb models.  We will then solicit extensions and a (voice) Generator to enhance the core, using help and advice from other AI experts and “open source” models of extension R&D, so that many smart people can semi-independently work in compatible ways.  Please join us in refining and implementing this PLAN.