Project financed by: |
|
Development of Generic Resources for Language Technology
|
|
|
Project prototypes
Language Technology research activities at NLPLAB, the Natural
Language Processing Laboratory at IDA, Linköping university, are
centered around two major areas of language technology research:
natural language interaction and information extraction from natural
language texts. Other activities include system design and methods for
development of language technology software.
Software developed within both these areas are to be included in the
first code resource library. At present we have the following running
research prototypes that we wish to make more flexible and robust and place
in the library:
- JavaChart: A chart parser for interpretation of natural
language. Currently we are rewriting and extending a lisp-based
chart parser prototype called Flexchart to Java. The new system
JavaChart will be more compliant with other new network-based
systems than its predecessor and have a more flexible interface,
easily integrated with a user interface as well as with a dialogue
engine.
- DG: A compiler written in Java for so-called dialogue grammars
that handles high-level control of the dialogue interface of a dialogue
system. Future work include, for instance, extending the user
support and making DG more clearly separated from its main use today as
a help-tool for the dialogue manager module in the MALIN framework.
- DialogueHead: A Java-module for high-level control concerning
dialogue aspects of a visually speaking head. Currently the module
supervises the dialogue behavior of an animated head (a system called
Orator) and synthesis-module from Telia Research (a system called Telia
Text-Till-Tal, or T4).
- MALIN: an object-oriented framework in Java that supports
construction of complete speech-based dialogue
systems. The MALIN framework has evolved
over a number of research projects and today consists of the following
components:
- Interpretation Manager: a Java-module that supervises the
interpretation and transformation of incoming speech into a
syntactic interpretation-structure. The software needs more work in
various ways for the module to become a robust
application-independent system.
- Dialogue Manager (DM): The kernel module of the MALIN
framework kernel is the Dialogue Manager which
interprets utterances in context and directs the
dialogue. The Dialogue Manager is now in a third version,
written in Java. Major refinements involves separating
generic parts from more domain specific and
also techniques to make it more robust, to handle for instance
unknown situations better and more uniformly.
- Domain Knowledge Manager: This is a new module
developed for dialogue interfaces, aimed at handling various
types of domain knowledge utilised in a dialogue
interface. The Domain Knowledge
Manager is based on the Open Agent Architecture and is,
thus, suitable for network based software development.
- Linköping Word Aligner(LWA): A system, that can be used for processing
bilingual texts, the one a translation of the other. It links
(i.e. align) words, phrases and tokens, that are the translations of
each other in the two texts, aligning more than 50% of the
texts.
- Frasse: A module for identifying phrases and multi-word terms
from a raw text. Currently, it can handle Swedish and English texts.
The module can for instance be used for extracting multi-word units
for bi-lingual word alignment and terminology
management.
Most recently the MALIN framework was utilised in Miina, a joint
project between Nokia Home Communications and the Natural Language
Processing Laboratory (NLPLAB) at HCS, Linköping University. In this
project a natural language dialogue system was developed with
information on TV programs to be used in the Nokia Media
Terminal.
We are also currently involved in a dialogue systems project (SCIN)
together with Telia Research AB where we are utilising a synthetic
talking head as output modality. In this project we have further
developed the software that controls the talking head (Orator) and the
speech synthesis module (T4) by introduction of a new level of control
in the new module DialogueHead. The new module also makes the system
network-based, i.e. other modules in the Dialogue System utilise the
synthesis resources remotely as services running via the Internet.
Research prototypes for text extraction are currently used in a joint
project with IDA Infront and Riksskatteverket, as another part of the
Miina project, where they are to be used, together with the MALIN
framework, to develop structured databases from unstructured texts and
then used by the dialogue interface through the Domain Knowledge
Manager.
|
Page responsible: Webmaster
Last updated: 2012-05-07