TCC Division    
 Cognitive and Communication
Via Sommarive, 18 I-38050
Povo-Trento, Italy
 Technologies
  You Are Here: HomePublications1999

Publications
 
1999 (next)(back)

The following is a list of articles, conference papers and reports authored or co-authored by researchers of the Cognitive and Communication Technologies division at ITC-irst (Istituto per la Ricerca Scientifica e Tecnologica). Entries refer to papers published internally or which have appeared in refereed media - in particular, journals, conference proceedings, and books - in 1999.
The publications in the list are organized alphabetically by the first author.

Articles, Conference Papers, Books

  • Bagnasco C., Cappelli A., Magnini B., Zamatteo D., 'Natural Language Access to Public Adiministration Data: the TAMIC-P System', in Lamma E., Mello P. (eds.) AI*IA 99: Advances in Artificial Intelligence , Lecture Notes on Artificial Intelligence 1792, Springer-Verlag, 2000, pp. 249-260.

    The main goal of the TAMIC-P project is to demonstrate the opportunities offered by the use of Natural Language Processing technologies in the human-machine interaction, in particular related to data access in complex environments. The Natural Language interface has been proposed as a modality of access complementary to other techniques, such as graphical interfaces. It shows its power when used in scenarios characterised by a relevant number of distributed information sources, in which current interfaces do not offer appropriate solutions to the complexity of data handling, and often generate difficulties in navigation. These problems turns out to become even more critical in the presence of not skilled users, having low technical knowledge. Using Natural Language, as normally used between persons for communicating, reduces the skill requirements and enhances the system usability. The evaluation tests performed on the TAMIC-P system confirm this point, showing that upon just a short training, a non-skilled operator is able to operate with effectiveness.

    Ref. No. 9912-86

  • Bagnasco C., Cappelli A., Magnini B., Zamatteo D., 'Accesso in linguaggio naturale ai dati della Pubblica Amministrazione: il sistema TAMIC-P', in Proceedings of the Sesto Congresso della Associazione Italiana per l'Intelligenza Artificiale, [AIIA-99], Pitagora, Bologna, 1999.

    L'obiettivo principale del progetto TAMIC-P è dimostrare le opportunità offerte dall'utilizzo di tecniche di Elaborazione del Linguaggio Naturale nell'interazione uomo-macchina, in particolare nell'accedere a dati in ambienti complessi. L'interfaccia in Linguaggio Naturale è proposta come modalità di accesso complementare ad altre tecniche, quali le interfacce basate su grafica, e dimostra tutta la sua potenzialità in scenari caratterizzati dalla presenza di un numero rilevante di informazioni distribuite. In questi scenari le attuali interfacce non forniscono soluzioni soddisfacenti ai problemi di complessità e di disorientamento nella navigazione. Problemi che risultano ancor più critici in presenza di utenti poco esperti, in possesso di minori conoscenze tecniche. L'impiego di una modalità come il linguaggio naturale, usata normalmente nella comunicazione tra persone, e pertanto più naturale, riduce i requisiti di competenza e rende il sistema facilmente utilizzabile. I test di valutazione realizzati sul sistema TAMIC-P confortano questa ipotesi, confermando che già dopo un breve training anche un operatore poco esperto è in grado di operare in modo efficace e completo.

    Ref. No. 9912-90

  • Bentivogli L., 'Relazioni lessicali e semantiche nella costruzione di un lessico computazionale multilingue: Problematiche Tecniche e Filosofiche', Tesi di Laurea in Filosofia del Linguaggio, Università degli Studi di Bologna, Facoltà di Lettere e Filosofia, Anno Accademico 1997-1998. Bologna, Italy, March, 1999.

    Ref. No.: 9912-112

  • Bentivogli L., Pianta E., 'Looking for Lexical Gaps', December 1999, in Proceedings of the Ninth EURALEX International Congress, Stuttgart, Germany, August 8 - 12, 2000.

In this paper we present the results of a quantitative evaluation of the discrepancies between the Italian and English lexica in terms of lexical gaps. This evaluation has been carreid out in the context of MultiWordNet, an ongoing project that aims at building a multilingual lexical database. Unlike EuroWordNet, MultiWordNet preserves most of the semantic relations available in the Princeton WordNet. The quantitative evaluation of the English-to-Italian lexical gaps shows that the English and Italian lexica are highly comparable and give empirical support to the MultiWordNet model.

Ref. No. 9912-110

  • Bentivogli L., Pianta E., Pianesi F., 'Coping with Lexical Gaps when Building Aligned Multilingual Wordnets', December 1999, 5 pp., in Proceedings of the Second International Conference on Language Resources and Evaluation [LREC 2000], Athens, Greece, May 31 - June 2, 2000.

    In this paper we present a methodology that automatically classifies the translation equivalents of a machine readable bilingual dictionary in three sets: lexical units, lexical gaps and translation equivalents that need to be manually classified as lexical units or lexical gap. This preventive classification minimizes the manual work necessary to cope with lexical gaps in the construction of aligned multilingual wordnets.

    Ref. No. 9912-91

  • Bianchi A., Zancanaro M., 'Tracking users'movements in an artistic physical space', in Proceedings of the i³ Annual Conference, European Network for Intelligent Information Interfaces, Siena, Italy, October 20-22, 1999, pp. 103-106.

    HIPS studies new technologies and interaction modalities that allow people to navigate both a physical space and a related information space at the same time. In this paper we briefly describe the steps from detecting a set of infrared sensors to the building of an internal representation of the visitor behaviour and discuss how this knowledge is employed to build a personalised presentation. In HIPS the Physical Organisation Knowledge Base generalises the notion of space relying on a notion of naive geometry rather the usual Cartesian geometry. A Visiting Style Module estimates the degree of compatibility between the user's movement patterns and four categories. Hence the Presentation Composer builds a new presentation selecting from an existing repository of audio files using knowledge about the visitor's area/orientation in the Physical Organisation Knowledge Base, while information about the Visiting Style is exploited to collect specific information to be delivered to the user.

    Ref. No. 9912-92

  • Cavaglià G., 'Estrazione di Informazione da testi e WordNet: aspetti pratici e applicativi in un dominio specifico', Thesis, December 1999, Tesi di Laurea in Filosofia, Università degli Studi di Torino, Facoltà di Lettere e Filosofia, Anno Accademico 1997-1998

    Ref. No. 9912-113

  • Cavaglià G., 'The Development of Lexical Resources for Information Extraction from Text Combining WordNet and Dewey Decimal Classification', in Proceedings of the Ninth Conference of the European Chapter of the Association for Computational Linguistics [EACL'99], Bergen, Norway, June 8-12, 1999.

    Lexicon definition is one of the main bottlenecks in the development of new applications in the field of Information Extraction from text. Generic resources (e.g. lexical database) are promising for reducing the cost of specific lexica definition, but they introduce lexical ambiguity. This paper proposes a methodology for building application-specific lexica by using WordNet. Lexical ambiguity is kept under control by marking synsets in WordNet with field labels taken from the Dewey Decimal Classification.

    Ref. No. 9905-04

  • Ciravegna F., Lavelli A., 'Full Text Parsing using Cascades of Rules: an Information Extraction Perspective', in Proceedings of 9th Conference of the European Chapter of the Association for Computational Linguistics [EACL'99], Bergen, Norway, June 8-12, 1999.

    This paper proposes an approach to full parsing suitable for Information Extraction from texts. Sequences of cascades of rules deterministically analyze the text, building unambiguous structures. Initially basic chunks are analyzed; then argumental relations are recognized; finally modifier attachment is performed and the global parse tree is built. The approach was proven to work for three languages and different domains. It was implemented in the IE module of FACILE, a EU project for multilingual text classification and IE.

    Ref. No.9905-06

  • Ciravegna F., Lavelli A., 'Grammar Organization for Cascade-Based Parsing in Information Extraction', December 1999, in Proceedings of the Sixth International Workshop on Parsing Technologies [IWPT 2000], Trento, Italy, February 23-25, 2000.

    Ref. No.9912-93

  • Ciravegna F., Lavelli A., Mana N., Matiasek J., Gilardoni L., Mazza S., Ferraro M., Balck W.J., Rinaldi F., Mowatt D., 'FACILE: Classifying Text Integrating Pattern Matching and Information Extraction', May 1999, 6 pp., in Proceedings of 16th International Joint Conference on Artificial Intelligence [IJCAI'99], Stockholm, Sweden, July 31-August 6, 1999.

    Successfully managing information means being able to find relevant new information and to correctly integrate it with pre-existing knowledge. Much information in nowadays stored as multilingual textual data; therefore advanced classification systems are currently considered as strategic components for effective knowledge management. We describe an experience integrating different innovative AI technologies such as hierarchical pattern matching and information extraction to provide flexible multilingual classification adaptable to user needs. Pattern matching produces fairly accurate and fast categorisation over a large number of classes, while information extraction provides fine-grained classification for a reduced number of classes. The resulting system was adopted by the main Italian financial news agency providing a pay-to-view service.

    Ref. No.9905-05

  • Ciravegna F., Lavelli A., Satta G., 'Bringing Information Extraction out of the Labs: the Pinocchio Environment', November 1999, 5pp., in Horn W. (ed.) Proceedings of the 14th European Conference on Artificial Intelligence [ECAI-2000], Berlin, Germany, August 20-25, 2000. IOS Press, Amsterdam, 2000.

    Pinocchio is an environment for developing information extraction applications. New applications and languages can be covered by just writing declarative resources. Information is represented uniformly through the architecture: all the modules use the same input structure and the same type of declarative resources. Modules are implemented via the same basic processors and share a common environment for resource development and debugging. The result is an environment easy to use with limited training and skills.

    Ref. No.9911-09

  • Corazza A., Cettolo M., Lazzari G., Pianta E., Pianesi F., L.M. Tovena, 'The ITC-irst Speech Translation System', in Proceedings of the Annual Workshop of the Associazione Italiana per l'Intelligenza Artificiale (Gruppo di lavoro sul linguaggio naturale), on Elaborazione del linguaggio e riconoscimento del parlato, Trento, Italy, December 16-17, 1999, pp. 30-32.

    The C-STAR consortium recently demonstrated the results achieved in speech-to-speech machine translation by means of a public demo. ITC-irst participated hosting its own demo, playing the role of a tourist who aims at organizing a travel to New York and Hiedelberg. For this purpose, he contacts, via a video-conference, two tourism agencies, the first in the United States (played by the CMU lab) and the other in Germany (Karlsruhe University). In other occasions, ITC-irst also played the role of the agent. Moreover, another demo was played with ETRI, in Korea. The system performs speech-to-speech translation via a kind of (simple) inter-lingua, the Interchange Format (IF): this allowed us to use the same systems while considering different partner laguages (such as English, German, Korean). In fact, each partner in the Consortium provides the translation chains from its own language to the IF and from the IF to its language. In the following, the two chains for Italian are described.

    Ref. No.9912-04

  • De Angeli A., Gerbino W., Nodari E., Petrelli D., 'From Tools to Friends: Where Is the Borderline?', September 1999, to be published in Proceedings of the Workshop on 'Attitude, Personality and Emotions in User-Adapted Interaction', held in conjunction with the Seventh International Conference on User Modeling [UM'99], Banff, Canada, June 20-24, 1999.

    Ref. No. 9909-09

  • Giorgi A., Pianesi F., 'The Double Access Reading and Complementiser Deletion in Italian', in Proceedings of the XVII West Coast Conference on Formal Linguistics [WCCFL XVII], Vancouver, Canada, February 20-22, 1998. CSLI Publications, 1999.

    Ref. No. 9912-94

  • Giorgi A., Pianesi F. 'Ways of Terminating', September 1999, 51 pp., in Cecchetto C., Guasti M.T., Chierchia G. (eds.), At the Interface with Semantics, CSLI Publications, Stanford University, 2000.

    Ref. No. 9909-10

  • Giorgi A., Pianesi F., 'Imperfect Dreams: the Temporal Dependencies of Fictional Predicates', to be published Probus, International Journal of Latin and Romance Linguistics, Muton De Gruyter, Amsterdam, The Netherlands.

    Ref. No. 9912-116

  • Not E., Tovena L., Zancanaro M., 'Bridging Anaphora in Deverbal NPs', in Proceedings of the Sixth Congress of the Italian Association for Artificial Intelligence [AI*IA99], Bologna, Italy, September 15-17, 1999.

    Ref. No. 9912-100

  • Not E., Tovena L., Zancanaro M., 'Positing and Resolving Bridging Anaphora in Deverbal NPs', in Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, [ACL'99],Workshop on The Relationship Between Discourse/Dialogue Structure and Reference , University of Maryland, USA, June 20-26, 1999.

    Ref. No. 9912-99

  • Not E., Zancanaro M., 'Reusing Information Repositories for Flexibly Generating Adaptive Presentations', in Proceedings of IEEE International Conference of Information, Intelligence and Systems, [ICIIS'99], Washington, USA, November 1-3, 1999.

    Ref. No. 9912-101

  • Petrelli D., 'Museums as Theatre: Suggestions for an Emotional Visit', in Proceedings of the International Workshop on Affect in Interactions held in conjunction with the Annual Conference of the EC I3, [AC'99], Siena, Italy, October 21-22, 1999.

    HIPS1 is a hand-held electronic guide that adapts its behaviour to museum visitors. It uses physical movements as a main interaction modality and responds to this implicit interaction with suitable commentaries created on the fly. To achieve more involving presentations two further steps are needed: inference of visitor's emotional status and composition of emotional presentation. Investigations on how visitor behaviour is influenced by external (environment) or internal (attitude) factors and on the effect of emotional cues are currently being carried on at the University of Siena. Nevertheless, when the picture will be clear, a computational model to support emotional presentations will be needed. This paper discusses a first proposal on this more technological aspect.

    Ref. No. 9912-102

  • Petrelli D., Not E., Zancanaro M., 'Getting Engaged and Getting Tired: What Is in a Museum Experience', in Proceedings of the Workshop on 'Attitude, Personality and Emotions in User-Adapted Interaction', held in conjunction with the Seventh International Conference on User Modeling [UM'99], Banff, Canada, June 20-24, 1999.

    The physical experience of visiting a museum includes emotion and intellect. The way a person feels during an experience becomes an integral part of that memory. Evoking the feeling or mood at a later time may trigger details of a memory associated with it. Thus in a museum environment it is essential that personal discovering is appropriately supported providing visitors with the suitable amount of information they need, at the right time and place, and in the form that makes it the most acceptable and enjoyable. The quality of the received presentation is liable to subjective judgement. Personal taste typically overcomes objective effectiveness and can prejudice the pleasure of the visit. When designing new personal electronic guide to support museum visit, designers have to bear in mind all the known effects and be aware of others unexpected side effects. Moreover the system should monitor visitor's reactions in order to infer the effectiveness of the choices done and, if necessary, to rearrange the presentation style. This paper presents the work that is going to be done in the HIPS project to cope with the feedback-rearrangement process when emotion and involvement are directed to museum visitors.

    Ref. No. 9909-08

  • Roventini A., Alonge A., Bertagna F., Magnini B., Calzolari N., 'ItalWordNet: a large semantic database for Italian', December 1999, in Gravilidou M., Crayannis G., markantonatu S., Piperidis S., Stainhaouer G. (eds.), Proceedings of the Second International Conference on Language Resources and Evaluation [LREC-2000], Athens, Greece, May 31 - June 2, 2000, pp. 783-790.

    The focus of this paper is on the work we are carrying out to develop a large semantic database within an Italian national project, SI-TAL, aiming at realizing a set of integrated (compatible) resources and tools for the automatic processing of the Italian language. Within SI-TAL, ItalWordNet is the reference lexical resource which will contain information related to about 130,000 word senses grouped into synsets. This lexical database is not being created ex novo, but extending and revising the Italian lexical wordnet built in the framework of the EuroWordNet project. In this paper we firstly describe how the lexical coverage of our wordnet is being extended by adding adjectives, adverbs and proper nouns, plus a terminological subset belonging to the economic and financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are then illustrated. In particular we discuss: i) the new semantic relations identified to encode information on adjectives and adverbs; ii) the new architecture including the terminological subset .

    Ref. No. 9912-117

  • Huber M., Magnini B., 'Processing of Object Oriented Dialogues', in Proceedings of EUROLAN'99, Workshop on Procedures in Discourse Interpretation, Iasi, Romania, July 23-27, 1999.

In this paper we introduce object oriented dialogues (OODs), a subclass of dialogues where the communicative goal is just the retrieval of information about objects in a particular domain. OODs allow a number of computational simplifications which make them attractive for many information access applications. Starting point for this approach was the TAMIC-P prototype system, which will be briefly presented. Finally, OODs are well suited to be represented as graphic object (e.g. folders and documents), easily guiding the user through the discourse structure and at the same time through parts of the underlying knowledge base: in this way typical problems of discourse ambiguity and domain opacity can be overcome.

Ref. No. 9912-95

  • Huber M., Magnini B., 'Toward a Robust Processing of Information-Seeking Dialogues', in Proceedings of AI*IA'99, Workshop on 'Elaborazione del linguaggio e riconoscimento del parlato', Trento, Italy, December 13-14, 1999.

    The main assumption of this paper is that the interpretation of information-seeking dialogues is mainly a domain driven process, where much of the knowledge necessary for their processing can be derived from the Domain Model represantation. Following this line, some discourse structure and the recovering from error situations, are all addressed building a semantic dialogue tree which mirrors the Domain Model structure. The algorithm we propose relies on three steps: (i) first a logical form is built for a single turn, whose clauses are grouped according to their functional role in the user request; (ii) then a dialogue chain for a single turn exploiting the semantic relations among the nodes is derived from the logical form; (iii) finally, the overall dialogue tree is searched for the positions where the chain can be best appended or merged.

    Ref. No. 9912-96

  • Magnini B., Artale A., Huber M., Strapparava C., Zancanaro M., 'Efficient Natural Language Access to Databases: The TAMIC-P System', Presented at the 37th Annual Meeting of the Association for Computational Linguistics, [ACL'99], University of Maryland, USA, June 20-26, 1999.

    TAMIC-P proposes an interaction style in which direct manipulation and natural language are strictly integrated, each of them providing a communication style complementary to the other. The system has been designed to be used by a Public Administration operator during an information session with a citizen. On field evaluations demonstrated the high usability of the system even with a short training phase.

    Ref. No. 9912-97

  • Magnini B., Cavaglià G., 'Integrating Subject Field Codes into WordNet', in Proceedings of the Second International Conference on Language Resources and Evaluation, [LREC-2000], Athens, Greece, May 31 - June 2, 2000, pp. 1413-1418.

    In this paper we present a lexical resource which provides an annotation of WordNet synsets with Subject Field Codes. We discuss both the methodological issues we dealt with and the annotation techniques we implemented. A quantitative analysis of the resource coverage is reported, as well as a qualitative evaluation of the proposed annotations.

    Ref. No. 9912-98

  • Pianesi F., Pianta E., Tovena L.M., 'Comparing Methodologies for Evaluating the Generator in a Speech-to-Speech Translation System' , in Proceedings of the Seventh European Workshop on Natural Language Generation [EWNLG'99], Tolouse, France, May 13-15, 1999, pp. 145-154.

    This paper compares two formal methods for evaluating the output of generation systems. In one case humans are used for hand coding and assessment, while in the other case they also perform the task of producing the reference text. It is argued that the methodology that tries to limit the human intervention gives less accurate results.

    Ref. No. 9912-103

  • Pianesi F., Varzi A., 'Events and Event Talks: an Introduction', in Higginbotham J, Pianesi F., Varzi A. (eds.) Speaking of Events, Oxford University Press, New York, 2000.

    Ref. No. 9912-87

  • Pianta E., Tovena L.M., 'XIG: Generating from Interchange Format using Mixed Representations', in Proceedings Sixth Congress of the Italian Association for Artificial Intelligence, (AI*IA'99) , Bologna, Italy, September 14-17, 1999.

    We present the C-STAR Italian Generator (XIG), a system for generating Italian text from the interlingua content representation (Interchange Format) adopted within the C-STAR II speech to speech translation project. The constraints of the application scenario led us to develop a generator based on Mixed Representations, providing a good trade-off between flexibility and efficiency.

    Ref. No. 9912-104

  • Pianta E., Tovena L.M., 'Mixing Representation levels: the Hybrid Approach to Automatic Text Generation', in Proceedings of the AISB'99 Workshop on Reference Architectures and Data Standards for NLP, Edinburgh, Scotland, UK, April 6-9, 1999, pp. 8-13.

    Natural language generation systems (NLG) map non-linguistic representations into strings of words through a number of steps using intermediate representations of various levels of abstraction. Template based systems, by contrast, tend to use only one representation level, i.e. fixed strings, which are combined, possibly in a sophisticated way, to generate the final text. In some circumstances, it may be profitable to combine NLG and template based techniques. The issue of combining generation techniques can be seen in more abstract terms as the issue of mixing levels of representation of different degrees of linguistic abstraction. This paper aims at defining a reference architecture for systems using mixed representations. We argue that mixed representations can be used without abandoning a linguistically grounded approach to language generation.

    Ref. No. 9912-105

  • Sarini A., Strapparava C., 'User Modelling in a Museum Exploration Adaptive System', in Poster Proceedings of the Seventh International Conference on User Modeling '99 [UM'99], Banff, Canada, June 20-24, 1999.

    Ref. No. 9912-106

  • Stefani A., Strapparava C., 'One to One Marketing on the Web: How to Exploit User Modelling in the SiteIF Project', in Posters Proceedings of the 8th International World Wide Web Conference [WWW8], Toronto, Canada, May 11-14, 1999.

    Ref. No. 9912-108

  • Stefani A., Strapparava C., 'Exploiting NLP Techniques to Build User Model for WEB Sites: the Use of WordNet in SiteIF Project', in Proceedings of the 8th International World Wide Web Conference [WWW8], the Second Workshop on Adaptive Systems and User Modeling on the World Wide Web, Toronto, Canada, May 11-14, 1999.

    Ref. No. 9912-107

  • Tovena L.M., Pianta E., 'Generating felicitous sentences from underspecified semantic representations', in Proceedings 3rd International Workshop on Computational Semantics, Tilburg, The Netherlands, January 1999, pp. 410-412.

    Interchange Format is a semantic formalism used as interlingua in a multilingual speech-to-speech translation project. Underspecification, in the sense of nonspecification of aspects of the sentence semantics, plays a crucial role in easying the analysis of spontaneous speech. However, it causes difficulties when generating. We put forth solutions which rely mainly on domain knowledge.

    Ref. No. 9912-109

Technical Reports

  • Ciravegna F., Lavelli A., Satta G., 'Full Parsing Approximation, Finite-State Cascades and Grammar Organization for Information Extraction', Technical Report, November 1999, 7 pp.

    This paper proposes an approach to full parsing approximation suitable for Information Extraction from texts. Sequences of cascades of finite-state rules deterministically analyze the text, building unambiguous structures. Initially basic chunks are analyzed; then clauses are recognized and nested; finally modifier attachment is performed and the global parse tree is built. The approach has been extensively proven to work mainly for Italian, but it was also tested for English and Russian. A parser based on such approach has been implemented as part of Pinocchio, an environment for developing and running IE applications.

    Ref. No. 9911-02