Bibliography on Texts/Textes (2017-06-06)
Jérôme Euzenat, L'annotation formelle de documents en (8) questions, in: Régine Teulier, Jean Charlet, Pierre Tchounikine (éds), Ingénierie des connaissances, L'Harmattan, Paris (FR), 2005, pp251-271
Annoter un ensemble de documents informels à l'aide de représentations formelles appelle plusieurs questions qui doivent trouver une réponse si l'on veut développer un système cohérent. Ces questions sont liées à la forme et à l'objet des représentations retenues, à la nécessité d'utiliser de la connaissance indépendante du contenu des documents (ontologies, connaissance de contexte) et au statut du système résultant (grande base de connaissance ou éléments de connaissance distribués). Ces questions sont décrites et illustrées par l'annotation de résumés d'articles en génétique moléculaire.
Web sémantique, recherche de documents par le contenu, annotation formelle, représentation du contenu, ontologie, connaissance de contexte
Jean-François Baget, Simple conceptual graphs revisited: hypergraphs and conjunctive types for efficient projection algorithms, in: Proc. 11th international conference on conceptual sructures (ICCS), Dresden (DE), (Aldo De Moor, Wilfried Lex, Bernhard Ganther (eds), Conceptual structures for knowledge creation and communication, Lecture notes in computer science 2746, 2003), pp229-242, 2003
Simple Conceptual Graphs (SGs) form the cornerstone for the "Conceptual Graphs" family of languages. In this model, the subsumptio operation is called projection; it is a labelled graphs homomorphism (a NP-hard problem). Designing efficient algorithms to compute projections between two SGs is thus of uttermost importance for the community building languages on top of this basic model. This paper presents some such algorithms, inspired by those developped for Constraint Satisfaction Problems. In order to benefit from the optimization work done in this community, we have chosen to present an alternate version of SGs, differences being the definition of these graphs as hypergraphs and the use of conjunctive types.
Jérôme Euzenat, Eight questions about semantic web annotations, IEEE Intelligent systems 17(2):55-62, 2002
Improving information retrieval is annotation¹s central goal. However, without sufficient planning, annotation - especially when running a robot and attaching automatically extracted content - risks producing incoherent information. The author recommends answering eight questions before you annotate. He provides a practical application of this approach, and discusses applying the questions to other systems.
Semantic web, Search by content, Content representation, Ontology, Background knowledge
Farid Cerbah, Jérôme Euzenat, Traceability between models and texts through terminology, Data and knowledge engineering 38(1):31-43, 2001
Modeling often concerns the translation of informal texts into representations. This translation process requires support for itself and for its traceability. We pretend that inserting a terminology between informal textual documents and their formalization can help to serve both of these goals. Modern terminology extraction tools support the formalization process by using terms as a first sketch of formalized concepts. Moreover, the terms can be employed for linking the concepts and the textual sources. They act as a powerful navigation structure. This is exemplified through the presentation of a fully implemented system.
Terminology extraction, Traceability, Model generation, Hypertext, Object-oriented modeling, Natural language
Jérôme Euzenat, L'annotation formelle de documents en huit (8) questions, in: Jean Charlet (éd), Actes 6e journées sur ingénierie des connaissances (IC), Grenoble (FR), pp95-110, 2001
Annoter un ensemble de documents informels à l'aide de représentations formelles appelle plusieurs questions qui doivent trouver une réponse si l'on veut développer un système cohérent. Ces questions sont liées à la forme et à l'objet des représentations retenues, à la nécessité d'utiliser de la connaissance indépendante du contenu des documents (ontologies, connaissance de contexte) et au statut du système résultant (grande base de connaissance ou éléments de connaissance distribués). Ces questions sont décrites et illustrées par la tentative d'annotation de résumés d'articles en génétique moléculaire.
Web sémantique, recherche par le contenu, annotation formelle, représentation du contenu, ontologie, connaissance de contexte
Farid Cerbah, Jérôme Euzenat, Using terminology extraction techniques for improving traceability from formal models to textual requirements, in: Proc. 5th international conference on applications of natural language to information systems (NLDB), Versailles (FR), (Mokrane Bouzeghoub, Zoubida Kedad, Élisabeth Métais (eds), Natural Language Processing and Information Systems, Lecture notes in computer science 1959, 2001), pp115-126, 2000
This article deals with traceability in sotfware engineering. More precisely, we concentrate on the role of terminological knowledge the mapping between (informal) textual requirements and (formal) object models. We show that terminological knowledge facilitates production of traceability links, provided that language processing technologies allow to elaborate semi-automatically the required terminological resources. The presented system is one step towards incremental formalization from textual knowledge.
XML, terminology, knowledge extraction
Farid Cerbah, Jérôme Euzenat, Integrating textual knowledge and formal knowledge for improving traceability, in: Proc. ECAI workshop on Knowledge Management and Organizational Memory, Berlin (DE), pp10-16, 2000
This article deals with traceability in knowledge repositories. More precisely, we concentrate on the role of terminological knowledge in the mapping between (informal) textual requirements and (formal) object models. We show that terminological knowledge facilitates the production of traceability links, provided that language processing technologies allow to elaborate semi-automatically the required terminological resources. The presented system is one step towards incremental formalization from textual knowledge. As such, it is a valuable tool for building knowledge repositories.
XML, terminology, knowledge extraction
Farid Cerbah, Jérôme Euzenat, Integrating textual knowledge and formal knowledge for improving traceability, in: Proc. 12th international conference on knowledge engineering and knowledge management (EKAW), Juan-les-Pins (FR), (Rose Dieng, Olivier Corby (eds), Knowledge engineering and knowledge management: methods, models and tools, Lecture notes in computer science 1937, 2000), pp296-303, 2000
Knowledge engineering often concerns the translation of informal knowledge into a formal representation. This translation process requires support for itself and for its We pretend that inserting a terminological structure between informal textual documents and their formalization serves both of these goals. Modern terminology extraction tools support the process where the terms are a first sketch of formalized concepts. Moreover, the terms can be used for linking the concepts and the pieces of texts. This is exemplified through the presentation of an implemented system.
XML, terminology, knowledge extraction
Jérôme Euzenat, Vers une plate-forme de diffusion de textes sur internet : étude préliminaire, Rapport de conseil, 63p., juin 2000