Accéder directement au contenu Accéder directement à la navigation
Communication dans un congrès

Recognizing Textual Entailment by Generality using Informative Asymmetric Measures to Summarize Ephemeral Clusters

Abstract : In the context of Ephemeral Clustering of web Pages, it can be interesting to label each cluster with a small summary instead of just a label. Within this scope, we introduce the paradigm of Textual Entailment by Generality, which can be defined as the entailment from a specific web snippet towards a more general web snippet. The subjacent idea is to find the best web snippet, which summarizes and subsumes all the other web snippets within an ephemeral cluster. To reach this objective, we first propose a new informative asymmetric similarity measure called the Simplified Asymmetric InfoSimba (AISs), which can be combined with different asymmetric association measures. In particular, the AISs proposes an unsupervised language-independent solution to infer Textual Entailment by Generality and as such can help to encounter the web snippet with maximum semantic coverage. This new methodology is tested against the first Recognizing Textual Entailment data set (RTE-1)1 for an exhaustive number of asymmetric association measures with and without the identification of Multiword Units. The comparative experiments with existing state-of-the-art methodologies show promising results.
Type de document :
Communication dans un congrès
Liste complète des métadonnées

https://hal-mines-paristech.archives-ouvertes.fr/hal-00916353
Contributeur : Claire Medrala <>
Soumis le : mardi 10 décembre 2013 - 10:48:45
Dernière modification le : jeudi 24 septembre 2020 - 16:36:01

Lien texte intégral

Identifiants

Citation

Gaël Dias, Sebastiao Pais, Katarzyna Wegrzyn-Wolska, Robert Mahl. Recognizing Textual Entailment by Generality using Informative Asymmetric Measures to Summarize Ephemeral Clusters. International Conferences on Web Intelligence and Intelligent Agent Technology, Aug 2011, Lyon, France. pp. 284 - 287, ⟨10.1109/WI-IAT.2011.122⟩. ⟨hal-00916353⟩

Partager

Métriques

Consultations de la notice

428