Data

Computational argumentation

The Dagstuhl-15512 ArgQuality Corpus, version 1
A corpus for studying the assessment of argumentation quality. The corpus contains 320 debate portal arguments, annotated for 15 different quality dimensions by three annotators. We also provide the annotation guidelines:

In case you publish any results related to the corpus, please cite our EACL 2017b publication.

 

The Webis-ArgRank-17 dataset, version 1
A benchmark dataset for studying argument relevance. The dataset contains 32 argument rankings as well a ground-truth argument graph with more than 30,000 argument units. In addition, we provide the source code to reproduce our ranking experiments based on the dataset.

In case you publish any results related to the corpus, please cite our EACL 2017a publication.

 

The Webis-Editorials-16 corpus, version 1
A corpus with 300 news editorials from three online news portals, annotated for the types of all argumentative discourse units. 

In case you publish any results related to the corpus, please cite our COLING 2016a publication.

 

The ArguAna TripAdvisor corpus, version 1 (revised)
Revised version of the corpus below. In particular, the annotated XMI files have been changed according to a new underlying type system that is more easily extendable. Notice that some adaptations of the software might be necessary to make it work with this revised version.

  

The ArguAna TripAdvisor corpus, version 1
A text corpus, which compiles 2,100 manually annotated English hotel reviews balanced with respect to the reviews’ sentiment scores. All review texts have been segmented into subsentence-level statements with a rule-based statement segmentation algorithm. Each statement has then been manually classified as a fact, a positive, or a negative opinion. Also, all hotel aspects and amenities have been manually marked as product features.

The corpus is free-to-use, but only for scientific purposes, not for commercial applications! In case you refer to the corpus in any way in your publications, please cite our CICLING 2014 paper, which presents the design and a first study of the corpus.

Impressum | Webmaster | Letzte Änderungen am : 17.07.2017