eComTag Dataset

Description

eComTag is collected from Chinese e-commerce websites and contains reviews and opinion tags for 50,068 items. The dataset is intended to facilitate the abstractive opinion tagging task, which aims to generate opinion tags from large volumes of item reviews.

Disclaimer

The eComTag dataset is available for non-commercial research purposes only.

  1. All reviews and tags are obtained from the Internet and are not the property of the authors or of any associated employers, entities, or institutions. We do not bear responsibility for the content or meaning of these reviews and tags.

  2. By requesting and/or using this dataset, you agree not to reproduce, duplicate, copy, sell, trade, resell, rent, or exploit for any commercial purpose any portion of the dataset or of any data derived from it.

  3. We reserve the right to terminate your access to the eComTag dataset at any time.

Download

Please use this Google form to submit your information and request access to eComTag.

Data Format

See the README.md in the downloaded file for detailed instructions. A brief overview is also provided in the GitHub repository.

The whole dataset, including the train, validation, and test splits, is saved in a single pickle file (ecomtag_dataset_preproc.p). We also provide the items grouped by domain to support future work.
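
As a rough illustration of how the pickle file might be opened, the sketch below loads it with Python's standard pickle module and prints the types found at the top level. The internal layout of the file is not described here, so the helper names (load_pickle, summarize) are hypothetical and the code makes no assumption about keys or fields. The README.md in the download remains the authoritative description. Only unpickle files obtained from a source you trust, since unpickling can execute arbitrary code.

```python
import pickle
from pathlib import Path


def load_pickle(path):
    """Load a pickle file and return the deserialized object."""
    with Path(path).open("rb") as f:
        return pickle.load(f)


def summarize(obj):
    """Describe the top-level structure without assuming a layout.

    Hypothetical helper: the real structure of the eComTag pickle is
    documented in the dataset's README.md, not here.
    """
    if isinstance(obj, dict):
        return {key: type(value).__name__ for key, value in obj.items()}
    if isinstance(obj, (list, tuple)):
        return [type(item).__name__ for item in obj]
    return type(obj).__name__


if __name__ == "__main__":
    # File name as given above; adjust the path to where you saved it.
    path = Path("ecomtag_dataset_preproc.p")
    if path.exists():
        data = load_pickle(path)
        print(summarize(data))
```

Inspecting the top-level types first is a cheap way to find out how the splits are stored before writing any task-specific loading code.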

Paper

Abstractive Opinion Tagging

Qintong Li, Piji Li, Xinyi Li, Zhaochun Ren, Zhumin Chen, and Maarten de Rijke.

@inproceedings{li2020aot,
  title={Abstractive Opinion Tagging},
  author={Li, Qintong and Li, Piji and Li, Xinyi and Ren, Zhaochun and Chen, Zhumin and de Rijke, Maarten},
  booktitle={WSDM},
  year={2021}
}

Contact

Please reach out to qtleo@outlook.com for any questions about the dataset.

Qintong Li
PhD student

My research interests include building machine learning models for open-ended text generation and commonsense reasoning.