This repository contains data presented in our LREC-COLING 2024 paper CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization.
Please find the dataset on HF hub.
Thank you for checking out our dataset. If you have used our data in your work, please consider using the reference below.
@inproceedings{zhang-eickhoff-2024-crocosum,
title = "{C}ro{C}o{S}um: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization",
author = "Zhang, Ruochen and Eickhoff, Carsten",
booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
month = may,
year = "2024",
address = "Torino, Italia",
publisher = "ELRA and ICCL",
url = "https://aclanthology.org/2024.lrec-main.367/",
pages = "4113--4126",
}