Data Contamination Report from the 2024 CONDA Shared Task Paper • 2407.21530 • Published Jul 31, 2024
Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain Paper • 2404.07613 • Published Apr 11, 2024
NoticIA: A Clickbait Article Summarization Dataset in Spanish Paper • 2404.07611 • Published Apr 11, 2024
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark Paper • 2310.18018 • Published Oct 27, 2023
This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models Paper • 2310.15941 • Published Oct 24, 2023
A Common Semantic Space for Monolingual and Cross-Lingual Meta-Embeddings Paper • 2001.06381 • Published Jan 17, 2020
IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases Paper • 2304.10637 • Published Apr 20, 2023
HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine Paper • 2306.06029 • Published Jun 9, 2023
T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks Paper • 2212.10548 • Published Dec 20, 2022
Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings Paper • 2210.12623 • Published Oct 23, 2022
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction Paper • 2310.03668 • Published Oct 5, 2023