GitXplorerGitXplorer
j

MLSUM-Catalan

public
2 stars
0 forks
0 issues

Commits

List of commits on branch main.
Unverified
6cda166de1d6aa159dbb29cbad402471acaae8bb

Remove metadata and until we can futher ivnest on this

jjordimas committed 3 years ago
Unverified
4dc89b3948e9261e100811f6e81eea7b3847faa4

Fix typo

jjordimas committed 3 years ago
Unverified
1ae0114c07112307ce115cb27eea46e89396a899

Add creator

jjordimas committed 3 years ago
Unverified
fd5c98591f919edf587813c383e1e605b97e139e

Add Creator

jjordimas committed 3 years ago
Unverified
06ded93450cca6a77038d1bd75efb555a9b18fa7

Add meta dataset information

jjordimas committed 3 years ago
Unverified
01f3479e6174af2bf760f45aff925ea5cdecde2d

MOre entries

jjordimas committed 3 years ago

README

The README file for this repository.

MLSUM-Catalan

A Catalan corpus based on https://github.com/recitalAI/MLSUM concepts.

Original context is from Vilaweb licensed under Attribution-NonCommercial-NoDerivs which allows sharing.

Files:

  • URLs used at urls/train.ca.txt.urls
  • Text and summaries: processed/ca_train.txt (2678 entries)

The text and summaries are in the same format that MLSum corpus (tab separated).