Search Results
OSCAR dataset
[https://oscar-project.org/] - - public:mzimmerm
The OSCAR project (Open Super-large Crawled Aggregated coRpus) is an open-source project that provides web-based multilingual resources and datasets for machine learning (ML) and artificial intelligence (AI) applications.
Newspeak-test-dataset
[https://www.kaggle.com/datasets/mzimmerm/newspeak-test-dataset?select=NewspeakGrammar.ns] - - public:mzimmerm
The dataset is just a zip archive of files.
Introduction to Constructing Your Dataset | Machine Learning | Google for Developers
[https://developers.google.com/machine-learning/data-prep/construct/construct-intro] - - public:mzimmerm