Manage cookies that are used for advertising, such as ad personalization, remarketing, and ad effectiveness analysis.
Germany 100k.zip -
This dataset typically contains extracted from German Wikipedia . It is widely used by researchers for tasks such as:
While exact versions vary (such as the dataset hosted on Hugging Face ), these files generally include: Germany 100k.zip
: Identifying specific locations, organizations, or names within German-language text. Dataset Composition Germany 100k.zip
: These datasets often represent millions of individual word tokens, making them suitable for training small-to-medium scale language models. Germany 100k.zip