Witllm/dataset/MNBVC.py

2024-02-04 23:48:24 +08:00
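from datasets import load_dataset

# Stream the "wikipedia" subset of MNBVC so records are fetched lazily rather than downloaded up front.
dataset = load_dataset("liwu/MNBVC", "wikipedia", split="train", streaming=True)
print(next(iter(dataset)))  # print the first record of the stream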
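
The script above prints only the first record. To look at the first few records of a lazily iterated stream, `itertools.islice` is one option. The sketch below is illustrative and not part of the original file: `take` and `fake_stream` are hypothetical names, and `fake_stream` is a local stand-in with made-up fields so the example runs without network access. It does not reflect the actual MNBVC record schema.

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def take(records: Iterable[T], n: int) -> List[T]:
    """Return up to the first n items of an iterable, reading no more than n."""
    return list(islice(records, n))


def fake_stream() -> Iterator[dict]:
    # Hypothetical stand-in for a streamed dataset: yields dicts one at a time.
    # The "id" and "text" fields are assumptions made for this illustration only.
    for i in range(1000):
        yield {"id": i, "text": f"record {i}"}


if __name__ == "__main__":
    # Only three records are pulled from the generator; the rest are never produced.
    for record in take(fake_stream(), 3):
        print(record)
```

Assuming the streamed dataset is an ordinary iterable, as the `next(iter(dataset))` call above suggests, `take(dataset, 3)` should work the same way on it.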