unitxt.dataset module

class unitxt.dataset.Dataset(cache_dir: str | None = None, dataset_name: str | None = None, config_name: str | None = None, hash: str | None = None, base_path: str | None = None, info: DatasetInfo | None = None, features: Features | None = None, token: bool | str | None = None, use_auth_token='deprecated', repo_id: str | None = None, data_files: str | list | dict | DataFilesDict | None = None, data_dir: str | None = None, storage_options: dict | None = None, writer_batch_size: int | None = None, name='deprecated', **config_kwargs)

Bases: GeneratorBasedBuilder

TODO: Short description of my dataset.

VERSION = 1.4.6
property generators
unitxt.dataset.fetch(artifact_name)
unitxt.dataset.get_dataset_artifact(dataset_str)
unitxt.dataset.parse(query: str)

Parses a query of the form ‘key1=value1,key2=value2,…’ into a dictionary.