unitxt.api module

unitxt.api.evaluate(predictions, data) List[Dict[str, Any]]
unitxt.api.infer(instance_or_instances, recipe, engine)
unitxt.api.load(source: SourceOperator | str) DatasetDict
unitxt.api.load_dataset(dataset_query: str | None = None, **kwargs) DatasetDict

Loads dataset.

If the ‘dataset_query’ argument is provided, then dataset is loaded from a card in local catalog based on parameters specified in the query. Alternatively, dataset is loaded from a provided card based on explicitly given parameters.

Parameters:
  • dataset_query (str, optional) – A string query which specifies dataset to load from local catalog. For example: “card=cards.wnli,template=templates.classification.multi_class.relation.default”.

  • **kwargs – Arguments used to load dataset from provided card, which is not present in local catalog.

Returns:

DatasetDict

Examples

dataset = load_dataset(

dataset_query=”card=cards.stsb,template=templates.regression.two_texts.simple,max_train_instances=5”

) # card must be present in local catalog

card = TaskCard(…) template = Template(…) loader_limit = 10 dataset = load_dataset(card=card, template=template, loader_limit=loader_limit)

unitxt.api.post_process(predictions, data) List[Dict[str, Any]]
unitxt.api.produce(instance_or_instances, recipe_query)