Xlsum

  • amharic
  • arabic
  • azerbaijani
  • bengali
  • burmese
  • chinese_simplified
  • chinese_traditional
  • english
  • french
  • gujarati
  • hausa
  • hindi
  • igbo
  • indonesian
  • japanese
  • kirundi
  • korean
  • kyrgyz
  • marathi
  • nepali
  • oromo
  • pashto
  • persian
  • pidgin
  • portuguese
  • punjabi
  • russian
  • scottish_gaelic
  • serbian_cyrillic
  • serbian_latin
  • sinhala
  • somali
  • spanish
  • swahili
  • tamil
  • telugu
  • thai
  • tigrinya
  • turkish
  • ukrainian
  • urdu
  • uzbek
  • vietnamese
  • welsh
  • yoruba

© Copyright 2023, IBM Research. Revision e3fb5ed7.
