augment_whitespace_prefix_and_suffix_task_input
Note
ID: catalog.augmentors.augment_whitespace_prefix_and_suffix_task_input | Type: AugmentPrefixSuffix
{
"augment_task_input": true,
"prefix_len": 5,
"prefixes": {
"": 30,
" ": 20,
"\\n": 40,
"\\t": 10
},
"remove_existing_whitespaces": true,
"suffix_len": 5,
"suffixes": {
"": 30,
" ": 20,
"\\n": 40,
"\\t": 10
},
"type": "augment_prefix_suffix"
}
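As a rough illustration of how the weights above can be read, the sketch below converts the `prefixes` dictionary into selection probabilities. It assumes the weights are relative and are normalized by their sum; the variable names are illustrative only and are not part of the catalog entry.

```python
# Weights copied from the "prefixes" entry above. The Python literal "\\n"
# is the two-character string that the JSON text "\\n" decodes to.
prefixes = {"": 30, " ": 20, "\\n": 40, "\\t": 10}

# Assumption: relative weights are normalized by their total.
total = sum(prefixes.values())
probabilities = {pattern: weight / total for pattern, weight in prefixes.items()}

for pattern, prob in probabilities.items():
    print(f"{pattern!r}: {prob:.0%}")
```

Under that assumption, the empty pattern is chosen 30% of the time, a space 20%, the newline pattern 40%, and the tab pattern 10%.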
Explanation about AugmentPrefixSuffix
Augments the input by prepending and appending randomly selected (typically whitespace) patterns to it.
- Args:
- prefixes, suffixes (list or dict): the potential (typically whitespace) patterns to select from.
The dictionary form allows specifying relative weights for the different patterns.
- prefix_len, suffix_len (positive int): the added prefix or suffix consists of
prefix_len or suffix_len repetitions, respectively, of the randomly selected patterns.
- remove_existing_whitespaces: allows first cleaning any existing leading and trailing whitespace.
The strings made of repetitions of the selected pattern(s) are then prepended and/or appended to the potentially trimmed input.
If only one of prefixes/suffixes is needed, set the other to None.
- Examples:
To prepend the input with a prefix made of 4 '\n'-s or '\t'-s, employ
AugmentPrefixSuffix(augment_model_input=True, prefixes=['\n','\t'], prefix_len=4, suffixes=None)
To append the input with a suffix made of 3 '\n'-s or '\t'-s, with triple '\n' suffixes preferred over triple '\t' at a 2:1 ratio, employ
AugmentPrefixSuffix(augment_model_input=True, suffixes={'\n':2,'\t':1}, suffix_len=3, prefixes=None)
which will append '\n'-s twice as often as '\t'-s.
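The behavior described in this section can be approximated in plain Python. The following is a minimal sketch, not the library's implementation: `sample_affix` and `augment_sketch` are hypothetical helper names, the default lengths are arbitrary, and the sketch assumes each repetition is drawn independently according to the weights.

```python
import random


def sample_affix(patterns, length, rng):
    """Join `length` weighted draws from `patterns` (hypothetical helper)."""
    if not patterns:
        # None (or an empty collection) means no prefix/suffix is added.
        return ""
    if isinstance(patterns, dict):
        choices, weights = list(patterns.keys()), list(patterns.values())
    else:
        # A plain list is treated as uniformly weighted.
        choices, weights = list(patterns), None
    # Assumption: every repetition is sampled independently.
    return "".join(rng.choices(choices, weights=weights, k=length))


def augment_sketch(text, prefixes=None, suffixes=None,
                   prefix_len=3, suffix_len=3,  # arbitrary defaults for this sketch
                   remove_existing_whitespaces=False, rng=None):
    """Prepend and append sampled patterns to `text` (hypothetical helper)."""
    rng = rng or random.Random()
    if remove_existing_whitespaces:
        # Trim leading and trailing whitespace before adding the new affixes.
        text = text.strip()
    prefix = sample_affix(prefixes, prefix_len, rng)
    suffix = sample_affix(suffixes, suffix_len, rng)
    return prefix + text + suffix


# Usage mirroring the two examples above.
rng = random.Random(0)
prefixed = augment_sketch("  hello \n", prefixes=["\n", "\t"], prefix_len=4,
                          suffixes=None, remove_existing_whitespaces=True, rng=rng)
suffixed = augment_sketch("hi", prefixes=None, suffixes={"\n": 2, "\t": 1},
                          suffix_len=3, rng=rng)
print(repr(prefixed))
print(repr(suffixed))
```

Note that when the empty string is among the patterns, as in the catalog entry above, a draw of it contributes no characters, so in this sketch the resulting prefix or suffix can be shorter than `prefix_len` or `suffix_len` characters.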