Regex.tokenize
tokenizeinput
Group: Conversions
Aliases: parse, split
Documentation
Takes an input string and returns all the matches as a Vector Text.
If the pattern contains marked groups, the values are concatenated
together; otherwise the whole match is returned. Non-participating groups
are omitted.
Arguments
input: The text to tokenize.
Examples
Split to blocks of 3 characters.
Regex.compile '...' . tokenize 'ABCDEF' == ['ABC','DEF']
Split to blocks of 3 characters taking first and third letters.
Regex.compile '(.).(.)' . tokenize 'ABCDEF' == ['AC','DF']
Split a text on any white space.
Regex.compile '(\S+)(?:\s+|$)' . tokenize 'Hello Big\r\nWide\tWorld\nGoodbye!'
== ['Hello','Big','Wide','World','Goodbye!']