If the code page is
|
And the characters are
|
The characters behave (by default)
|
Double byte
|
Single byte
|
Depending on whether they are alphabetic or nonalphabetic. This is specified in the code page's character-attribute table.To change the default word-break behavior, supply a word-break table input file.
|
Double byte
|
Double byte
|
As separate words.
|
UTF-8
|
Single byte
|
Depending on whether they are alphabetic or nonalphabetic. This is specified in the code page's character-attribute table.To change the default word-break behavior, supply a word-break table input file.
|
UTF-8
|
Two-byte UTF-8
|
Corresponding to the USE_IT word-delimiter attribute.
|
UTF-8
|
Three- and four-byte UTF-8
|
As separate words.
|