A pure-LUA implementation of UTF-16 decoding with little-endian ordering.
Signature |
cp.utf16.le.char(...) -> string |
Type |
Function |
Description |
Receives zero or more integers, converts each one to its corresponding UTF-16 byte sequence and returns a string with the concatenation of all these sequences. |
Parameters |
... - The list of UCL codepoint integers to convert.
|
Returns |
- All the codepoints converted to UTF-16, concatonated into a string.
|
Signature |
cp.utf16.le.codepoint(s [, i [, j]]) -> integer... |
Type |
Function |
Description |
Returns the codepoints (as integers) from all characters in s that start between byte position i and j (both included). The default for i is 1 and for j is i . It raises an error if it meets any invalid byte sequence. |
Parameters |
s - The stringi - The starting index. Defaults to 1 .j - The ending index. Defaults to i .
|
Returns |
- a list of codepoint integers for all characters in the matching range.
|
Signature |
cp.utf16.le.codes(s) -> iterator |
Type |
Function |
Description |
Returns values so that the construction |
Parameters |
s - The string to iterate through.
|
Returns |
|
| Signature | cp.utf16.len (bigEndian, s [, i [, j]]) -> number | boolean, number
|
| -----------------------------------------------------|---------------------------------------------------------------------------------------------------------|
| Type | Function |
| Description | Returns the number of UTF-16 characters in string s
that start between positions i
and j
(both inclusive). The default for i
is 1 and for j
is -1. If it finds any invalid byte sequence, returns a false value plus the position of the first invalid byte. |
| Parameters |
bigEndian
- If true, the string is 'big-endian'.s
- The UTF-16 stringi
- The starting index. Defaults to 1
.j
- The ending index. Defaults to -1
.
|
| Returns | - the length, or
false
and the first invalid byte index.
|
| Signature | cp.utf16.le.len (s [, i [, j]]) -> number | boolean, number
|
| -----------------------------------------------------|---------------------------------------------------------------------------------------------------------|
| Type | Function |
| Description | Returns the number of UTF-16 characters in string s
that start between positions i
and j
(both inclusive). The default for i
is 1 and for j
is -1. If it finds any invalid byte sequence, returns a false value plus the position of the first invalid byte. |
| Parameters |
s
- The UTF-16 stringi
- The starting index. Defaults to 1
.j
- The ending index. Defaults to -1
.
|
| Returns | - the length, or
false
and the first invalid byte index.
|
Signature |
cp.utf16.le.offset (s, n [, i]) -> number |
Type |
Function |
Description |
Returns the position (in bytes) where the encoding of the n -th character of s (counting from position i ) starts. A negative n gets characters before position i . The default for i is 1 when n is non-negative and #s + 1 otherwise, so that utf8.offset(s, -n) gets the offset of the n -th character from the end of the string. If the specified character is neither in the subject nor right after its end, the function returns nil. |
Parameters |
s - The stringn - The character number to find.i - The initial position to start from.
|
Returns |
|