Normalizer

Marker-aware Unicode string normalizer operating on in-memory atom strings.

This normalizer is marker-aware, as spectified by UTS#35. That is, it can properly handle and normalize strings that contain markers at any position, without corrupting the meaning or presence of the marker. For strings without markers, this normalizer behaves as if it were implemented without marker support.

See also

Constructors

Link copied to clipboard
constructor(data: UnicodeData)

Types

Link copied to clipboard
object Companion
Link copied to clipboard

Result of a quick check for normalization form.

Functions

Link copied to clipboard
Link copied to clipboard
fun decomposeTo(dest: MutableList<Int>, codePoint: Int, compatibility: Boolean)
fun decomposeTo(dest: MutableList<Int>, atoms: Iterable<K3StringAtom>, compatibility: Boolean)
Link copied to clipboard

Returns the quick check attribute for the given codePoint and normalization form using the NF(K)C/NF(K)D_Quick_Check property from DerivedNormalizationProps.txt.

Performs a quick check on the given atoms for the specified normalization form, as defined by Unicode UAX#15 Section 9.

Link copied to clipboard
fun reorder(atoms: MutableList<Int>)

Performs the Canonical Ordering Algorithm as defined by Section 3.11.5 in the Unicode Core Spec.