Skip to content

Releases: cometkim/unicode-segmenter

[email protected]

02 Sep 18:07
66e7f83
Compare
Choose a tag to compare

Patch Changes

  • 3665cf7: Fix Hindi text segmentation

[email protected]

01 Sep 03:56
c1e6464
Compare
Choose a tag to compare

Minor Changes

  • 73f5e6b: Significantly reduced bundle size by compressing data table. So the grapheme segmentation library is only takes 6.6kB (gzip) or 4.4kB (brotli)!

Patch Changes

  • b045320: Fix isSMP, and add more plane utils (isSIP, isTIP, isSSP)

[email protected]

05 Jul 05:54
03d1051
Compare
Choose a tag to compare

Patch Changes

  • 447b484: Fix polyfill to do not override existing, and also to be assigned as non-enumerable

[email protected]

14 Jun 02:26
6d02503
Compare
Choose a tag to compare

Patch Changes

  • 04fe2fc: Fix sourcemap reference error

    • Include missing sourcemap files for transformed cjs entries
    • Remove unnecessary transforms for esm entries and remove source map reference

[email protected]

13 Jun 19:29
56b3b74
Compare
Choose a tag to compare

Minor Changes

  • 657e31a: semi-breaking: removed _cat from grapheme cluster segments because it was useless

    Instead, added _catBegin and _catEnd as beginning/end category of segments, which are possibly useful to infer applied boundary rules.

[email protected]

12 Jun 17:02
2e84f7f
Compare
Choose a tag to compare

Minor Changes

  • f5ec709: Deprecated isEmoji(cp) in favor of isExtendedPictogrphic(cp).

    There are no differences, but it was confused with the \p{Emoji} Unicode property.

    (Note: \p{Emoji} is not useful in actual use cases, see)

Patch Changes

  • 5bf4d29: Fix the TypeScript definition for GraphemeCategory enum

[email protected]

16 May 03:16
954dc8e
Compare
Choose a tag to compare

Minor Changes

  • f1a43ff: Cleanup mixed use of takeCodePoint and String.prototype.codePointAt
    • grapheme: Use String.prototype.codePointAt
    • grapheme: Optimize character length checking, also reduce the size a bit
    • utils: Add isBMP and isSMP util to check a codepoint number is being BMP(Basic Multilingual Plane) range
    • utils: Deprecated takeCodePoint and takeChar in favor of ES6 String.prototype.codePointAt and String.fromCodePoint
    • utils: takeChar is no longer depends on String.fromCodePoint internally

[email protected]

12 May 02:40
ca5b4e3
Compare
Choose a tag to compare

Patch Changes

  • 03e121c: Optimize grapheme cluster boundary check

[email protected]

12 May 01:25
fd729cf
Compare
Choose a tag to compare

Minor Changes

  • 04455e0: Implement GB9c rule from Unicode® Standard Annex #29
  • f9d3dd1: Hide the internal fields of the Intl adapter to prevent auto-completion

[email protected]

07 May 04:01
9f3074d
Compare
Choose a tag to compare

Minor Changes

  • 06159a4: Fix ESM module resolution, and make ESM-first (still support CommonJS by condition)