Word Snapper

10/28/25

Ensures that text annotations align to word boundaries. Highlights and selections then always cover whole words, which keeps them consistent and readable and prevents awkward partial-word selections.

Basic Usage

import { createAnnotatedText, WordSnapper } from '@ghentcdh/annotated-text';

// Create an annotated text with word snapping enabled
createAnnotatedText(id, {
  annotation: {
    snapper: new WordSnapper(),
  },
})
  .setText(text)
  .setAnnotations(annotations);

How It Works

The WordSnapper operates in two phases:

1. Initialization

When text is provided via setText(), the snapper:

Tokenizes the text into words
Records each token's start and end positions
Builds boundary maps for efficient lookups
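
The initialization steps above can be sketched roughly as follows. This is a minimal illustration, not the library's implementation: `WordToken`, `tokenizeWords`, and `buildBoundaryMaps` are hypothetical names, and a word is assumed to be any run of non-whitespace characters.

```typescript
// Hypothetical token shape: [start, end) character offsets of one word.
interface WordToken {
  text: string;
  start: number;
  end: number;
}

// Assumption: a "word" is any run of non-whitespace characters.
function tokenizeWords(text: string): WordToken[] {
  const tokens: WordToken[] = [];
  const pattern = /\S+/g;
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(text)) !== null) {
    tokens.push({
      text: match[0],
      start: match.index,
      end: match.index + match[0].length,
    });
  }
  return tokens;
}

// Map every character offset inside a word to that word's start and end,
// so that later lookups do not need to scan the token list.
function buildBoundaryMaps(tokens: WordToken[]) {
  const startOf: Record<number, number> = {};
  const endOf: Record<number, number> = {};
  for (const token of tokens) {
    for (let i = token.start; i < token.end; i++) {
      startOf[i] = token.start;
      endOf[i] = token.end;
    }
  }
  return { startOf, endOf };
}

const tokens = tokenizeWords("hello brave world");
const maps = buildBoundaryMaps(tokens);
```

Precomputing the maps trades a little memory for constant-time lookups each time an annotation is created or modified.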

2. Snapping

When an annotation is created or modified, fixOffset():

Snaps the start position to the beginning of its containing word
Snaps the end position to the end of its containing word
Ensures the resulting range is valid (start < end)
Returns the adjusted boundaries
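
The snapping steps above can be illustrated with a small sketch. It is not the library's `fixOffset()`: `WordRange` and `snapToWords` are hypothetical names, end offsets are assumed to be exclusive, offsets that fall in whitespace are left unchanged, and throwing on an invalid range is an assumption about how the validity check might be enforced.

```typescript
// Hypothetical word range: [start, end) character offsets.
interface WordRange {
  start: number;
  end: number;
}

// Snap a selection outward to the boundaries of the words it touches.
function snapToWords(
  words: WordRange[],
  start: number,
  end: number,
): { start: number; end: number } {
  let snappedStart = start;
  let snappedEnd = end;
  for (const word of words) {
    // Start lies inside this word: move it to the word's beginning.
    if (start >= word.start && start < word.end) snappedStart = word.start;
    // End lies inside this word: move it to the word's end.
    if (end > word.start && end <= word.end) snappedEnd = word.end;
  }
  // Guard against an empty or inverted range (start must be < end).
  if (snappedStart >= snappedEnd) {
    throw new RangeError(`invalid range: ${snappedStart}..${snappedEnd}`);
  }
  return { start: snappedStart, end: snappedEnd };
}

// Word ranges for the text "hello brave world".
const words: WordRange[] = [
  { start: 0, end: 5 },
  { start: 6, end: 11 },
  { start: 12, end: 17 },
];

// A selection covering "llo br" expands to "hello brave".
const snapped = snapToWords(words, 2, 8);
```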

Custom Tokenization

A custom tokenization function can be provided to the WordSnapper to handle specific text structures or languages.

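import Tokenizr from 'tokenizr';

new WordSnapper((text: string): Token[] => {
  const lexer = new Tokenizr();

  // Ignore word boundaries
  lexer.rule(/†/, (ctx) => {
    ctx.accept("start");
  });
  // Assumed fallback rules (not in the original example); Tokenizr throws on unmatched input
  lexer.rule(/[^\s†]+/, (ctx) => ctx.accept("word"));
  lexer.rule(/\s+/, (ctx) => ctx.ignore());

  // Assumption: the snapper accepts the tokens Tokenizr produces as Token[]
  lexer.input(text);
  return lexer.tokens();
});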
